The graphical representation method, Robust CoPlot, is a robust variant of the classical CoPlot method. CoPlot is an adaptation of multidimensional scaling (MDS), and is a practical tool for visual inspection and rich interpretation of multivariate data. CoPlot enables presentation of a multidimensional dataset in a two dimensions, in a manner that relations between both variables and observations to be analyzed together. It has also been used as a supplemental tool to cluster analysis, data envelopment analysis (DEA) and outlier detection methods in the literature. However, this method is very sensitive to outliers. When a multidimensional dataset contains outliers, this can lead to undesirable consequences such as the inaccurate representation of the variables. The motivation is to produce Robust CoPlot that is not unduly affected by outliers. In this study, we have presented a new MATLAB package RobCoP for generating robust graphical representation of a multidimensional dataset. This study serves a useful purpose for researchers studying the implementation of Robust CoPlot method by providing a description of the software package RobCoP; it also offers some limited information on the Robust CoPlot analysis itself. The package presented here has enough flexibility to allow a user to select an MDS type and vector correlation method to produce either classical or Robust CoPlot results.
Multivariate models such as the Cox regression model, if developed carefully, are powerful tools for making prognostic prediction which are frequently used in studies of clinical outcomes. Many applications require a large number of variables to be modelled by using a relatively small patient sample. Determination of the important variables in a model is critical to understand the behaviour of phenomena as the independent variables contribute the most to the outcome. From a practical perspective, a small subset of independent variables are usually selected from a large data set without the loss of any predictive efficiency. Automatic variable selection algorithms in scientific studies are commonly used for obtaining interpretable and practically applicable models. However, the careless use of these methods may lead to statistical problems. The performance of the generated models may be poor due to the violation of assumption, omission of the important variables, problems of overfitting, and the problem of multicollinearity and outliers. In order to enhance the accuracy of a model, it is essential to explore the data and its main characteristics before making any statistical inference. This study suggests an approach for acquiring a trustworthy model selection procedure for survival data by performing classical variables selection methods, accompanied by a graphical visualization method, namely robust coplot. Thus, it enables us to investigate the discrimination of observations, clusters of the variables and clusters of the observations that are highly characterized by a particular variable in a one graph. We present an application of combined method, as an integral part of statistical modelling, on survival data on multiple myeloma to show how coplot results are used in automatic variable selection algorithm in Cox regression model-building.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.