2018
DOI: 10.1007/s11634-018-0331-4
|View full text |Cite
|
Sign up to set email alerts
|

Assessing trimming methodologies for clustering linear regression data

Abstract: We assess the performance of state-of-the-art robust clustering tools for regression structures under a variety of different data configurations. We focus on two methodologies that use trimming and restrictions on group scatters as their main ingredients. We also give particular care to the data generation process through the development of a flexible simulation tool for mixtures of regressions, where the user can control the degree of overlap between the groups. Level of trimming and restriction factors are i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 38 publications
0
8
0
Order By: Relevance
“…The original solution in TCLUST-REG was to fix a X in advance, although there is no established indication of the link between this proportion and the breakdown properties of the methodology. Torti et al (2018) have proposed to select a X adaptively from the data using a multivariate outlier detection procedure in the space of the explanatory variables. The observations surviving to the two trimming steps are then used for updating the regression coefficients, weights and scatter matrices.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The original solution in TCLUST-REG was to fix a X in advance, although there is no established indication of the link between this proportion and the breakdown properties of the methodology. Torti et al (2018) have proposed to select a X adaptively from the data using a multivariate outlier detection procedure in the space of the explanatory variables. The observations surviving to the two trimming steps are then used for updating the regression coefficients, weights and scatter matrices.…”
Section: Methodsmentioning
confidence: 99%
“…This modification of the algorithm is usually referred in the literature as adaptive TCLUST-REG. Torti et al (2018) have also assessed how the performances of TCWM change in presence of possible misspecification of the distribution of the explanatory variables. Their experience is that the superior performance of TCWM can degenerate if the explanatory variable distribution is miss-specified and in this case the TCLUST-REG solution is preferable.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…WEM and WCEM can be initialized by subsampling (Markatou et al 1998 ; Neykov and Müller 2003 ; Neykov et al 2007 ; Torti et al 2019 ). A subsample of size is selected randomly from the data sample; then, the model is fitted to these observations by the classical EM (or CEM) algorithm to get a trial estimate.…”
Section: A Weighted Likelihood Em-type Algorithmmentioning
confidence: 99%
“…In particular, TCLUST-REG is characterized by group scatter constraints aimed at making the mixture fitting a well-posed problem and the addition of a second trimming step to mitigate the effect of outliers in the space of explanatory variables acting as leverage points. A very recent adaptive version of TCLUST-REG has been discussed in Torti et al (2019). An alternative approach meant to automatically take into account leverage points has been considered by García-Escudero et al (2017) where trimming and restrictions have been introduced to get a robust version of the cluster weighted model, named Trimmed Clustered Weighted Restricted Model (TCWRM).…”
Section: Introductionmentioning
confidence: 99%