2020
DOI: 10.1016/j.csda.2020.106944
|View full text |Cite
|
Sign up to set email alerts
|

Cellwise robust M regression

Abstract: The cellwise robust M regression estimator is introduced as the first estimator of its kind that intrinsically yields both a map of cellwise outliers consistent with the linear model, and a vector of regression coefficients that is robust against vertical outliers and leverage points. As a by-product, the method yields a weighted and imputed data set that contains estimates of what the values in cellwise outliers would need to amount to if they had fit the model. The method is illustrated to be equally robust … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 23 publications
(11 citation statements)
references
References 22 publications
(26 reference statements)
0
11
0
Order By: Relevance
“…There is already a generalization in the literature from SVD13 to MI (Arciniegas‐Alarcón et al., 2014b), so this generalization could be studied with the robust extensions recommended here using other parameters of interest in multi‐environment experiments (Yan, 2014). Recent developments in statistics, such as the PCA, allowing for missingness and cellwise and row‐wise outliers, macroPCA (Hubert et al., 2019), and the cellwise robust M regression (Filzmoser et al., 2020) can be compared with the results obtained here, but in matrices and multivariate situations.…”
Section: Discussionmentioning
confidence: 80%
“…There is already a generalization in the literature from SVD13 to MI (Arciniegas‐Alarcón et al., 2014b), so this generalization could be studied with the robust extensions recommended here using other parameters of interest in multi‐environment experiments (Yan, 2014). Recent developments in statistics, such as the PCA, allowing for missingness and cellwise and row‐wise outliers, macroPCA (Hubert et al., 2019), and the cellwise robust M regression (Filzmoser et al., 2020) can be compared with the results obtained here, but in matrices and multivariate situations.…”
Section: Discussionmentioning
confidence: 80%
“…Our statement is however correct and in line with the pessimistic BDP perspective which only considers worst cases, i.e., harmful outliers that, depending on the concrete application and BDP notion, may not even be likely in practice. When having simulated data and a random cell-wise outlier scheme, it is extremely unlikely that such columnwise outliers that make the true model irretrievable would appear, so this situation should at most very rarely occur in a random simulation as for example done in Filzmoser et al [2020a] where first a set of instances is selected and for each of these instances, a random fraction of cells are contaminated. We do not dare to make a definite statement whether column-wise outliers can occur in practice, but, for example, Filzmoser et al [2020a] identify the column values with measurements made by a specific sensor, so if this sensor is corrupted across a whole study, one indeed would have column-wise outliers.…”
Section: Evidently It Turns Out That the Fraction Of Cell-wise Outlie...mentioning
confidence: 99%
“…As already pointed out, even in the presence of a clean regressor matrix, no regression (or other supervised machine learning method) procedure has a chance to infer the correct model if too many responses are contaminated. Even worse, if the model cannot be inferred, the contaminated responses cannot be suitably imputed in contrast to outlying cells as for example in Filzmoser et al [2020a].…”
Section: Proof I)+ii) Obvious and Already Partially Discussed Earliermentioning
confidence: 99%
See 1 more Smart Citation
“…Methods assuming a cellwise contamination are, however, still not that well investigated. For example, in a regression setting assuming n > p, MM regression was considered in Filzmoser, Höppner, Ortner, Serneels, and Verdonck (2020), a three step procedure based on S-estimators in Leung, Zhang, and Zamar (2016) shooting S-estimator in Öllerer, Alfons, and Croux (2016), combining ideas from simple S-regression with the 'shooting algorithm', which is a coordinate descent algorithm. The shooting S-estimator was recently extended to the high-dimensional setting in Bottmer, Croux, and Wilms (2020).…”
Section: Cellwise Contaminationmentioning
confidence: 99%