2022
DOI: 10.4171/msl/27
|View full text |Cite
|
Sign up to set email alerts
|

Distribution-free robust linear regression

Abstract: We study random design linear regression with no assumptions on the distribution of the covariates and with a heavy-tailed response variable. In this distribution-free regression setting, we show that boundedness of the conditional second moment of the response given the covariates is a necessary and sufficient condition for achieving non-trivial guarantees. As a starting point, we prove an optimal version of the classical in-expectation bound for the truncated least squares estimator due to Györfi, Kohler, Kr… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4

Relationship

3
5

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 104 publications
0
9
0
Order By: Relevance
“…For the recent use of midpoint procedures in statistical literature, see, for example, (Mendelson, 2019;Bousquet and Zhivotovskiy, 2021;Mourtada, Vaškevičius, and Zhivotovskiy, 2022). Since f (mid) outputs 2-sparse convex combinations of elements of the dictionary G, similarly to the above analysis of Audibert's star algorithm, it is enough to establish that f (mid) satisfies the offset condition.…”
Section: Model Selection Aggregationmentioning
confidence: 99%
See 2 more Smart Citations
“…For the recent use of midpoint procedures in statistical literature, see, for example, (Mendelson, 2019;Bousquet and Zhivotovskiy, 2021;Mourtada, Vaškevičius, and Zhivotovskiy, 2022). Since f (mid) outputs 2-sparse convex combinations of elements of the dictionary G, similarly to the above analysis of Audibert's star algorithm, it is enough to establish that f (mid) satisfies the offset condition.…”
Section: Model Selection Aggregationmentioning
confidence: 99%
“…data (e.g., models arising from different tuning parameters, or different statistical estimators) is a fundamental problem in statistics. At the same time, deviation-optimal model selection aggregation procedures have been used to construct computable procedures (not necessarily computationally efficient) to demonstrate the achievability of some statistical minimax lower bounds (see, e.g., (Rakhlin, Sridharan, and Tsybakov, 2017;Mendelson, 2019;Mourtada, Vaškevičius, and Zhivotovskiy, 2022)).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…We also show a high-probability excess risk bound for online to batch conversion of an online regression algorithm in the bounded setup, which is to say that both the feature vectors and derivatives of the losses are bounded. It was previously shown by Mourtada et al (2022) that online to batch converted versions of the optimal Vovk-Azoury-Warmuth forecaster (Vovk, 2001;Azoury and Warmuth, 2001) have constant excess risk with constant probability.…”
Section: Contributions and Outlinementioning
confidence: 99%
“…For a recent survey with focus on multivariate mean estimation, we refer to [41]. The central ideas behind the robust mean estimation found their applications in many related problems such as regression [28,9,43,19,46,52], covariance estimation [15,16,49,53,26,51], and clustering [33]. For related results in the context of covariance estimation for heavy-tailed distributions, we refer to the recent survey [32].…”
Section: Introductionmentioning
confidence: 99%