2013
DOI: 10.1214/12-aos1077
|View full text |Cite
|
Sign up to set email alerts
|

Valid post-selection inference

Abstract: It is common practice in statistical data analysis to perform data-driven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides for tests and confidence intervals when the model has been chosen a priori. We propose to produce valid ``post-selection inference'' by reducing the problem to one of simultaneous inference and hence suitably widening conventional confidence and retention intervals. Simultan… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

9
721
1

Year Published

2013
2013
2022
2022

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 500 publications
(735 citation statements)
references
References 34 publications
9
721
1
Order By: Relevance
“…In general, the challenge in using adaptive methods as the basis for valid statistical inference is that selection bias can be difficult to quantify; see Berk et al [2013], Chernozhukov et al [2015], Taylor and Tibshirani [2015], and references therein for recent advances. In this paper, pairing "honest" trees with the subsampling mechanism of random forests enabled us to accomplish this goal in a simple yet principled way.…”
Section: Discussionmentioning
confidence: 99%
“…In general, the challenge in using adaptive methods as the basis for valid statistical inference is that selection bias can be difficult to quantify; see Berk et al [2013], Chernozhukov et al [2015], Taylor and Tibshirani [2015], and references therein for recent advances. In this paper, pairing "honest" trees with the subsampling mechanism of random forests enabled us to accomplish this goal in a simple yet principled way.…”
Section: Discussionmentioning
confidence: 99%
“…31 and 32. In Berk et al (32), an intuitive and practical strategy has been proposed, but it fails when the dimension of the variables is high. To our knowledge, there is no clear solution in statistics for post-model-selection inference problem for high-dimensional data; it is an interesting problem that may warrant future investigation.…”
Section: Methodsmentioning
confidence: 99%
“…This fundamental shift in focus allows us to circumvent, without contradicting, the impossibility results of Leeb and Pötscher. Valid post-selection inference has attracted considerable attention during the preparation of this paper: in contexts and with methods quite different from ours, contributions have been made by Belloni, Chernozhukov, and Wei (2013), Berk, Brown, Buja, Zhang, and Zhao (2013), Zhang and Zhang (2014), Efron (2014), van de Geer, Buhlmann, Ritov, and Dezeure (2014), and Belloni, Chernozhukov, Fernandez-Val, and Hansen (2014), among others.…”
Section: Introductionmentioning
confidence: 99%