2020
DOI: 10.48550/arxiv.2001.07835
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Knockoffs with Side Information

Abstract: We consider the problem of assessing the importance of multiple variables or factors from a dataset when side information is available. In principle, using side information can allow the statistician to pay attention to variables with a greater potential, which in turn, may lead to more discoveries. We introduce an adaptive knockoff filter, which generalizes the knockoff procedure (Barber and Candès, 2015;Candès et al., 2018) in that it uses both the data at hand and side information to adaptively order the va… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 8 publications
(11 citation statements)
references
References 47 publications
0
11
0
Order By: Relevance
“…Fithian and Lei (2020) offers new technical tools for controlling FDR under dependence, including in settings with data-adaptive p-value weights, that may prove fruitful in relaxing the independence assumption. The adaptive knockoff method of Ren and Candès (2020) suggests another possible way forward by incorporating side information into multiple testing in supervised learning problems.…”
Section: Dependent P-valuesmentioning
confidence: 99%
“…Fithian and Lei (2020) offers new technical tools for controlling FDR under dependence, including in settings with data-adaptive p-value weights, that may prove fruitful in relaxing the independence assumption. The adaptive knockoff method of Ren and Candès (2020) suggests another possible way forward by incorporating side information into multiple testing in supervised learning problems.…”
Section: Dependent P-valuesmentioning
confidence: 99%
“…Another avenue of research would be the development of "wrapper" methods that enable the utilization of side-information by black-box supervised learning methods. For example, Ren and Candès [2020] develop methods for feature selection and ranking based on any feature importance statistic and any machine learning model can be enhanced by accounting for side-information.…”
Section: Discussionmentioning
confidence: 99%
“…This paradigm -flexible modeling plus calibration of a 1-dimensional parameter based on frequentist criteria-has proven to be fruitful for statistical applications going beyond the group-regularized ridge regression problem considered here. Further applications of this paradigm include feature selection in regression settings via knockoffs [Candès et al, 2018, Ren andCandès, 2020], multiple testing with side-information [Ignatiadis et al, 2016, Lei and Fithian, 2018, Ignatiadis and Huber, 2018, empirical Bayes shrinkage with side-information [Tan, 2016, Ignatiadis andWager, 2019] and conformal prediction [Vovk et al, 2005, Gupta et al, 2019.…”
Section: Discussionmentioning
confidence: 99%
“…To address this challenge, Ren and Candès (2020) developed an adaptive knockoff filter, extending the original knockoff filter of Barber and Candès (2015) and in order to increase its power by leveraging side information. This paper applies the method of Ren and Candès (2020) to analyze GWAS data from individuals with different ancestries , comparing its performance to that of a novel alternative approach. The main difference between the adaptive knockoff filter of Ren and Candès (2020) and the novel method proposed here in Section 2.3 is that the latter directly leverages side information while analyzing the raw genotype-phenotype data, while the former operates on pre-computed knockoff statistics.…”
Section: Related Workmentioning
confidence: 99%