2019
DOI: 10.1093/biostatistics/kxz010
|View full text |Cite
|
Sign up to set email alerts
|

The functional false discovery rate with applications to genomics

Abstract: Summary The false discovery rate (FDR) measures the proportion of false discoveries among a set of hypothesis tests called significant. This quantity is typically estimated based on p-values or test statistics. In some scenarios, there is additional information available that may be used to more accurately estimate the FDR. We develop a new framework for formulating and estimating FDRs and q-values when an additional piece of information, which we call an “informative variable”, is available. Fo… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
20
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 36 publications
(20 citation statements)
references
References 36 publications
0
20
0
Order By: Relevance
“…The q value is a useful algorithm to correct for multiple testing given that with traditional FDR cannot be defined when there are no positive results. A recent update to this algorithm using an additional informative variable approach is available for processes such eQTL mapping or RNA-seq data where an additional variable such as read depth can enhance the overall amount of information available to calculate these q values [ 46 ]. The remaining genes (the general pool) were still examined to avoid ignoring potentially important surprise findings; however, their q values [ 45 ] were calculated independently from those of the candidate pool.…”
Section: Methodsmentioning
confidence: 99%
“…The q value is a useful algorithm to correct for multiple testing given that with traditional FDR cannot be defined when there are no positive results. A recent update to this algorithm using an additional informative variable approach is available for processes such eQTL mapping or RNA-seq data where an additional variable such as read depth can enhance the overall amount of information available to calculate these q values [ 46 ]. The remaining genes (the general pool) were still examined to avoid ignoring potentially important surprise findings; however, their q values [ 45 ] were calculated independently from those of the candidate pool.…”
Section: Methodsmentioning
confidence: 99%
“…Kaplan-Meier curve and log-rank test were conducted for survival analysis using GraphPad Prism v.5.0 (GraphPad Software, Inc.). The aberrantly expressed lncRNAs were explored according to Benjamini-Hochberg method ( 29 ). P<0.05 was considered to indicate a statistically significant difference.…”
Section: Methodsmentioning
confidence: 99%
“…We test m features, and for each feature i , we let X i be a factor with values in {0,1} expressing whether the feature was significantly differential in experiment 1 ( X i = 1), or not ( X i = 0). We are interested in whether the variable X = ( X 1 , …, X m ) is an informative covariate for experiment 2, using terminology from recent work in covariate-powered multiple hypothesis testing 42,43 .…”
Section: Methodsmentioning
confidence: 99%
“…= 0). We are interested in whether the variable = ( " , … , # ) is an informative covariate for experiment 2, using terminology from recent work in covariate-powered multiple hypothesis testing 42,43 .…”
Section: Testing For Statistically Significant Overlap Between Two LImentioning
confidence: 99%