2020
DOI: 10.1093/nargab/lqaa093
|View full text |Cite
|
Sign up to set email alerts
|

dearseq: a variance component score test for RNA-seq differential analysis that effectively controls the false discovery rate

Abstract: RNA-seq studies are growing in size and popularity. We provide evidence that the most commonly used methods for differential expression analysis (DEA) may yield too many false positive results in some situations. We present dearseq, a new method for DEA that controls the false discovery rate (FDR) without making any assumption about the true distribution of RNA-seq data. We show that dearseq controls the FDR while maintaining strong statistical power compared to the most popular methods. We demonstrate this be… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
24
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 21 publications
(26 citation statements)
references
References 33 publications
1
24
0
Order By: Relevance
“…In particular, on GTEx datasets, differential expression analysis can be performed between two tissues or cell types; on TCGA datasets, differential expression analysis can be performed between two disease statuses or biological conditions. The four representative methods include two popular methods limma-voom [ 14 , 15 ] and NOISeq [ 16 ], a new method dearseq [ 11 ] (which claimed to overcome the FDR inflation issue of DESeq2 and edgeR on large-sample-size data), and the classic Wilcoxon rank-sum test [ 17 ]. Note that DESeq2, edgeR, and limma-voom are parametric methods that assume parametric models for data distribution, while NOISeq, dearseq, and the Wilcoxon rank-sum test are non-parametric methods that are less restrictive but require large sample sizes to have good power.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, on GTEx datasets, differential expression analysis can be performed between two tissues or cell types; on TCGA datasets, differential expression analysis can be performed between two disease statuses or biological conditions. The four representative methods include two popular methods limma-voom [ 14 , 15 ] and NOISeq [ 16 ], a new method dearseq [ 11 ] (which claimed to overcome the FDR inflation issue of DESeq2 and edgeR on large-sample-size data), and the classic Wilcoxon rank-sum test [ 17 ]. Note that DESeq2, edgeR, and limma-voom are parametric methods that assume parametric models for data distribution, while NOISeq, dearseq, and the Wilcoxon rank-sum test are non-parametric methods that are less restrictive but require large sample sizes to have good power.…”
Section: Resultsmentioning
confidence: 99%
“…From the literature, we found that several studies had reported the anticonservative behavior of DESeq2 and edgeR [9][10][11]; however, they were restricted to using simulated datasets with small sample sizes. Hence, for our large-sample-size scenario, their findings did not provide a direct answer to our question.…”
mentioning
confidence: 99%
“…The adjusted P- value and [logFC] were calculated. The Benjamini & Hochberg false discovery rate method was used as a correction factor for the adjusted P-value in DESeq2 [41]. The statistically significant DEGs were identified according to adjusted P < .05, and [logFC] > 2.576 for up regulated genes and [logFC] < -2.813 for down regulated genes.…”
Section: Methodsmentioning
confidence: 99%
“…Differentially expressed genes (DEGs) between T1DM samples and normal control samples were identified by analyzing NGS data with DESeq2 package of R software [24]. The Benjamini and Hochberg (BH) method was accomplished to adjust P value to reduce the false discover rate [25]. According to the standard, we used FC > 3.835 as the screening criterion for up regulation of DEGs, FC < < 0 for down regulation of DEGs and adjust P value was < 0.05.…”
Section: Methodsmentioning
confidence: 99%