Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2016
DOI: 10.1186/s40168-016-0208-8
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale benchmarking reveals false discoveries and count transformation sensitivity in 16S rRNA gene amplicon data analysis methods used in microbiome studies

Abstract: BackgroundThere is an immense scientific interest in the human microbiome and its effects on human physiology, health, and disease. A common approach for examining bacterial communities is high-throughput sequencing of 16S rRNA gene hypervariable regions, aggregating sequence-similar amplicons into operational taxonomic units (OTUs). Strategies for detecting differential relative abundance of OTUs between sample conditions include classical statistical approaches as well as a plethora of newer methods, many bo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

9
160
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 154 publications
(181 citation statements)
references
References 38 publications
9
160
0
1
Order By: Relevance
“…The similarities with respect to sparsity observed in both scRNA-seq and metagenomics data led us to pose the question of whether statistical methods developed for the differential expression of scRNA-seq data perform well on metagenomic DA analysis. Some benchmarking efforts have compared the performance of methods [9][10][11][12] both adapted from bulk RNA-seq and developed for microbiome DA 13,14 . While some tools exist to guide researchers 15 , a general consensus on the best approach is still missing, especially regarding the methods' capability of controlling false discoveries.…”
Section: Introductionmentioning
confidence: 99%
“…The similarities with respect to sparsity observed in both scRNA-seq and metagenomics data led us to pose the question of whether statistical methods developed for the differential expression of scRNA-seq data perform well on metagenomic DA analysis. Some benchmarking efforts have compared the performance of methods [9][10][11][12] both adapted from bulk RNA-seq and developed for microbiome DA 13,14 . While some tools exist to guide researchers 15 , a general consensus on the best approach is still missing, especially regarding the methods' capability of controlling false discoveries.…”
Section: Introductionmentioning
confidence: 99%
“…To quantify differences in proportions of features between two sampling groups [often referred to as ‘differential relative abundance testing’; Thorsen et al, , Weiss et al, ), posterior probability distributions (PPDs) for πj,k=1-πj,k=2 (Figure d) can be obtained. Consistent with convention, if 95% of the samples of this PPD of differences are either greater or less than zero, then there is a high certainty of a nonzero effect of sampling group on feature relative abundance.…”
Section: Methodsmentioning
confidence: 99%
“…To quantify differences in proportions of features between two sampling groups [often referred to as 'differential relative abundance testing'; Thorsen et al, 2016, Weiss et al, 2017, posterior probability distributions (PPDs) for j,k=1 − j,k=2 (Figure 2d) can be obtained.…”
Section: Dirichlet Multinomial Modelling Approachmentioning
confidence: 99%
See 1 more Smart Citation
“…Operational taxonomic units (OTUs) responding significantly across experimental design were extracted using previously described methodology ) using an analysis of deviance (AOD) after generalized linear modelling (GLM) of the raw counts using negative binomial distribution (nb) with 1000 resampling iterations with residual variance, using the package mvabund (nbGLM, likelihood ratio test, p < 0.05, Wang et al 2012). This method was recently suggested as one of the most accurate way to extract significantly responding OTUs by minimizing the risk of error (Thorsen et al 2016). A generalized heatmap of dominant (relative abundance >0.1%) and significantly responding OTUs was generated using previously described methodology .…”
Section: Dna Extractionmentioning
confidence: 99%