2013
DOI: 10.1371/journal.pone.0067019
|View full text |Cite
|
Sign up to set email alerts
|

ANOVA-Like Differential Expression (ALDEx) Analysis for Mixed Population RNA-Seq

Abstract: Experimental variance is a major challenge when dealing with high-throughput sequencing data. This variance has several sources: sampling replication, technical replication, variability within biological conditions, and variability between biological conditions. The high per-sample cost of RNA-Seq often precludes the large number of experiments needed to partition observed variance into these categories as per standard ANOVA models. We show that the partitioning of within-condition to between-condition variati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

2
621
0
2

Year Published

2016
2016
2023
2023

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 658 publications
(661 citation statements)
references
References 50 publications
2
621
0
2
Order By: Relevance
“…To avoid inappropriate statistical inferences made from compositional data, centred log-ratios (clr), a method previously described by Aitchison (Aitchison 1982), and adapted to microbiome data was used with unpaired Wilcoxon tests for comparisons of OTU level data (Fernandes et al 2013;Fernandes et al 2014). The Benjamini Hochberg (FDR) method was used to control for multiple testing with a significance threshold of 0.1.…”
Section: Microbiome Profilingmentioning
confidence: 99%
“…To avoid inappropriate statistical inferences made from compositional data, centred log-ratios (clr), a method previously described by Aitchison (Aitchison 1982), and adapted to microbiome data was used with unpaired Wilcoxon tests for comparisons of OTU level data (Fernandes et al 2013;Fernandes et al 2014). The Benjamini Hochberg (FDR) method was used to control for multiple testing with a significance threshold of 0.1.…”
Section: Microbiome Profilingmentioning
confidence: 99%
“…Thus, it is important to ensure that any analysis takes this random component into account (Fernandes et al 2013).…”
Section: Introductionmentioning
confidence: 99%
“…Compositional data are a term used to describe a data set in which the parts in each sample have an arbitrary or noninformative sum (Aitchison 1986), such as data obtained from high-throughput DNA sequencing (Friedman and Alm 2012;Fernandes et al 2013Fernandes et al , 2014. These data have long been known to be problematic (Pearson 1896), and we now understand that multivariate data analysis approaches such as ordination and clustering and univariate methods that measure differential abundance are invalid (Aitchison 1986;Warton et al 2012;Friedman and Alm 2012;Fernandes et al 2013;.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations