2017
DOI: 10.7287/peerj.preprints.2685v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Phylogenetic factorization of compositional data yields lineage-level associations in microbiome datasets

Abstract: Marker gene sequencing of microbial communities has generated big datasets of microbial relative abundances varying across environmental conditions, sample sites and treatments. These data often come with putative phylogenies, providing unique opportunities to investigate how shared evolutionary history affects microbial abundance patterns. Here, we present a method to identify the phylogenetic factors driving patterns in microbial community composition. We use the method, "phylofactorization", to reanalyze da… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
16
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
9

Relationship

2
7

Authors

Journals

citations
Cited by 14 publications
(17 citation statements)
references
References 0 publications
1
16
0
Order By: Relevance
“…When the basis is a branch of a dendrogram, the ilr offers an intuitive way to contrast one set of components against another set of components. These contrasts, called balances, have been used to analyze meta-genomics data based on evolutionary trees [57,67], but could be applied to any data if a similarly meaningful tree were available.…”
Section: Part 2a: Transformation-dependent Analysesmentioning
confidence: 99%
“…When the basis is a branch of a dendrogram, the ilr offers an intuitive way to contrast one set of components against another set of components. These contrasts, called balances, have been used to analyze meta-genomics data based on evolutionary trees [57,67], but could be applied to any data if a similarly meaningful tree were available.…”
Section: Part 2a: Transformation-dependent Analysesmentioning
confidence: 99%
“…Null simulations may allow statistical statements stemming from a clear null model, but stopping criteria can be far more computationally efficient. Washburne et al (2017) proposed a stopping criterion for regression phylofactorization which extends to all methods of phylofactorization using an objective function whose null-distribution for a single edge is known. The original stopping criterion is based on the fact that, if the null hypothesis is true, the distribution of P values from multiple hypothesis tests is uniform.…”
Section: Statistical Challengesmentioning
confidence: 99%
“…Although more complicated than a simple log-ratio, having more parts means that a single balance can describe more variance than a single log-ratio. Balances have recently become popular for the analysis and classification of microbiome compositions [44,46,29,40,37].…”
Section: Introductionmentioning
confidence: 99%