2021
DOI: 10.1101/2021.09.06.459120
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

tascCODA: Bayesian tree-aggregated analysis of compositional amplicon and single-cell data

Abstract: Accurate generative statistical modeling of count data is of critical relevance for the analysis of biological datasets from high-throughput sequencing technologies. Important instances include the modeling of microbiome compositions from amplicon sequencing surveys and the analysis of cell type compositions derived from single-cell RNA sequencing. Microbial and cell type abundance data share remarkably similar statistical features, including their inherent compositionality and a natural hierarchical ordering … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 66 publications
0
3
0
Order By: Relevance
“…Alternative generative statistical modeling approaches include the Dirichlet multinomial (mixture) framework, 22 latent Dirichlet allocation, 23,24 and Poisson distribution models 25 (including their respective zero‐inflated extensions). Several models also allow the inclusion of host or environmental covariate data in generative modeling, including Poisson factor models, 26‐28 latent Dirichlet allocation, 29 and Bayesian Dirichlet multinomial models 30‐32 . Due to the abundance of excess zeros at the amplicon or species level, some of these models also include a zero‐inflation modeling component.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Alternative generative statistical modeling approaches include the Dirichlet multinomial (mixture) framework, 22 latent Dirichlet allocation, 23,24 and Poisson distribution models 25 (including their respective zero‐inflated extensions). Several models also allow the inclusion of host or environmental covariate data in generative modeling, including Poisson factor models, 26‐28 latent Dirichlet allocation, 29 and Bayesian Dirichlet multinomial models 30‐32 . Due to the abundance of excess zeros at the amplicon or species level, some of these models also include a zero‐inflation modeling component.…”
Section: Introductionmentioning
confidence: 99%
“…Several models also allow the inclusion of host or environmental covariate data in generative modeling, including Poisson factor models, [26][27][28] latent Dirichlet allocation, 29 and Bayesian Dirichlet multinomial models. [30][31][32] Due to the abundance of excess zeros at the amplicon or species level, some of these models also include a zero-inflation modeling component. This and the high dimensionality of the data at the OTU/ASV level make both estimation and biological interpretability challenging.…”
Section: Introductionmentioning
confidence: 99%
“…Alternative generative statistical modeling approaches include the Dirichlet Multinomial (mixture) framework 22 , latent Dirichlet allocation 23,24 , and Poisson distribution models 25 (including their respective zero-inflated extensions). Several models also allow the inclusion of host or environmental covariate data in generative modeling, including Poisson factor models 26,27,28 , latent Dirichlet allocation 29 , and Bayesian Dirichlet multinomial models 30,31 . The majority of these procedures, however, tackles the arguably more challenging problem of modeling data at the amplicon or species level.…”
Section: Introductionmentioning
confidence: 99%