The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2019
DOI: 10.1111/1755-0998.13109
|View full text |Cite
|
Sign up to set email alerts
|

Latent Dirichlet Allocation reveals spatial and taxonomic structure in a DNA‐based census of soil biodiversity from a tropical forest

Abstract: High‐throughput sequencing of amplicons from environmental DNA samples permits rapid, standardized and comprehensive biodiversity assessments. However, retrieving and interpreting the structure of such data sets requires efficient methods for dimensionality reduction. Latent Dirichlet Allocation (LDA) can be used to decompose environmental DNA samples into overlapping assemblages of co‐occurring taxa. It is a flexible model‐based method adapted to uneven sample sizes and to large and sparse data sets. Here, we… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
21
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 20 publications
(23 citation statements)
references
References 63 publications
2
21
0
Order By: Relevance
“…Once we had selected the K value, we ran 100 independent MCMC chains on the whole dataset from random initial conditions. To check for potential insufficient mixing along the chains, we measured the similarity in the spatial distribution of assemblages across the chains (Table S1), using the metric defined in Sommeria-Klein et al (2019). We picked the chain with posterior probability closest to the mean across chains for the final interpretation.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Once we had selected the K value, we ran 100 independent MCMC chains on the whole dataset from random initial conditions. To check for potential insufficient mixing along the chains, we measured the similarity in the spatial distribution of assemblages across the chains (Table S1), using the metric defined in Sommeria-Klein et al (2019). We picked the chain with posterior probability closest to the mean across chains for the final interpretation.…”
Section: Methodsmentioning
confidence: 99%
“…The data encompass 250,057 eukaryotic Operational Taxonomic Units (OTUs) sampled globally at the surface and at the Deep Chlorophyl Maximum (DCM) across 129 stations. We use a probabilistic model that allows identification of a number of ‘assemblages’, each of which represents a set of OTUs that tend to co-occur across samples (Sommeria-Klein et al, 2019; Valle, Baiser, Woodall, & Chazdon, 2014; Methods). Each local planktonic community can then be seen as a sample drawn in various proportions from the assemblages.…”
Section: Main Textmentioning
confidence: 99%
“…Application of LDA to these data should help reveal the structure of microbial assemblages on a global scale [52]. For example, Sommeria-Klein et al recently applied LDA to taxonomic profiles of a tropical forest soil DNA dataset to reveal spatial structures [53]. The second direction is the extension of the LDA model-LDA has high model extensibility.…”
Section: Table 2 Functional Assemblage Having the Largest Relativementioning
confidence: 99%
“…We next tested LDA potential to stratify gut microbiota of the cohort participants. This unsupervised machine learning technique is increasingly finding acceptance in the field of microbiome [4648] for its unique ability to reveal latent or hidden groups within the data cloud. Supplementary Figure S4 shows LDA model’s perplexity parameter and log-likelihood values to find optimal number of clusters.…”
Section: Resultsmentioning
confidence: 99%