2020
DOI: 10.1093/bib/bbaa073
|View full text |Cite
|
Sign up to set email alerts
|

A novel deep learning method for predictive modeling of microbiome data

Abstract: With the development and decreasing cost of next-generation sequencing technologies, the study of the human microbiome has become a rapid expanding research field, which provides an unprecedented opportunity in various clinical applications such as drug response predictions and disease diagnosis. It is thus essential and desirable to build a prediction model for clinical outcomes based on microbiome data that usually consist of taxon abundance and a phylogenetic tree. Importantly, all microbial species are not… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 24 publications
(21 citation statements)
references
References 47 publications
0
20
0
Order By: Relevance
“…Other ML model architectures, such as convolutional neural networks (CNNs), are commonly used for classification problems with certain structure in the input data, such as image classification. In the context of microbiome data, the CNN MDeep adds structure to OTU features through hierarchical agglomerative clustering of the phylogeny-induced correlation between OTUs [58]. As MDeep is currently only developed for OTU features, we assessed whether this CNN architecture led to greater classification performance with OTU abundance than our MLP architecture.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Other ML model architectures, such as convolutional neural networks (CNNs), are commonly used for classification problems with certain structure in the input data, such as image classification. In the context of microbiome data, the CNN MDeep adds structure to OTU features through hierarchical agglomerative clustering of the phylogeny-induced correlation between OTUs [58]. As MDeep is currently only developed for OTU features, we assessed whether this CNN architecture led to greater classification performance with OTU abundance than our MLP architecture.…”
Section: Resultsmentioning
confidence: 99%
“…We implemented MDeep, a CNN architecture recently designed for microbiome data [58]. CNNs require an inherent structure to present in the data, which is added to the OTU dataset by hierarchical agglomerative clustering of the phylogeny-induced correlation between OTUs.…”
Section: Methodsmentioning
confidence: 99%
“…The classification of microbial species, the prediction of host phenotypes and ecological environments, the investigation of interactions between community members, and the prediction of associations between microbiome and disease are the key tasks (Qu et al, 2019;Zhou and Gallins, 2019). More and more DL models show advantages in human metagenomics data in detecting biomarkers that characterize the microbiome traits and host phenotype, such as MetaPheno (LaPierre et al, 2019) and MDeep (Wang et al, 2020). The ML method has succeeded in predicting productivity based on plant soil metagenomic data.…”
Section: Representation Learning For Microbiome Datamentioning
confidence: 99%
“…The MDeep model has been developed to simulate the phylogenetic tree structure of microbial taxa at different taxonomical levels. It indicated that convolutional neural network could automatically learn representation and map a complex feature to the simple one by convolutional multilayers (Wang et al, 2020).…”
Section: Data Representation and Feature Extractionmentioning
confidence: 99%
“…Even with these caveats, many studies have utilized phylogenetic measures of diversity to differentiate microbiomes by host disease state or other phenotype (9,10) . Moreover, phylogenies are successfully utilized in many recently published tools for microbiome data analysis, which include pseudo-count imputation (11) , dataset augmentation (12) , constrained ordination analysis (13) , transforming compositional abundance data (14) , and machine learning models for predicting host phenotypes (15,16) .…”
Section: Introductionmentioning
confidence: 99%