2018
DOI: 10.1101/257931
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

PopPhy-CNN: A Phylogenetic Tree Embedded Architecture for Convolution Neural Networks for Metagenomic Data

Abstract: Motivation: Accurate prediction of the host phenotype from a metgenomic sample and identification of the associated bacterial markers are important in metagenomic studies. We introduce PopPhy-CNN, a novel convolutional neural networks (CNN) learning architecture that effectively exploits phylogentic structure in microbial taxa. PopPhy-CNN provides an input format of 2D matrix created by embedding the phylogenetic tree that is populated with the relative abundance of microbial taxa in a metagenomic sample. This… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
14
0
1

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 17 publications
(16 citation statements)
references
References 31 publications
0
14
0
1
Order By: Relevance
“…PopPhy-CNN was originally designed by our group [25,26]. PopPhy-CNN explores relationship between taxa by treating a populated taxonomic tree as a type of image.…”
Section: Learning Modulementioning
confidence: 99%
See 3 more Smart Citations
“…PopPhy-CNN was originally designed by our group [25,26]. PopPhy-CNN explores relationship between taxa by treating a populated taxonomic tree as a type of image.…”
Section: Learning Modulementioning
confidence: 99%
“…The novel procedure to extract features from CNNs was developed in our previous learning framework PopPhy-CNN [26]. We included the details on the full algorithm of feature importance scoring in Additional File 1 ( Fig.…”
Section: Feature Ranking Modulementioning
confidence: 99%
See 2 more Smart Citations
“…Reiman et al implemented a CNN by embedding phylogenetic trees into R 2 and used two-dimensional convolutional layers to construct a body-site classifier. 4 Fioravanti et al developed a model to diagnose inflammatory bowel disease (IBD) by projecting samples into a two-dimensional space using Mul-tiDimensional Scaling with the patristic distance between phylogenetic trees as the distance metric. 5 Both papers mapped phylogenetic data to a Euclidean domain to perform convolutions instead of operating in the original tree topology, as we do in this study.…”
Section: Previous Workmentioning
confidence: 99%