2018
DOI: 10.1093/bioinformatics/bty844
|View full text |Cite
|
Sign up to set email alerts
|

Strain-GeMS: optimized subspecies identification from microbiome data based on accurate variant modeling

Abstract: Motivation Subspecies identification is one of the most critical issues in microbiome studies, as it is directly related to their functions in response to the environmental stress and their feedbacks. However, identification of subspecies remains a challenge largely due to the small variation between different strains within the same species. Accurate identification of subspecies primarily relies on variant identification and categorization through microbiome data. However, current SNP callin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 8 publications
0
6
0
Order By: Relevance
“…It is critical to identify and characterize microbial species in environments and individual human hosts in order to learn about human–microbial interactions. Many bioinformatics computational tools have been developed for the characterization and identification of microorganisms at species or strain levels, such as StrainPhlAn [ 33 ], ConStrains [ 34 ] and Strain-GeMS [ 2 ]. However, most of these traditional tools are based on genomic sequence comparison and marker genes such as 16S rRNA and thus often lack the resolution to reliably capture intraspecific genomic differences.…”
Section: Current Methods and Deep Learning For Microbiome Data Miningmentioning
confidence: 99%
See 2 more Smart Citations
“…It is critical to identify and characterize microbial species in environments and individual human hosts in order to learn about human–microbial interactions. Many bioinformatics computational tools have been developed for the characterization and identification of microorganisms at species or strain levels, such as StrainPhlAn [ 33 ], ConStrains [ 34 ] and Strain-GeMS [ 2 ]. However, most of these traditional tools are based on genomic sequence comparison and marker genes such as 16S rRNA and thus often lack the resolution to reliably capture intraspecific genomic differences.…”
Section: Current Methods and Deep Learning For Microbiome Data Miningmentioning
confidence: 99%
“…Numerous bioinformatics tools have taken advantage of the growing amount of genomic data to identify new species. For example, StrainPhlAn [ 33 ], ConStrains [ 34 ] and Strain-GeMS [ 2 ] are proposed for bacterial identification at the strain level based on genomic information and ArboTyping [ 61 ] for the identification of virus species and genotypes. By considering that species are organized according to the phylogenetic tree of life, the latter can be considered an ontology structure, and the identification of novel species is an ontology-related problem.…”
Section: Applications Of Onn In Microbiome Data Mining Contextsmentioning
confidence: 99%
See 1 more Smart Citation
“…Metagenome-based population genomics studies frequently focus on the microdiversity associated with SNVs, therefore linked to the potential optimization of traits [ 40 ]. Furthermore, there are methods that aim at reconstructing strains or haplotypes from the metagenomic data (e.g., ConStrains [ 110 ], DESMAN [ 111 ], STRONG [ 112 ], InStrain [ 108 ], and Strain-GeMS [ 113 ]). Given the space limitations, below, I will provide a few examples of some of these approaches applied to marine microbes to convey the central message without aiming for a comprehensive review.…”
Section: Ocean Microbes Are Key For the Functioning Of The Earth's Sy...mentioning
confidence: 99%
“…While species-specific microbiome approaches exist ( Karcher et al , 2020 ; Milani et al , 2014 ), no software yet exists to broadly delineate subspecies from metagenomic data. Many tools characterize population-level diversity [MIDAS ( Nayfach et al , 2016 ), metaSNV v1 ( Costea et al , 2017a ), POPGENOM ( Sjöqvist et al , 2021 ), inStrain ( Olm et al, 2021 ), StrainPhlAn ( Truong et al , 2017 )] and/or recover haplotypes or strains from metagenomes [DESMAN ( Quince et al , 2017 ), ConStrains ( Luo et al , 2015 ), InStrain ( Olm et al , 2021 ), strainGEMS ( Tan et al , 2019 )], with varying definitions of strains ( Van Rossum et al , 2020 ). However, none of these tools provide robust clustering of population diversity for data-driven identification of subspecies.…”
Section: Introductionmentioning
confidence: 99%