2017
DOI: 10.1186/s40168-017-0318-y
|View full text |Cite
|
Sign up to set email alerts
|

MetaMeta: integrating metagenome analysis tools to improve taxonomic profiling

Abstract: BackgroundMany metagenome analysis tools are presently available to classify sequences and profile environmental samples. In particular, taxonomic profiling and binning methods are commonly used for such tasks. Tools available among these two categories make use of several techniques, e.g., read mapping, k-mer alignment, and composition analysis. Variations on the construction of the corresponding reference sequence databases are also common. In addition, different tools provide good results in different datas… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
27
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 42 publications
(28 citation statements)
references
References 33 publications
1
27
0
Order By: Relevance
“…The success of microbiome studies (composition, structure, diversity, and function) is primarily ascribable to the development of bioinformatics tools embedded in creative algorithms specially tailored to overcome the technical challenges posed by the analysis of massively paralleled, high-throughput sequencing data ( Simon and Daniel, 2011 ; Siegwald et al, 2017 ). These bioinformatics tools make use of several techniques (e.g., read mapping, k-mer alignment, and composition analysis) ( Piro et al, 2017 ) and can be categorized into two distinct groups: (1) programs that use all available genome sequences ( Lindgreen et al, 2016 ), also called assignment-first approaches ( Siegwald et al, 2017 ) (e.g., CLARK – Ounit et al, 2015 ; GOTTCHA – Freitas et al, 2015 ; KRAKEN – Wood and Salzberg, 2014 ; MG-RAST – Meyer et al, 2008 ), and (2) programs that target a set of marker genes ( Lindgreen et al, 2016 ), also known as clustering-first approaches ( Siegwald et al, 2017 ) (e.g., QIIME – Caporaso et al, 2010 ; MOTHUR – Schloss et al, 2009 ; MetaPhlAn – Segata et al, 2012 ; mOTU – Sunagawa et al, 2013 ). In the assignment-first tools, all reads are assigned to the lowest taxonomy unit (lower common ancestor-LCA) within a reference database based on their annotations, while in the clustering-first approaches the reads are grouped into Operational Taxonomic Units (OTUs) using different OTU picking strategies (closed or open reference) to assign reads to a taxonomic group based on their sequence similarities ( Siegwald et al, 2017 ).…”
Section: Introductionmentioning
confidence: 99%
“…The success of microbiome studies (composition, structure, diversity, and function) is primarily ascribable to the development of bioinformatics tools embedded in creative algorithms specially tailored to overcome the technical challenges posed by the analysis of massively paralleled, high-throughput sequencing data ( Simon and Daniel, 2011 ; Siegwald et al, 2017 ). These bioinformatics tools make use of several techniques (e.g., read mapping, k-mer alignment, and composition analysis) ( Piro et al, 2017 ) and can be categorized into two distinct groups: (1) programs that use all available genome sequences ( Lindgreen et al, 2016 ), also called assignment-first approaches ( Siegwald et al, 2017 ) (e.g., CLARK – Ounit et al, 2015 ; GOTTCHA – Freitas et al, 2015 ; KRAKEN – Wood and Salzberg, 2014 ; MG-RAST – Meyer et al, 2008 ), and (2) programs that target a set of marker genes ( Lindgreen et al, 2016 ), also known as clustering-first approaches ( Siegwald et al, 2017 ) (e.g., QIIME – Caporaso et al, 2010 ; MOTHUR – Schloss et al, 2009 ; MetaPhlAn – Segata et al, 2012 ; mOTU – Sunagawa et al, 2013 ). In the assignment-first tools, all reads are assigned to the lowest taxonomy unit (lower common ancestor-LCA) within a reference database based on their annotations, while in the clustering-first approaches the reads are grouped into Operational Taxonomic Units (OTUs) using different OTU picking strategies (closed or open reference) to assign reads to a taxonomic group based on their sequence similarities ( Siegwald et al, 2017 ).…”
Section: Introductionmentioning
confidence: 99%
“…This, coupled with the previously reported possibility of high levels of false positives resulting from Kaiju assignment (27) and the fact that MetaPhlAn2 works off only a subset of marker genes per species (28), is why Kraken was preferentially employed, with a filter threshold of 0.2 to increase precision without detrimentally impacting sensitivity. Furthermore, to reduce the possibility of false positives (27), taxa were included only if present at a minimum of 1% relative abundance in at least one sample; otherwise, reads were assigned as "others." B. cereus was found to be the dominant species in 7 of the 12 monthly mesophilic sporeformer-enriched samples, i.e., those from January, February, March, May, July, October, and November.…”
Section: Resultsmentioning
confidence: 99%
“…Disadvantages that need to be overcome in order to allow for the routine use of the sequencing technologies employed in this study primarily relate to the cost of analysis, which is currently too expensive for large-scale routine use. Additionally, there are challenges relating to assembly of genomes from shotgun metagenomic sequencing (22) and difficulties arising from insufficient accuracies associated, to different extents, with taxonomic classifiers (27). There are some solutions emerging, whereby new lower-cost, rapid sequencers are arriving on the market, with MinION (45) leading the way toward quick portable detection systems for microorganisms.…”
Section: Discussionmentioning
confidence: 99%
“…Recently, research interests in microbial communities have been strongly increased due to findings on the impact of the microbiome on human health [11,12]. Microbiome studies often employ metaomics techniques such as metagenomics [13] that aims to analyze the genetic material from all members in a microbial community sample. Despite many advantages, metagenomics still presents a static gene-centric approach that cannot assess temporal dynamics and functional activities of complex microbial populations [14].…”
Section: Introductionmentioning
confidence: 99%
“…Used separately, metagenomics, metatranscriptomics, and metaproteomics are already powerful because they complement and mutually support each other. In the past, powerful tailored bioinformatic solutions have been developed for the individual meta-omics analysis levels [13,15,16]. However, the true strength unfolds when these analysis techniques are integrated [17,18].…”
Section: Introductionmentioning
confidence: 99%