MetaMeta: integrating metagenome analysis tools to improve taxonomic profiling

Piro, Vitor C.; Matschkowski, Marcel; Renard, Bernhard Y.

doi:10.1186/s40168-017-0318-y

Cited by 42 publications

(28 citation statements)

References 33 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The success of microbiome studies (composition, structure, diversity, and function) is primarily ascribable to the development of bioinformatics tools embedded in creative algorithms specially tailored to overcome the technical challenges posed by the analysis of massively paralleled, high-throughput sequencing data ( Simon and Daniel, 2011 ; Siegwald et al, 2017 ). These bioinformatics tools make use of several techniques (e.g., read mapping, k-mer alignment, and composition analysis) ( Piro et al, 2017 ) and can be categorized into two distinct groups: (1) programs that use all available genome sequences ( Lindgreen et al, 2016 ), also called assignment-first approaches ( Siegwald et al, 2017 ) (e.g., CLARK – Ounit et al, 2015 ; GOTTCHA – Freitas et al, 2015 ; KRAKEN – Wood and Salzberg, 2014 ; MG-RAST – Meyer et al, 2008 ), and (2) programs that target a set of marker genes ( Lindgreen et al, 2016 ), also known as clustering-first approaches ( Siegwald et al, 2017 ) (e.g., QIIME – Caporaso et al, 2010 ; MOTHUR – Schloss et al, 2009 ; MetaPhlAn – Segata et al, 2012 ; mOTU – Sunagawa et al, 2013 ). In the assignment-first tools, all reads are assigned to the lowest taxonomy unit (lower common ancestor-LCA) within a reference database based on their annotations, while in the clustering-first approaches the reads are grouped into Operational Taxonomic Units (OTUs) using different OTU picking strategies (closed or open reference) to assign reads to a taxonomic group based on their sequence similarities ( Siegwald et al, 2017 ).…”

Section: Introductionmentioning

confidence: 99%

Enhancing the Resolution of Rumen Microbial Classification from Metatranscriptomic Data Using Kraken and Mothur

Neves

Ghoshal

et al. 2017

Front. Microbiol.

View full text Add to dashboard Cite

The advent of next generation sequencing and bioinformatics tools have greatly advanced our knowledge about the phylogenetic diversity and ecological role of microbes inhabiting the mammalian gut. However, there is a lack of information on the evaluation of these computational tools in the context of the rumen microbiome as these programs have mostly been benchmarked on real or simulated datasets generated from human studies. In this study, we compared the outcomes of two methods, Kraken (mRNA based) and a pipeline developed in-house based on Mothur (16S rRNA based), to assess the taxonomic profiles (bacteria and archaea) of rumen microbial communities using total RNA sequencing of rumen fluid collected from 12 cattle with differing feed conversion ratios (FCR). Both approaches revealed a similar phyla distribution of the most abundant taxa, with Bacteroidetes, Firmicutes, and Proteobacteria accounting for approximately 80% of total bacterial abundance. For bacterial taxa, although 69 genera were commonly detected by both methods, an additional 159 genera were exclusively identified by Kraken. Kraken detected 423 species, while Mothur was not able to assign bacterial sequences to the species level. For archaea, both methods generated similar results only for the abundance of Methanomassiliicoccaceae (previously referred as RCC), which comprised more than 65% of the total archaeal families. Taxon R4-41B was exclusively identified by Mothur in the rumen of feed efficient bulls, whereas Kraken uniquely identified Methanococcaceae in inefficient bulls. Although Kraken enhanced the microbial classification at the species level, identification of bacteria or archaea in the rumen is limited due to a lack of reference genomes for the rumen microbiome. The findings from this study suggest that the development of the combined pipelines using Mothur and Kraken is needed for a more inclusive and representative classification of microbiomes.

show abstract

Section: Introductionmentioning

confidence: 99%

Enhancing the Resolution of Rumen Microbial Classification from Metatranscriptomic Data Using Kraken and Mothur

Neves

Ghoshal

et al. 2017

Front. Microbiol.

View full text Add to dashboard Cite

show abstract

“…This, coupled with the previously reported possibility of high levels of false positives resulting from Kaiju assignment (27) and the fact that MetaPhlAn2 works off only a subset of marker genes per species (28), is why Kraken was preferentially employed, with a filter threshold of 0.2 to increase precision without detrimentally impacting sensitivity. Furthermore, to reduce the possibility of false positives (27), taxa were included only if present at a minimum of 1% relative abundance in at least one sample; otherwise, reads were assigned as "others." B. cereus was found to be the dominant species in 7 of the 12 monthly mesophilic sporeformer-enriched samples, i.e., those from January, February, March, May, July, October, and November.…”

Section: Resultsmentioning

confidence: 99%

“…Disadvantages that need to be overcome in order to allow for the routine use of the sequencing technologies employed in this study primarily relate to the cost of analysis, which is currently too expensive for large-scale routine use. Additionally, there are challenges relating to assembly of genomes from shotgun metagenomic sequencing (22) and difficulties arising from insufficient accuracies associated, to different extents, with taxonomic classifiers (27). There are some solutions emerging, whereby new lower-cost, rapid sequencers are arriving on the market, with MinION (45) leading the way toward quick portable detection systems for microorganisms.…”

Section: Discussionmentioning

confidence: 99%

Mesophilic Sporeformers Identified in Whey Powder by Using Shotgun Metagenomic Sequencing

McHugh

Feehily

Tobin

et al. 2018

Appl Environ Microbiol

View full text Add to dashboard Cite

Spoilage and pathogenic spore-forming bacteria are a major cause of concern for producers of dairy products. Traditional agar-based detection methods employed by the dairy industry have limitations with respect to their sensitivity and specificity. The aim of this study was to identify low-abundance sporeformers in samples of a powdered dairy product, whey powder, produced monthly over 1 year, using novel culture-independent shotgun metagenomics-based approaches. Although mesophilic sporeformers were the main target of this study, in one instance thermophilic sporeformers were also targeted using this culture-independent approach. For comparative purposes, mesophilic and thermophilic sporeformers were also tested for within the same sample using culture-based approaches. Ultimately, the approaches taken highlighted differences in the taxa identified due to treatment and isolation methods. Despite this, low levels of transient, mesophilic, and in some cases potentially pathogenic sporeformers were consistently detected in powder samples. Although the specific sporeformers changed from one month to the next, it was apparent that 3 groups of mesophilic sporeformers, namely, ,/ , and a third, more heterogeneous group containing, dominated across the 12 samples. Total thermophilic sporeformer taxonomy was considerably different from mesophilic taxonomy, as well as from the culturable thermophilic taxonomy, in the one sample analyzed by all four approaches. Ultimately, through the application of shotgun metagenomic sequencing to dairy powders, the potential for this technology to facilitate the detection of undesirable bacteria present in these food ingredients is highlighted. The ability of sporeformers to remain dormant in a desiccated state is of concern from a safety and spoilage perspective in dairy powder. Traditional culturing techniques are slow and provide little information without further investigation. We describe the identification of mesophilic sporeformers present in powders produced over 1 year, using novel shotgun metagenomic sequencing. This method allows detection and identification of possible pathogens and spoilage bacteria in parallel. Strain-level analysis and functional gene analysis, such as identification of toxin genes, were also performed. This approach has the potential to be of great value with respect to the detection of spore-forming bacteria and could allow a processor to make an informed decision surrounding process changes to reduce the risk of spore contamination.

show abstract

“…Recently, research interests in microbial communities have been strongly increased due to findings on the impact of the microbiome on human health [11,12]. Microbiome studies often employ metaomics techniques such as metagenomics [13] that aims to analyze the genetic material from all members in a microbial community sample. Despite many advantages, metagenomics still presents a static gene-centric approach that cannot assess temporal dynamics and functional activities of complex microbial populations [14].…”

Section: Introductionmentioning

confidence: 99%

“…Used separately, metagenomics, metatranscriptomics, and metaproteomics are already powerful because they complement and mutually support each other. In the past, powerful tailored bioinformatic solutions have been developed for the individual meta-omics analysis levels [13,15,16]. However, the true strength unfolds when these analysis techniques are integrated [17,18].…”

Section: Introductionmentioning

confidence: 99%

gNOMO: a multi-omics pipeline for integrated host and microbiome analysis of non-model organisms

Muñoz-Benavent

Hartkopf

Bossche

et al. 2019

Preprint

Self Cite

View full text Add to dashboard Cite

Background The study of bacterial symbioses has grown exponentially in the recent past. However, existing bioinformatic workflows of microbiome data analysis do commonly not integrate multiple meta-omics levels and are mainly geared towards human microbiomes. Microbiota are better understood when analyzed in their biological context, which is together with their host or environment, but this is a limitation when studying non-model organisms mainly due to the lack of well-annotated sequence references. Results Here, we present gNOMO, a bioinformatic pipeline that is specifically designed to process and analyze non-model organism samples of up to three meta-omics levels: metagenomics, metatranscriptomics, and metaproteomics in an integrative manner. The pipeline has been developed using the Snakemake framework in order to obtain an automated and reproducible workflow. One of the key features is the on-the-fly creation of a tailored proteogenomic database based on metagenomics and metatranscriptomics data, leading to improved protein identification, taxonomic and functional analysis. gNOMO combines meta-omics analysis of the host with its bacterial population and allows to investigate both host and microbiome of non-model organisms with commonly insufficiently complete reference databases. Conclusions Using experimental datasets of the German cockroach Blattella germanica , a non-model organism with very complex gut microbiome, we show the capabilities of gNOMO with regard to meta-omics data integration, expression ratio comparison, taxonomic and functional analysis as well as intuitive output visualization. gNOMO includes functional information of metagenomics, metatranscriptomics, and metaproteomics data of the microbiome in the same visualization facilitating the interpretation of the results. Moreover, host data can be analyzed in parallel to obtain an equivalent output that allows to study the metabolic situation of the whole symbiotic system. Finally, the metaproteomics identification and annotation are optimized using a tailored proteogenomics database automatically obtained within the gNOMO workflow. In conclusion, gNOMO is a fully automated pipeline, for integrating and analyzing multiple meta-omics data and for producing useful output visualizations. In addition, it is specifically designed for data from non-model organisms. The gNOMO pipeline is freely available under the Apache 2.0 open-source license and can be downloaded from https://gitlab.com/rki_bioinformatics/gnomo .

show abstract

MetaMeta: integrating metagenome analysis tools to improve taxonomic profiling

Cited by 42 publications

References 33 publications

Enhancing the Resolution of Rumen Microbial Classification from Metatranscriptomic Data Using Kraken and Mothur

Enhancing the Resolution of Rumen Microbial Classification from Metatranscriptomic Data Using Kraken and Mothur

Mesophilic Sporeformers Identified in Whey Powder by Using Shotgun Metagenomic Sequencing

gNOMO: a multi-omics pipeline for integrated host and microbiome analysis of non-model organisms

Contact Info

Product

Resources

About