BackgroundMetagenomics is the study of microbial communities by sequencing of genetic material directly from environmental or clinical samples. The genes present in the metagenomes are quantified by annotating and counting the generated DNA fragments. Identification of differentially abundant genes between metagenomes can provide important information about differences in community structure, diversity and biological function. Metagenomic data is however high-dimensional, contain high levels of biological and technical noise and have typically few biological replicates. The statistical analysis is therefore challenging and many approaches have been suggested to date.ResultsIn this article we perform a comprehensive evaluation of 14 methods for identification of differentially abundant genes between metagenomes. The methods are compared based on the power to detect differentially abundant genes and their ability to correctly estimate the type I error rate and the false discovery rate. We show that sample size, effect size, and gene abundance greatly affect the performance of all methods. Several of the methods also show non-optimal model assumptions and biased false discovery rate estimates, which can result in too large numbers of false positives. We also demonstrate that the performance of several of the methods differs substantially between metagenomic data sequenced by different technologies.ConclusionsTwo methods, primarily designed for the analysis of RNA sequencing data (edgeR and DESeq2) together with a generalized linear model based on an overdispersed Poisson distribution were found to have best overall performance. The results presented in this study may serve as a guide for selecting suitable statistical methods for identification of differentially abundant genes in metagenomes.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-2386-y) contains supplementary material, which is available to authorized users.
BackgroundThe genome-scale metabolic model of Saccharomyces cerevisiae, first presented in 2003, was the first genome-scale network reconstruction for a eukaryotic organism. Since then continuous efforts have been made in order to improve and expand the yeast metabolic network.ResultsHere we present iTO977, a comprehensive genome-scale metabolic model that contains more reactions, metabolites and genes than previous models. The model was constructed based on two earlier reconstructions, namely iIN800 and the consensus network, and then improved and expanded using gap-filling methods and by introducing new reactions and pathways based on studies of the literature and databases. The model was shown to perform well both for growth simulations in different media and gene essentiality analysis for single and double knock-outs. Further, the model was used as a scaffold for integrating transcriptomics, and flux data from four different conditions in order to identify transcriptionally controlled reactions, i.e. reactions that change both in flux and transcription between the compared conditions.ConclusionWe present a new yeast model that represents a comprehensive up-to-date collection of knowledge on yeast metabolism. The model was used for simulating the yeast metabolism under four different growth conditions and experimental data from these four conditions was integrated to the model. The model together with experimental data is a useful tool to identify condition-dependent changes of metabolism between different environmental conditions.
BackgroundMetallo-β-lactamases are bacterial enzymes that provide resistance to carbapenems, the most potent class of antibiotics. These enzymes are commonly encoded on mobile genetic elements, which, together with their broad substrate spectrum and lack of clinically useful inhibitors, make them a particularly problematic class of antibiotic resistance determinants. We hypothesized that there is a large and unexplored reservoir of unknown metallo-β-lactamases, some of which may spread to pathogens, thereby threatening public health. The aim of this study was to identify novel metallo-β-lactamases of class B1, the most clinically important subclass of these enzymes.ResultsBased on a new computational method using an optimized hidden Markov model, we analyzed over 10,000 bacterial genomes and plasmids together with more than 5 terabases of metagenomic data to identify novel metallo-β-lactamase genes. In total, 76 novel genes were predicted, forming 59 previously undescribed metallo-β-lactamase gene families. The ability to hydrolyze imipenem in an Escherichia coli host was experimentally confirmed for 18 of the 21 tested genes. Two of the novel B1 metallo-β-lactamase genes contained atypical zinc-binding motifs in their active sites, which were previously undescribed for metallo-β-lactamases. Phylogenetic analysis showed that B1 metallo-β-lactamases could be divided into five major groups based on their evolutionary origin. Our results also show that, except for one, all of the previously characterized mobile B1 β-lactamases are likely to have originated from chromosomal genes present in Shewanella spp. and other Proteobacterial species.ConclusionsThis study more than doubles the number of known B1 metallo-β-lactamases. The findings have further elucidated the diversity and evolutionary history of this important class of antibiotic resistance genes and prepare us for some of the challenges that may be faced in clinics in the future.Electronic supplementary materialThe online version of this article (10.1186/s40168-017-0353-8) contains supplementary material, which is available to authorized users.
The protein secretory machinery in Eukarya is involved in post-translational modification (PTMs) and sorting of the secretory and many transmembrane proteins. While the secretory machinery has been well-studied using classic reductionist approaches, a holistic view of its complex nature is lacking. Here, we present the first genome-scale model for the yeast secretory machinery which captures the knowledge generated through more than 50 years of research. The model is based on the concept of a Protein Specific Information Matrix (PSIM: characterized by seven PTMs features). An algorithm was developed which mimics secretory machinery and assigns each secretory protein to a particular secretory class that determines the set of PTMs and transport steps specific to each protein. Protein abundances were integrated with the model in order to gain system level estimation of the metabolic demands associated with the processing of each specific protein as well as a quantitative estimation of the activity of each component of the secretory machinery.
Pharmaceuticals are designed to interact with specific molecular targets in humans and these targets generally have orthologs in other species. This provides opportunities for the drug discovery community to use alternative model species for drug development. It also means, however, there is potential for mode of action related effects in non-target wildlife species as many pharmaceuticals reach the environment through patient use and manufacturing wastes. Acquiring insight in drug target ortholog predictions across species and taxonomic groups has proven difficult because of the lack of an optimal strategy and because necessary information is spread across multiple and diverse sources and platforms. We introduce a new research platform tool, ECOdrug, that reliably connects drugs to their protein targets across divergent species. It harmonizes ortholog predictions from multiple sources via a simple user interface underpinning critical applications for a wide range of studies in pharmacology, ecotoxicology and comparative evolutionary biology. ECOdrug can be used to identify species with drug targets and identify drugs that interact with those targets. As such, it can be applied to support intelligent targeted drug safety testing by ensuring appropriate and relevant species are selected in ecological risk assessments. ECOdrug is freely accessible and available at: http://www.ecodrug.org.
Background Environmental and commensal bacteria maintain a diverse and largely unknown collection of antibiotic resistance genes (ARGs) that, over time, may be mobilized and transferred to pathogens. Metagenomics enables cultivation-independent characterization of bacterial communities but the resulting data is noisy and highly fragmented, severely hampering the identification of previously undescribed ARGs. We have therefore developed fARGene, a method for identification and reconstruction of ARGs directly from shotgun metagenomic data. Results fARGene uses optimized gene models and can therefore with high accuracy identify previously uncharacterized resistance genes, even if their sequence similarity to known ARGs is low. By performing the analysis directly on the metagenomic fragments, fARGene also circumvents the need for a high-quality assembly. To demonstrate the applicability of fARGene, we reconstructed β-lactamases from five billion metagenomic reads, resulting in 221 ARGs, of which 58 were previously not reported. Based on 38 ARGs reconstructed by fARGene, experimental verification showed that 81% provided a resistance phenotype in Escherichia coli . Compared to other methods for detecting ARGs in metagenomic data, fARGene has superior sensitivity and the ability to reconstruct previously unknown genes directly from the sequence reads. Conclusions We conclude that fARGene provides an efficient and reliable way to explore the unknown resistome in bacterial communities. The method is applicable to any type of ARGs and is freely available via GitHub under the MIT license. Electronic supplementary material The online version of this article (10.1186/s40168-019-0670-1) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.