Independent component analysis (ICA) of bacterial transcriptomes has emerged as a powerful tool for obtaining co-regulated, independently-modulated gene sets (iModulons), inferring their activities across a range of conditions, and enabling their association to known genetic regulators. By grouping and analyzing genes based on observations from big data alone, iModulons can provide a novel perspective into how the composition of the transcriptome adapts to environmental conditions. Here, we present iModulonDB (imodulondb.org), a knowledgebase of prokaryotic transcriptional regulation computed from high-quality transcriptomic datasets using ICA. Users select an organism from the home page and then search or browse the curated iModulons that make up its transcriptome. Each iModulon and gene has its own interactive dashboard, featuring plots and tables with clickable, hoverable, and downloadable features. This site enhances research by presenting scientists of all backgrounds with co-expressed gene sets and their activity levels, which lead to improved understanding of regulator-gene relationships, discovery of transcription factors, and the elucidation of unexpected relationships between conditions and genetic regulatory activity. The current release of iModulonDB covers three organisms (Escherichia coli, Staphylococcus aureus and Bacillus subtilis) with 204 iModulons, and can be expanded to cover many additional organisms.
S. aureus is classified as a serious threat pathogen and is a priority that guides the discovery and development of new antibiotics. Despite growing knowledge of S. aureus metabolic capabilities, our understanding of its systems-level responses to different media types remains incomplete. Here, we develop a manually reconstructed genome-scale model (GEM-PRO) of metabolism with 3D protein structures for S. aureus USA300 str. JE2 containing 854 genes, 1,440 reactions, 1,327 metabolites and 673 3-dimensional protein structures. Computations were in 85% agreement with gene essentiality data from random barcode transposon site sequencing (RB-TnSeq) and 68% agreement with experimental physiological data. Comparisons of computational predictions with experimental observations highlight: 1) cases of non-essential biomass precursors; 2) metabolic genes subject to transcriptional regulation involved in Staphyloxanthin biosynthesis; 3) the essentiality of purine and amino acid biosynthesis in synthetic physiological media; and 4) a switch to aerobic fermentation upon exposure to extracellular glucose elucidated as a result of integrating time-course of quantitative exo-metabolomics data. An up-to-date GEM-PRO thus serves as a knowledge-based platform to elucidate S. aureus’ metabolic response to its environment.
We are firmly in the era of biological big data. Millions of omics datasets are publicly accessible and can be employed to support scientific research or build a holistic view of an organism. Here, we introduce a workflow that converts all public gene expression data for a microbe into a dynamic representation of the organism's transcriptional regulatory network. This five-step process walks researchers through the mining, processing, curation, analysis, and characterization of all available expression data, using Bacillus subtilis as an example. The resulting reconstruction of the B. subtilis regulatory network can be leveraged to predict new regulons and analyze datasets in the context of all published data. The results are hosted at https://imodulondb.org/, and additional analyses can be performed using the PyModulon Python package. As the number of publicly available datasets increases, this pipeline will be applicable to a wide range of microbial pathogens and cell factories.
BackgroundThe efficacy of antibiotics against M. tuberculosis has been shown to be influenced by experimental media conditions. Investigations of M. tuberculosis growth in physiological conditions have described an environment that is different from common in vitro media. Thus, elucidating the interplay between available nutrient sources and antibiotic efficacy has clear medical relevance. While genome-scale reconstructions of M. tuberculosis have enabled the ability to interrogate media differences for the past 10 years, recent reconstructions have diverged from each other without standardization. A unified reconstruction of M. tuberculosis H37Rv would elucidate the impact of different nutrient conditions on antibiotic efficacy and provide new insights for therapeutic intervention.ResultsWe present a new genome-scale model of M. tuberculosis H37Rv, named iEK1011, that unifies and updates previous M. tuberculosis H37Rv genome-scale reconstructions. We functionally assess iEK1011 against previous models and show that the model increases correct gene essentiality predictions on two different experimental datasets by 6% (53% to 60%) and 18% (60% to 71%), respectively. We compared simulations between in vitro and approximated in vivo media conditions to examine the predictive capabilities of iEK1011. The simulated differences recapitulated literature defined characteristics in the rewiring of TCA metabolism including succinate secretion, gluconeogenesis, and activation of both the glyoxylate shunt and the methylcitrate cycle. To assist efforts to elucidate mechanisms of antibiotic resistance development, we curated 16 metabolic genes related to antimicrobial resistance and approximated evolutionary drivers of resistance. Comparing simulations of these antibiotic resistance features between in vivo and in vitro media highlighted condition-dependent differences that may influence the efficacy of antibiotics.ConclusionsiEK1011 provides a computational knowledge base for exploring the impact of different environmental conditions on the metabolic state of M. tuberculosis H37Rv. As more experimental data and knowledge of M. tuberculosis H37Rv become available, a unified and standardized M. tuberculosis model will prove to be a valuable resource to the research community studying the systems biology of M. tuberculosis.Electronic supplementary materialThe online version of this article (10.1186/s12918-018-0557-y) contains supplementary material, which is available to authorized users.
Evolution fine-tunes biological pathways to achieve a robust cellular physiology. Two and a half billion years ago, rapidly rising levels of oxygen as a byproduct of blooming cyanobacterial photosynthesis resulted in a redox upshift in microbial energetics. The appearance of higher-redox-potential respiratory quinone, ubiquinone (UQ), is believed to be an adaptive response to this environmental transition. However, the majority of bacterial species are still dependent on the ancient respiratory quinone, naphthoquinone (NQ). Gammaproteobacteria can biosynthesize both of these respiratory quinones, where UQ has been associated with aerobic lifestyle and NQ with anaerobic lifestyle. We engineered an obligate NQ-dependent γ-proteobacterium, Escherichia coli ΔubiC, and performed adaptive laboratory evolution to understand the selection against the use of NQ in an oxic environment and also the adaptation required to support the NQ-driven aerobic electron transport chain. A comparative systems-level analysis of pre- and postevolved NQ-dependent strains revealed a clear shift from fermentative to oxidative metabolism enabled by higher periplasmic superoxide defense. This metabolic shift was driven by the concerted activity of 3 transcriptional regulators (PdhR, RpoS, and Fur). Analysis of these findings using a genome-scale model suggested that resource allocation to reactive oxygen species (ROS) mitigation results in lower growth rates. These results provide a direct elucidation of a resource allocation tradeoff between growth rate and ROS mitigation costs associated with NQ usage under oxygen-replete condition.
Mycobacterium tuberculosis H37Rv is one of the world's most impactful pathogens, and a large part of the success of the organism relies on the differential expression of its genes to adapt to its environment. The expression of the organism's genes is driven primarily by its transcriptional regulatory network, and most research on the TRN focuses on identifying and quantifying clusters of coregulated genes known as regulons.
The ability of Staphylococcus aureus to infect many different tissue sites is enabled, in part, by its Transcriptional Regulatory Network (TRN) that coordinates its gene expression to respond to different environments. We elucidated the organization and activity of this TRN by applying Independent Component Analysis (ICA) to a compendium of 108 RNAseq expression profiles from two S. aureus clinical strains (TCH1516 and LAC). ICA decomposed the S. aureus transcriptome into 29 independently modulated sets of genes (i-modulons) that revealed (1) high confidence associations between 21 i-modulons and known regulators; (2) an association between an i-modulon and σS, whose regulatory role was previously undefined; (3) the regulatory organization of 65 virulence factors in the form of three i-modulons associated with AgrR, SaeR and Vim-3, (4) the roles of three key transcription factors (codY, Fur and ccpA) in coordinating the metabolic and regulatory networks; and (5) a low dimensional representation, involving the function of few transcription factors, of changes in gene expression between two laboratory media (RPMI, CAMHB) and two physiological media (blood and serum). This representation of the TRN covers 842 genes representing 76% of the variance in gene expression that provides a quantitative reconstruction of transcriptional modules in S. aureus, and a platform enabling its full elucidation. Significance StatementStaphylococcus aureus infections impose an immense burden on the healthcare system. To establish a successful infection in a hostile host environment, S. aureus must coordinate its gene expression to respond to a wide array of challenges. This balancing act is largely orchestrated by the Transcriptional Regulatory Network (TRN). Here, we present a model of 29 independently modulated sets of genes that form the basis for a segment of the TRN in clinical USA300 strains of S. aureus . Using this model, we demonstrate the concerted role of various cellular systems (e.g. metabolism, virulence and stress response) underlying key physiological responses, including response during blood infection. framework to reevaluate the RNA-seq data accelerates discovery by (1) quantitatively formulating TRN organization, (2) simplifying complex changes across hundreds of genes into a few changes in regulator activities, (3) allowing for analysis of interactions among different regulators, (4) connecting transcriptional regulation to metabolism, and (5) defining previously unknown regulons. Main
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.