The availability and utility of genome-scale metabolic reconstructions have exploded since the first genome-scale reconstruction was published a decade ago. Reconstructions have now been built for a wide variety of organisms, and have been used toward five major ends: (1) contextualization of high-throughput data, (2) guidance of metabolic engineering, (3) directing hypothesis-driven discovery, (4) interrogation of multi-species relationships, and (5) network property discovery. In this review, we examine the many uses and future directions of genome-scale metabolic reconstructions, and we highlight trends and opportunities in the field that will make the greatest impact on many fields of biology. Molecular Systems Biology 5: 320; published online 3 November 2009; doi:10.1038/msb.2009 Subject Categories: metabolic and regulatory networks; simulation and data analysis
A cornerstone of biotechnology is the use of microorganisms for the efficient production of chemicals and the elimination of harmful waste. Pseudomonas putida is an archetype of such microbes due to its metabolic versatility, stress resistance, amenability to genetic modifications, and vast potential for environmental and industrial applications. To address both the elucidation of the metabolic wiring in P. putida and its uses in biocatalysis, in particular for the production of non-growth-related biochemicals, we developed and present here a genome-scale constraint-based model of the metabolism of P. putida KT2440. Network reconstruction and flux balance analysis (FBA) enabled definition of the structure of the metabolic network, identification of knowledge gaps, and pin-pointing of essential metabolic functions, facilitating thereby the refinement of gene annotations. FBA and flux variability analysis were used to analyze the properties, potential, and limits of the model. These analyses allowed identification, under various conditions, of key features of metabolism such as growth yield, resource distribution, network robustness, and gene essentiality. The model was validated with data from continuous cell cultures, high-throughput phenotyping data, 13C-measurement of internal flux distributions, and specifically generated knock-out mutants. Auxotrophy was correctly predicted in 75% of the cases. These systematic analyses revealed that the metabolic network structure is the main factor determining the accuracy of predictions, whereas biomass composition has negligible influence. Finally, we drew on the model to devise metabolic engineering strategies to improve production of polyhydroxyalkanoates, a class of biotechnologically useful compounds whose synthesis is not coupled to cell survival. The solidly validated model yields valuable insights into genotype–phenotype relationships and provides a sound framework to explore this versatile bacterium and to capitalize on its vast biotechnological potential.
Pseudomonas aeruginosa is a major life-threatening opportunistic pathogen that commonly infects immunocompromised patients. This bacterium owes its success as a pathogen largely to its metabolic versatility and flexibility. A thorough understanding of P. aeruginosa's metabolism is thus pivotal for the design of effective intervention strategies. Here we aim to provide, through systems analysis, a basis for the characterization of the genome-scale properties of this pathogen's versatile metabolic network. To this end, we reconstructed a genome-scale metabolic network of Pseudomonas aeruginosa PAO1. This reconstruction accounts for 1,056 genes (19% of the genome), 1,030 proteins, and 883 reactions. Flux balance analysis was used to identify key features of P. aeruginosa metabolism, such as growth yield, under defined conditions and with defined knowledge gaps within the network. BIOLOG substrate oxidation data were used in model expansion, and a genome-scale transposon knockout set was compared against in silico knockout predictions to validate the model. Ultimately, this genome-scale model provides a basic modeling framework with which to explore the metabolism of P. aeruginosa in the context of its environmental and genetic constraints, thereby contributing to a more thorough understanding of the genotype-phenotype relationships in this resourceful and dangerous pathogen.With sequenced genomes now routinely being made available to the public, detailed annotations and various publicly available genomic resources have enabled the formation of genome-scale models of metabolism for a wide variety of organisms (12,21,26,42,63,65). A wealth of data from wellcontrolled experiments, coupled with advancements in methods for computational network analysis, have allowed these models to aid interrogation of metabolic behavior. In addition, an iterative process to model development-cycles of in silico model predictions, experimental (i.e., wet lab) validation, and subsequent model refinement-has enabled the development of methods that have contributed to biological discovery, such as in determination of likely drug targets in Mycobacterium tuberculosis (3,26), metabolic engineering of cells for production of valuable compounds (5, 32, 34), and development of novel frameworks for contextualizing high-throughput "-omics" data sets (15,24,64).Pseudomonas aeruginosa is a ubiquitous gram-negative bacterium that is capable of surviving in a broad range of natural
In the past decade, over 50 genome-scale metabolic reconstructions have been built for a variety of single- and multi- cellular organisms. These reconstructions have enabled a host of computational methods to be leveraged for systems-analysis of metabolism, leading to greater understanding of observed phenotypes. These methods have been sparsely applied to comparisons between multiple organisms, however, due mainly to the existence of differences between reconstructions that are inherited from the respective reconstruction processes of the organisms to be compared. To circumvent this obstacle, we developed a novel process, termed metabolic network reconciliation, whereby non-biological differences are removed from genome-scale reconstructions while keeping the reconstructions as true as possible to the underlying biological data on which they are based. This process was applied to two organisms of great importance to disease and biotechnological applications, Pseudomonas aeruginosa and Pseudomonas putida, respectively. The result is a pair of revised genome-scale reconstructions for these organisms that can be analyzed at a systems level with confidence that differences are indicative of true biological differences (to the degree that is currently known), rather than artifacts of the reconstruction process. The reconstructions were re-validated with various experimental data after reconciliation. With the reconciled and validated reconstructions, we performed a genome-wide comparison of metabolic flexibility between P. aeruginosa and P. putida that generated significant new insight into the underlying biology of these important organisms. Through this work, we provide a novel methodology for reconciling models, present new genome-scale reconstructions of P. aeruginosa and P. putida that can be directly compared at a network level, and perform a network-wide comparison of the two species. These reconstructions provide fresh insights into the metabolic similarities and differences between these important Pseudomonads, and pave the way towards full comparative analysis of genome-scale metabolic reconstructions of multiple species.
Background: Optimization theory has been applied to complex biological systems to interrogate network properties and develop and refine metabolic engineering strategies. For example, methods are emerging to engineer cells to optimally produce byproducts of commercial value, such as bioethanol, as well as molecular compounds for disease therapy. Flux balance analysis (FBA) is an optimization framework that aids in this interrogation by generating predictions of optimal flux distributions in cellular networks. Critical features of FBA are the definition of a biologically relevant objective function (e.g., maximizing the rate of synthesis of biomass, a unit of measurement of cellular growth) and the subsequent application of linear programming (LP) to identify fluxes through a reaction network. Despite the success of FBA, a central remaining challenge is the definition of a network objective with biological meaning.
Culturing microorganisms is a critical step in understanding and utilizing microbial life. Here we map the landscape of existing culture media by extracting natural-language media recipes into a Known Media Database (KOMODO), which includes >18,000 strain–media combinations, >3300 media variants and compound concentrations (the entire collection of the Leibniz Institute DSMZ repository). Using KOMODO, we show that although media are usually tuned for individual strains using biologically common salts, trace metals and vitamins/cofactors are the most differentiating components between defined media of strains within a genus. We leverage KOMODO to predict new organism–media pairings using a transitivity property (74% growth in new in vitro experiments) and a phylogeny-based collaborative filtering tool (83% growth in new in vitro experiments and stronger growth on predicted well-scored versus poorly scored media). These resources are integrated into a web-based platform that predicts media given an organism's 16S rDNA sequence, facilitating future cultivation efforts.
System-level modeling is beginning to be used to decipher high throughput data in the context of disease. In this study, we present an integration of expression microarray data with a genome-scale metabolic reconstruction of Pseudomonas aeruginosa in the context of a chronic cystic fibrosis (CF) lung infection. A genome-scale reconstruction of P. aeruginosa metabolism was tailored to represent the metabolic states of two clonally related lineages of P. aeruginosa isolated from the lungs of a CF patient at different points over a 44-month time course, giving a mechanistic glimpse into how the bacterial metabolism adapts over time in the CF lung. Metabolic capacities were analyzed to determine how tradeoffs between growth and other important cellular processes shift during disease progression. Genes whose knockouts were either significantly growth reducing or lethal in silico were also identified for each time point and serve as hypotheses for future drug targeting efforts specific to the stages of disease progression.The last decade has witnessed an explosion in both the quantity and the pace of biological discovery. High throughput methods have been developed and leveraged at an expanding rate, with the accumulation of high throughput data outstripping the capacity for analysis using conventional methods (16,21). To face these new challenges, systems-focused methods have come to the forefront of biological discovery, enabling a synergistic merging of network analysis with the existing reductionist paradigms that have fueled biology for the past halfcentury (25,40).One of the most pressing applications of systems analysis is unraveling the myriad factors that combine to form human disease. This ambitious goal has motivated a surge of interest in the collection and analysis of microarray data, which has emerged as a dominant technology for gathering genome-scale data due to its relatively low cost, ubiquity, ease, and increasingly high resolution and reproducibility (42). In particular, microarrays for gene expression profiling have been used in longitudinal studies of disease, as it enables a glimpse at the internal changes cells undergo as a disease progresses. While many such studies have been published, very little modeldriven analysis has been leveraged toward interpreting these data at the network level. There is a tremendous need for this next level of analysis, as a network approach promises a deeper mechanistic understanding of whole-cell phenotypes that will be crucial for determining better therapies in the future.With the increase in life span of cystic fibrosis (CF) patients over the last several decades, bacterial infections of the thickened mucus of the lung have become the primary disease burden that must be managed in these patients today (23). The peculiarities of the CF lung mucosal environment render it a ripe environment for growth of Pseudomonas aeruginosa in particular, a notorious opportunistic pathogen that chronically infects the lungs of nearly every CF patient by an early age (32). Due ...
Drug side effects levy a massive cost on society through drug failures, morbidity, and mortality cases every year, and their early detection is critically important. Here, we describe the array of model-based phenotype predictors (AMPP), an approach that leverages medical informatics resources and a human genome-scale metabolic model (GSMM) to predict drug side effects. AMPP is substantially predictive (AUC > 0.7) for >70 drug side effects, including very serious ones such as interstitial nephritis and extrapyramidal disorders. We evaluate AMPP's predictive signal through cross-validation, comparison across multiple versions of a side effects database, and co-occurrence analysis of drug side effect associations in scientific abstracts (hypergeometric p value = 2.2e-40). AMPP outperforms a previous biochemical structure-based method in predicting metabolically based side effects (aggregate AUC = 0.65 versus 0.59). Importantly, AMPP enables the identification of key metabolic reactions and biomarkers that are predictive of specific side effects. Taken together, this work lays a foundation for future detection of metabolically grounded side effects during early stages of drug development.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.