The potential of the diverse chemistries present in natural products (NP) for biotechnology and medicine remains untapped because NP databases are not searchable with raw data and the NP community has no way to share data other than in published papers. Although mass spectrometry techniques are well-suited to high-throughput characterization of natural products, there is a pressing need for an infrastructure to enable sharing and curation of data. We present Global Natural Products Social molecular networking (GNPS, http://gnps.ucsd.edu), an open-access knowledge base for community wide organization and sharing of raw, processed or identified tandem mass (MS/MS) spectrometry data. In GNPS crowdsourced curation of freely available community-wide reference MS libraries will underpin improved annotations. Data-driven social-networking should facilitate identification of spectra and foster collaborations. We also introduce the concept of ‘living data’ through continuous reanalysis of deposited data.
After hundreds of generations of adaptive evolution at exponential growth, Escherichia coli grows as predicted using flux balance analysis (FBA) on genome-scale metabolic models (GEMs). However, it is not known whether the predicted pathway usage in FBA solutions is consistent with gene and protein expression in the wild-type and evolved strains. Here, we report that 498% of active reactions from FBA optimal growth solutions are supported by transcriptomic and proteomic data. Moreover, when E. coli adapts to growth rate selective pressure, the evolved strains upregulate genes within the optimal growth predictions, and downregulate genes outside of the optimal growth solutions. In addition, bottlenecks from dosage limitations of computationally predicted essential genes are overcome in the evolved strains. We also identify regulatory processes that may contribute to the development of the optimal growth phenotype in the evolved strains, such as the downregulation of known regulons and stringent response suppression. Thus, differential gene and protein expression from wild-type and adaptively evolved strains supports observed growth phenotype changes, and is consistent with GEM-computed optimal growth states.
Bacteria of the order Actinomycetales are one of the most important sources of pharmacologically active and industrially relevant secondary metabolites. Unfortunately, many of them are still recalcitrant to genetic manipulation, which is a bottleneck for systematic metabolic engineering. To facilitate the genetic manipulation of actinomycetes, we developed a highly efficient CRISPR-Cas9 system to delete gene(s) or gene cluster(s), implement precise gene replacements, and reversibly control gene expression in actinomycetes. We demonstrate our system by targeting two genes, actIORF1 (SCO5087) and actVB (SCO5092), from the actinorhodin biosynthetic gene cluster in Streptomyces coelicolor A3(2). Our CRISPR-Cas9 system successfully inactivated the targeted genes. When no templates for homology-directed repair (HDR) were present, the site-specific DNA double-strand breaks (DSBs) introduced by Cas9 were repaired through the error-prone nonhomologous end joining (NHEJ) pathway, resulting in a library of deletions with variable sizes around the targeted sequence. If templates for HDR were provided at the same time, precise deletions of the targeted gene were observed with near 100% frequency. Moreover, we developed a system to efficiently and reversibly control expression of target genes, deemed CRISPRi, based on a catalytically dead variant of Cas9 (dCas9). The CRISPR-Cas9 based system described here comprises a powerful and broadly applicable set of tools to manipulate actinomycetal genomes.
Genome-scale models (GEMs) of metabolism were constructed for 55 fully sequenced Escherichia coli and Shigella strains. The GEMs enable a systems approach to characterizing the pan and core metabolic capabilities of the E. coli species. The majority of pan metabolic content was found to consist of alternate catabolic pathways for unique nutrient sources. The GEMs were then used to systematically analyze growth capabilities in more than 650 different growth-supporting environments. The results show that unique strain-specific metabolic capabilities correspond to pathotypes and environmental niches. Twelve of the GEMs were used to predict growth on six differentiating nutrients, and the predictions were found to agree with 80% of experimental outcomes. Additionally, GEMs were used to predict strain-specific auxotrophies. Twelve of the strains modeled were predicted to be auxotrophic for vitamins niacin (vitamin B 3 ), thiamin (vitamin B 1 ), or folate (vitamin B 9 ). Six of the strains modeled have lost biosynthetic pathways for essential amino acids methionine, tryptophan, or leucine. Genome-scale analysis of multiple strains of a species can thus be used to define the metabolic essence of a microbial species and delineate growth differences that shed light on the adaptation process to a particular microenvironment.systems biology | mathematical modeling | core and pan genome
The ability to correlate the production of specialized metabolites to the genetic capacity of the organism that produces such molecules has become an invaluable tool in aiding the discovery of biotechnologically applicable molecules. Here, we accomplish this task by matching molecular families with gene cluster families, making these correlations to 60 microbes at one time instead of connecting one molecule to one organism at a time, such as how it is traditionally done. We can correlate these families through the use of nanospray desorption electrospray ionization MS/MS, an ambient pressure MS technique, in conjunction with MS/MS networking and peptidogenomics. We matched the molecular families of peptide natural products produced by 42 bacilli and 18 pseudomonads through the generation of amino acid sequence tags from MS/MS data of specific clusters found in the MS/MS network. These sequence tags were then linked to biosynthetic gene clusters in publicly accessible genomes, providing us with the ability to link particular molecules with the genes that produced them. As an example of its use, this approach was applied to two unsequenced Pseudoalteromonas species, leading to the discovery of the gene cluster for a molecular family, the bromoalterochromides, in the previously sequenced strain P. piscicida JCM 20779 T . The approach itself is not limited to 60 related strains, because spectral networking can be readily adopted to look at molecular family-gene cluster families of hundreds or more diverse organisms in one single MS/MS network.MS/MS molecular networking | mass spectrometry | microbial ecology T ens of thousands of sequenced microbial genomes or rough drafts of genomes are available at this time, and this number is predicted to grow into the millions over the next decades. This wealth of sequence data has the potential to be used for the discovery of small bioactive molecules through genome mining (1-6). Genome mining is a process in which small molecules are discovered by predicting what compound will be genetically encoded based on the sequences of biosynthetic gene clusters. However, the process of mining genetically encoded small molecules is not keeping pace with the rate by which genome sequences are being obtained. In general, genome mining is still done one gene cluster at a time and requires many person-years of effort to annotate a single molecule. The time and significant expertise that current genome mining requires also make genome mining very expensive. In light of this extensive effort and cost, alternative approaches to genome mining and annotating specialized metabolites must be developed that not only take advantage of the sequenced resources available and make it efficient to perform genome mining on a more global scale but also enable the molecular analysis of unsequenced organisms. Such methods will then significantly reduce the cost of genome mining by increasing the speed with which molecules are connected to candidate genes and using resources already available. Here, we put fo...
It has been hypothesized that some antibiotic resistance genes (ARGs) found in pathogenic bacteria derive from antibiotic-producing actinobacteria. Here we provide bioinformatic and experimental evidence supporting this hypothesis. We identify genes in proteobacteria, including some pathogens, that appear to be closely related to actinobacterial ARGs known to confer resistance against clinically important antibiotics. Furthermore, we identify two potential examples of recent horizontal transfer of actinobacterial ARGs to proteobacterial pathogens. Based on this bioinformatic evidence, we propose and experimentally test a ‘carry-back' mechanism for the transfer, involving conjugative transfer of a carrier sequence from proteobacteria to actinobacteria, recombination of the carrier sequence with the actinobacterial ARG, followed by natural transformation of proteobacteria with the carrier-sandwiched ARG. Our results support the existence of ancient and, possibly, recent transfers of ARGs from antibiotic-producing actinobacteria to proteobacteria, and provide evidence for a defined mechanism.
BackgroundMetabolic reconstructions (MRs) are common denominators in systems biology and represent biochemical, genetic, and genomic (BiGG) knowledge-bases for target organisms by capturing currently available information in a consistent, structured manner. Salmonella enterica subspecies I serovar Typhimurium is a human pathogen, causes various diseases and its increasing antibiotic resistance poses a public health problem.ResultsHere, we describe a community-driven effort, in which more than 20 experts in S. Typhimurium biology and systems biology collaborated to reconcile and expand the S. Typhimurium BiGG knowledge-base. The consensus MR was obtained starting from two independently developed MRs for S. Typhimurium. Key results of this reconstruction jamboree include i) development and implementation of a community-based workflow for MR annotation and reconciliation; ii) incorporation of thermodynamic information; and iii) use of the consensus MR to identify potential multi-target drug therapy approaches.ConclusionTaken together, with the growing number of parallel MRs a structured, community-driven approach will be necessary to maximize quality while increasing adoption of MRs in experimental design and interpretation.
Bacterial survival requires adaptation to different environmental perturbations such as exposure to antibiotics, changes in temperature or oxygen levels, DNA damage, and alternative nutrient sources. During adaptation, bacteria often develop beneficial mutations that confer increased fitness in the new environment. Adaptation to the loss of a major non-essential gene product that cripples growth, however, has not been studied at the whole-genome level. We investigated the ability of Escherichia coli K-12 MG1655 to overcome the loss of phosphoglucose isomerase (pgi) by adaptively evolving ten replicates of E. coli lacking pgi for 50 days in glucose M9 minimal medium and by characterizing endpoint clones through whole-genome re-sequencing and phenotype profiling. We found that 1) the growth rates for all ten endpoint clones increased approximately 3-fold over the 50-day period; 2) two to five mutations arose during adaptation, most frequently in the NADH/NADPH transhydrogenases udhA and pntAB and in the stress-associated sigma factor rpoS; and 3) despite similar growth rates, at least three distinct endpoint phenotypes developed as defined by different rates of acetate and formate secretion. These results demonstrate that E. coli can adapt to the loss of a major metabolic gene product with only a handful of mutations and that adaptation can result in multiple, alternative phenotypes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.