BackgroundIn spite of its great promise, metabolomics has proven difficult to execute in an untargeted and generalizable manner. Liquid chromatography–mass spectrometry (LC–MS) has made it possible to gather data on thousands of cellular metabolites. However, matching metabolites to their spectral features continues to be a bottleneck, meaning that much of the collected information remains uninterpreted and that new metabolites are seldom discovered in untargeted studies. These challenges require new approaches that consider compounds beyond those available in curated biochemistry databases.DescriptionHere we present Metabolic In silico Network Expansions (MINEs), an extension of known metabolite databases to include molecules that have not been observed, but are likely to occur based on known metabolites and common biochemical reactions. We utilize an algorithm called the Biochemical Network Integrated Computational Explorer (BNICE) and expert-curated reaction rules based on the Enzyme Commission classification system to propose the novel chemical structures and reactions that comprise MINE databases. Starting from the Kyoto Encyclopedia of Genes and Genomes (KEGG) COMPOUND database, the MINE contains over 571,000 compounds, of which 93% are not present in the PubChem database. However, these MINE compounds have on average higher structural similarity to natural products than compounds from KEGG or PubChem. MINE databases were able to propose annotations for 98.6% of a set of 667 MassBank spectra, 14% more than KEGG alone and equivalent to PubChem while returning far fewer candidates per spectra than PubChem (46 vs. 1715 median candidates). Application of MINEs to LC–MS accurate mass data enabled the identity of an unknown peak to be confidently predicted.ConclusionsMINE databases are freely accessible for non-commercial use via user-friendly web-tools at http://minedatabase.mcs.anl.gov and developer-friendly APIs. MINEs improve metabolomics peak identification as compared to general chemical databases whose results include irrelevant synthetic compounds. Furthermore, MINEs complement and expand on previous in silico generated compound databases that focus on human metabolism. We are actively developing the database; future versions of this resource will incorporate transformation rules for spontaneous chemical reactions and more advanced filtering and prioritization of candidate structures.Graphical abstractMINE database construction and access methods. The process of constructing a MINE database from the curated source databases is depicted on the left. The methods for accessing the database are shown on the right.Electronic supplementary materialThe online version of this article (doi:10.1186/s13321-015-0087-1) contains supplementary material, which is available to authorized users.
BackgroundIt is now recognized that enzymatic or chemical side-reactions can convert normal metabolites to useless or toxic ones and that a suite of enzymes exists to mitigate such metabolite damage. Examples are the reactive imine/enamine intermediates produced by threonine dehydratase, which damage the pyridoxal 5'-phosphate cofactor of various enzymes causing inactivation. This damage is pre-empted by RidA proteins, which hydrolyze the imines before they do harm. RidA proteins belong to the YjgF/YER057c/UK114 family (here renamed the Rid family). Most other members of this diverse and ubiquitous family lack defined functions.ResultsPhylogenetic analysis divided the Rid family into a widely distributed, apparently archetypal RidA subfamily and seven other subfamilies (Rid1 to Rid7) that are largely confined to bacteria and often co-occur in the same organism with RidA and each other. The Rid1 to Rid3 subfamilies, but not the Rid4 to Rid7 subfamilies, have a conserved arginine residue that, in RidA proteins, is essential for imine-hydrolyzing activity. Analysis of the chromosomal context of bacterial RidA genes revealed clustering with genes for threonine dehydratase and other pyridoxal 5'-phosphate-dependent enzymes, which fits with the known RidA imine hydrolase activity. Clustering was also evident between Rid family genes and genes specifying FAD-dependent amine oxidases or enzymes of carbamoyl phosphate metabolism. Biochemical assays showed that Salmonella enterica RidA and Rid2, but not Rid7, can hydrolyze imines generated by amino acid oxidase. Genetic tests indicated that carbamoyl phosphate overproduction is toxic to S. enterica cells lacking RidA, and metabolomic profiling of Rid knockout strains showed ten-fold accumulation of the carbamoyl phosphate-related metabolite dihydroorotate.ConclusionsLike the archetypal RidA subfamily, the Rid2, and probably the Rid1 and Rid3 subfamilies, have imine-hydrolyzing activity and can pre-empt damage from imines formed by amine oxidases as well as by pyridoxal 5'-phosphate enzymes. The RidA subfamily has an additional damage pre-emption role in carbamoyl phosphate metabolism that has yet to be biochemically defined. Finally, the Rid4 to Rid7 subfamilies appear not to hydrolyze imines and thus remain mysterious.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-015-1584-3) contains supplementary material, which is available to authorized users.
NADH and NADPH undergo spontaneous and enzymatic reactions that produce R and S forms of NAD(P)H hydrates [NAD(P)HX], which are not electron donors and inhibit various dehydrogenases. In bacteria, yeast (Saccharomyces cerevisiae), and mammals, these hydrates are repaired by the tandem action of an ADP-or ATP-dependent dehydratase that converts (S)-NAD(P)HX to NAD(P)H and an epimerase that facilitates interconversion of the R and S forms. Plants have homologs of both enzymes, the epimerase homolog being fused to the vitamin B 6 salvage enzyme pyridoxine 59-phosphate oxidase. Recombinant maize (Zea mays) and Arabidopsis (Arabidopsis thaliana) NAD(P)HX dehydratases (GRMZM5G840928, At5g19150) were able to reconvert (S)-NAD(P)HX to NAD(P)H in an ATP-dependent manner. Recombinant maize and Arabidopsis epimerases (GRMZM2G061988, At5g49970) rapidly interconverted (R)-and (S)-NAD(P)HX, as did a truncated form of the Arabidopsis epimerase lacking the pyridoxine 59-phosphate oxidase domain. All plant NAD(P)HX dehydratase and epimerase sequences examined had predicted organellar targeting peptides with a potential second start codon whose use would eliminate the targeting peptide. In vitro transcription/translation assays confirmed that both start sites were used. Dual import assays with purified pea (Pisum sativum) chloroplasts and mitochondria, and subcellular localization of GFP fusion constructs in tobacco (Nicotiana tabacum) suspension cells, indicated mitochondrial, plastidial, and cytosolic localization of the Arabidopsis epimerase and dehydratase. Ablation of the Arabidopsis dehydratase gene raised seedling levels of all NADHX forms by 20-to 40-fold, and levels of one NADPHX form by 10-to 30-fold. We conclude that plants have a canonical two-enzyme NAD(P)HX repair system that is directed to three subcellular compartments via the use of alternative translation start sites.
5-Oxoproline (OP) is well-known as an enzymatic intermediate in the eukaryotic γ-glutamyl cycle, but it is also an unavoidable damage product formed spontaneously from glutamine and other sources. Eukaryotes metabolize OP via an ATP-dependent 5-oxoprolinase; most prokaryotes lack homologs of this enzyme (and the γ-glutamyl cycle) but are predicted to have some way to dispose of OP if its spontaneous formation is significant. Comparative analysis of prokaryotic genomes showed that the gene encoding pyroglutamyl peptidase, which removes N-terminal OP residues, clusters in diverse genomes with genes specifying homologs of a fungal lactamase (renamedrokaryotic 5-ooprolinase ,) and homologs of allophanate hydrolase subunits (renamed and). Inactivation of ,, or genes slowed growth, caused OP accumulation in cells and medium, and prevented use of OP as a nitrogen source. Assays of cell lysates showed that ATP-dependent 5-oxoprolinase activity disappeared when, , or was inactivated. 5-Oxoprolinase activity could be reconstituted by mixing recombinant PxpA, PxpB, and PxpC proteins. In addition, overexpressing genes in increased 5-oxoprolinase activity in lysates ≥1700-fold. This work shows that OP is a major universal metabolite damage product and that OP disposal systems are common in all domains of life. Furthermore, it illustrates how easily metabolite damage and damage-control systems can be overlooked, even for central metabolites in model organisms.
RidA (for Reactive Intermediate Deaminase A) proteins are ubiquitous, yet their function in eukaryotes is unclear. It is known that deleting Salmonella enterica ridA causes Ser sensitivity and that S. enterica RidA and its homologs from other organisms hydrolyze the enamine/imine intermediates that Thr dehydratase forms from Ser or Thr. In S. enterica, the Ser-derived enamine/imine inactivates a branched-chain aminotransferase; RidA prevents this damage. Arabidopsis thaliana and maize (Zea mays) have a RidA homolog that is predicted to be plastidial. Expression of either homolog complemented the Ser sensitivity of the S. enterica ridA mutant. The purified proteins hydrolyzed the enamines/imines formed by Thr dehydratase from Ser or Thr and protected the Arabidopsis plastidial branched-chain aminotransferase BCAT3 from inactivation by the Ser-derived enamine/imine. In vitro chloroplast import assays and in vivo localization of green fluorescent protein fusions showed that Arabidopsis RidA and Thr dehydratase are chloroplast targeted. Disrupting Arabidopsis RidA reduced root growth and raised the root and shoot levels of the branched-chain amino acid biosynthesis intermediate 2-oxobutanoate; Ser treatment exacerbated these effects in roots. Supplying Ile reversed the root growth defect. These results indicate that plastidial RidA proteins can preempt damage to BCAT3 and Ile biosynthesis by hydrolyzing the Ser-derived enamine/imine product of Thr dehydratase.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.