BackgroundIn spite of its great promise, metabolomics has proven difficult to execute in an untargeted and generalizable manner. Liquid chromatography–mass spectrometry (LC–MS) has made it possible to gather data on thousands of cellular metabolites. However, matching metabolites to their spectral features continues to be a bottleneck, meaning that much of the collected information remains uninterpreted and that new metabolites are seldom discovered in untargeted studies. These challenges require new approaches that consider compounds beyond those available in curated biochemistry databases.DescriptionHere we present Metabolic In silico Network Expansions (MINEs), an extension of known metabolite databases to include molecules that have not been observed, but are likely to occur based on known metabolites and common biochemical reactions. We utilize an algorithm called the Biochemical Network Integrated Computational Explorer (BNICE) and expert-curated reaction rules based on the Enzyme Commission classification system to propose the novel chemical structures and reactions that comprise MINE databases. Starting from the Kyoto Encyclopedia of Genes and Genomes (KEGG) COMPOUND database, the MINE contains over 571,000 compounds, of which 93% are not present in the PubChem database. However, these MINE compounds have on average higher structural similarity to natural products than compounds from KEGG or PubChem. MINE databases were able to propose annotations for 98.6% of a set of 667 MassBank spectra, 14% more than KEGG alone and equivalent to PubChem while returning far fewer candidates per spectra than PubChem (46 vs. 1715 median candidates). Application of MINEs to LC–MS accurate mass data enabled the identity of an unknown peak to be confidently predicted.ConclusionsMINE databases are freely accessible for non-commercial use via user-friendly web-tools at http://minedatabase.mcs.anl.gov and developer-friendly APIs. MINEs improve metabolomics peak identification as compared to general chemical databases whose results include irrelevant synthetic compounds. Furthermore, MINEs complement and expand on previous in silico generated compound databases that focus on human metabolism. We are actively developing the database; future versions of this resource will incorporate transformation rules for spontaneous chemical reactions and more advanced filtering and prioritization of candidate structures.Graphical abstractMINE database construction and access methods. The process of constructing a MINE database from the curated source databases is depicted on the left. The methods for accessing the database are shown on the right.Electronic supplementary materialThe online version of this article (doi:10.1186/s13321-015-0087-1) contains supplementary material, which is available to authorized users.
Botryococcene biosynthesis is thought to resemble that of squalene, a metabolite essential for sterol metabolism in all eukaryotes. Squalene arises from an initial condensation of two molecules of farnesyl diphosphate (FPP) to form presqualene diphosphate (PSPP), which then undergoes a reductive rearrangement to form squalene. In principle, botryococcene could arise from an alternative rearrangement of the presqualene intermediate. Because of these proposed similarities, we predicted that a botryococcene synthase would resemble squalene synthase and hence isolated squalene synthase-like genes from Botryococcus braunii race B. While B. braunii does harbor at least one typical squalene synthase, none of the other three squalene synthase-like (SSL) genes encodes for botryococcene biosynthesis directly. SSL-1 catalyzes the biosynthesis of PSPP and SSL-2 the biosynthesis of bisfarnesyl ether, while SSL-3 does not appear able to directly utilize FPP as a substrate. However, when combinations of the synthase-like enzymes were mixed together, in vivo and in vitro, robust botryococcene (SSL-1+SSL-3) or squalene biosynthesis (SSL1+SSL-2) was observed. These findings were unexpected because squalene synthase, an ancient and likely progenitor to the other Botryococcus triterpene synthases, catalyzes a two-step reaction within a single enzyme unit without intermediate release, yet in B. braunii, these activities appear to have separated and evolved interdependently for specialized triterpene oil production greater than 500 MYA. Coexpression of the SSL-1 and SSL-3 genes in different configurations, as independent genes, as gene fusions, or targeted to intracellular membranes, also demonstrate the potential for engineering even greater efficiencies of botryococcene biosynthesis.algae | biofuels | terpene enzymology
BackgroundIt is now recognized that enzymatic or chemical side-reactions can convert normal metabolites to useless or toxic ones and that a suite of enzymes exists to mitigate such metabolite damage. Examples are the reactive imine/enamine intermediates produced by threonine dehydratase, which damage the pyridoxal 5'-phosphate cofactor of various enzymes causing inactivation. This damage is pre-empted by RidA proteins, which hydrolyze the imines before they do harm. RidA proteins belong to the YjgF/YER057c/UK114 family (here renamed the Rid family). Most other members of this diverse and ubiquitous family lack defined functions.ResultsPhylogenetic analysis divided the Rid family into a widely distributed, apparently archetypal RidA subfamily and seven other subfamilies (Rid1 to Rid7) that are largely confined to bacteria and often co-occur in the same organism with RidA and each other. The Rid1 to Rid3 subfamilies, but not the Rid4 to Rid7 subfamilies, have a conserved arginine residue that, in RidA proteins, is essential for imine-hydrolyzing activity. Analysis of the chromosomal context of bacterial RidA genes revealed clustering with genes for threonine dehydratase and other pyridoxal 5'-phosphate-dependent enzymes, which fits with the known RidA imine hydrolase activity. Clustering was also evident between Rid family genes and genes specifying FAD-dependent amine oxidases or enzymes of carbamoyl phosphate metabolism. Biochemical assays showed that Salmonella enterica RidA and Rid2, but not Rid7, can hydrolyze imines generated by amino acid oxidase. Genetic tests indicated that carbamoyl phosphate overproduction is toxic to S. enterica cells lacking RidA, and metabolomic profiling of Rid knockout strains showed ten-fold accumulation of the carbamoyl phosphate-related metabolite dihydroorotate.ConclusionsLike the archetypal RidA subfamily, the Rid2, and probably the Rid1 and Rid3 subfamilies, have imine-hydrolyzing activity and can pre-empt damage from imines formed by amine oxidases as well as by pyridoxal 5'-phosphate enzymes. The RidA subfamily has an additional damage pre-emption role in carbamoyl phosphate metabolism that has yet to be biochemically defined. Finally, the Rid4 to Rid7 subfamilies appear not to hydrolyze imines and thus remain mysterious.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-015-1584-3) contains supplementary material, which is available to authorized users.
To synthesize the cofactor thiamin diphosphate (ThDP), plants must first hydrolyze thiamin monophosphate (ThMP) to thiamin, but dedicated enzymes for this hydrolysis step were unknown and widely doubted to exist. The classical thiaminrequiring th2-1 mutation in Arabidopsis thaliana was shown to reduce ThDP levels by half and to increase ThMP levels 5-fold, implying that the THIAMIN REQUIRING2 (TH2) gene product could be a dedicated ThMP phosphatase. Genomic and transcriptomic data indicated that TH2 corresponds to At5g32470, encoding a HAD (haloacid dehalogenase) family phosphatase fused to a TenA (thiamin salvage) family protein. Like the th2-1 mutant, an insertional mutant of At5g32470 accumulated ThMP, and the thiamin requirement of the th2-1 mutant was complemented by wild-type At5g32470. Complementation tests in Escherichia coli and enzyme assays with recombinant proteins confirmed that At5g32470 and its maize (Zea mays) orthologs GRMZM2G148896 and GRMZM2G078283 are ThMP-selective phosphatases whose activity resides in the HAD domain and that the At5g32470 TenA domain has the expected thiamin salvage activity. In vitro and in vivo experiments showed that alternative translation start sites direct the At5g32470 protein to the cytosol and potentially also to mitochondria. Our findings establish that plants have a dedicated ThMP phosphatase and indicate that modest (50%) ThDP depletion can produce severe deficiency symptoms.
NADH and NADPH undergo spontaneous and enzymatic reactions that produce R and S forms of NAD(P)H hydrates [NAD(P)HX], which are not electron donors and inhibit various dehydrogenases. In bacteria, yeast (Saccharomyces cerevisiae), and mammals, these hydrates are repaired by the tandem action of an ADP-or ATP-dependent dehydratase that converts (S)-NAD(P)HX to NAD(P)H and an epimerase that facilitates interconversion of the R and S forms. Plants have homologs of both enzymes, the epimerase homolog being fused to the vitamin B 6 salvage enzyme pyridoxine 59-phosphate oxidase. Recombinant maize (Zea mays) and Arabidopsis (Arabidopsis thaliana) NAD(P)HX dehydratases (GRMZM5G840928, At5g19150) were able to reconvert (S)-NAD(P)HX to NAD(P)H in an ATP-dependent manner. Recombinant maize and Arabidopsis epimerases (GRMZM2G061988, At5g49970) rapidly interconverted (R)-and (S)-NAD(P)HX, as did a truncated form of the Arabidopsis epimerase lacking the pyridoxine 59-phosphate oxidase domain. All plant NAD(P)HX dehydratase and epimerase sequences examined had predicted organellar targeting peptides with a potential second start codon whose use would eliminate the targeting peptide. In vitro transcription/translation assays confirmed that both start sites were used. Dual import assays with purified pea (Pisum sativum) chloroplasts and mitochondria, and subcellular localization of GFP fusion constructs in tobacco (Nicotiana tabacum) suspension cells, indicated mitochondrial, plastidial, and cytosolic localization of the Arabidopsis epimerase and dehydratase. Ablation of the Arabidopsis dehydratase gene raised seedling levels of all NADHX forms by 20-to 40-fold, and levels of one NADPHX form by 10-to 30-fold. We conclude that plants have a canonical two-enzyme NAD(P)HX repair system that is directed to three subcellular compartments via the use of alternative translation start sites.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.