BackgroundMass spectrometry (MS) coupled with online separation methods is commonly applied for differential and quantitative profiling of biological samples in metabolomic as well as proteomic research. Such approaches are used for systems biology, functional genomics, and biomarker discovery, among others. An ongoing challenge of these molecular profiling approaches, however, is the development of better data processing methods. Here we introduce a new generation of a popular open-source data processing toolbox, MZmine 2.ResultsA key concept of the MZmine 2 software design is the strict separation of core functionality and data processing modules, with emphasis on easy usability and support for high-resolution spectra processing. Data processing modules take advantage of embedded visualization tools, allowing for immediate previews of parameter settings. Newly introduced functionality includes the identification of peaks using online databases, MSn data support, improved isotope pattern support, scatter plot visualization, and a new method for peak list alignment based on the random sample consensus (RANSAC) algorithm. The performance of the RANSAC alignment was evaluated using synthetic datasets as well as actual experimental data, and the results were compared to those obtained using other alignment algorithms.ConclusionsMZmine 2 is freely available under a GNU GPL license and can be obtained from the project website at: http://mzmine.sourceforge.net/. The current version of MZmine 2 is suitable for processing large batches of data and has been applied to both targeted and non-targeted metabolomic analyses.
Molecular networking has become a key method to visualize and annotate the chemical space in non-targeted mass spectrometry data. We present Feature-Based Molecular Networking (FBMN) as an analysis method in the Global Natural Products Social Molecular Networking (GNPS) infrastructure that builds on chromatographic feature detection and alignment tools. The FBMN method brings quantitative analyses, isomeric resolution, including from ion-mobility spectrometry, into molecular networks.
Background The Chemistry Development Kit (CDK) is a widely used open source cheminformatics toolkit, providing data structures to represent chemical concepts along with methods to manipulate such structures and perform computations on them. The library implements a wide variety of cheminformatics algorithms ranging from chemical structure canonicalization to molecular descriptor calculations and pharmacophore perception. It is used in drug discovery, metabolomics, and toxicology. Over the last 10 years, the code base has grown significantly, however, resulting in many complex interdependencies among components and poor performance of many algorithms.Results We report improvements to the CDK v2.0 since the v1.2 release series, specifically addressing the increased functional complexity and poor performance. We first summarize the addition of new functionality, such atom typing and molecular formula handling, and improvement to existing functionality that has led to significantly better performance for substructure searching, molecular fingerprints, and rendering of molecules. Second, we outline how the CDK has evolved with respect to quality control and the approaches we have adopted to ensure stability, including a code review mechanism.ConclusionsThis paper highlights our continued efforts to provide a community driven, open source cheminformatics library, and shows that such collaborative projects can thrive over extended periods of time, resulting in a high-quality and performant library. By taking advantage of community support and contributions, we show that an open source cheminformatics project can act as a peer reviewed publishing platform for scientific computing software.Graphical abstractCDK 2.0 provides new features and improved performance Electronic supplementary materialThe online version of this article (doi:10.1186/s13321-017-0220-4) contains supplementary material, which is available to authorized users.
1Molecular networking has become a key method used to visualize and annotate the chemical space in 2 non-targeted mass spectrometry-based experiments. However, distinguishing isomeric compounds and
Salidroside is a bioactive tyrosine-derived phenolic natural product found in medicinal plants under the Rhodiola genus. In addition to their anti-fatigue and anti-anoxia roles in traditional medicine, Rhodiola total extract and salidroside have also displayed medicinal properties as anti-cardiovascular diseases and anti-cancer agents. The resulting surge in global demand of Rhodiola plants and salidroside has driven some species close to extinction. Here, we report the full elucidation of the Rhodiola salidroside biosynthetic pathway utilizing the first comprehensive transcriptomics and metabolomics datasets for Rhodiola rosea. Unlike the previously proposed pathway involving separate decarboxylation and deamination enzymatic steps from tyrosine to the key intermediate 4-hydroxyphenylacetaldehyde (4-HPAA), Rhodiola contains a pyridoxal phosphate-dependent 4-HPAA synthase that directly converts tyrosine to 4-HPAA. We further identified genes encoding the subsequent 4-HPAA reductase and tyrosol:UDP-glucose 8-O-glucosyltransferase, respectively, to complete salidroside biosynthesis in Rhodiola. We show that heterologous production of salidroside can be achieved in the yeast Saccharomyces cerevisiae as well as the plant Nicotiana benthamiana through transgenic expression of Rhodiola salidroside biosynthetic genes. This study provides new tools for engineering sustainable production of salidroside in heterologous hosts.
Scutellaria baicalensis Georgi is important in Chinese traditional medicine where preparations of dried roots, ''Huang Qin,'' are used for liver and lung complaints and as complementary cancer treatments. We report a high-quality reference genome sequence for S. baicalensis where 93% of the 408.14-Mb genome has been assembled into nine pseudochromosomes with a super-N50 of 33.2 Mb. Comparison of this sequence with those of closely related species in the order Lamiales, Sesamum indicum and Salvia splendens, revealed that a specialized metabolic pathway for the synthesis of 4 0 -deoxyflavone bioactives evolved in the genus Scutellaria. We found that the gene encoding a specific cinnamate coenzyme A ligase likely obtained its new function following recent mutations, and that four genes encoding enzymes in the 4 0 -deoxyflavone pathway are present as tandem repeats in the genome of S. baicalensis. Further analyses revealed that gene duplications, segmental duplication, gene amplification, and point mutations coupled to gene neo-and subfunctionalizations were involved in the evolution of 4 0 -deoxyflavone synthesis in the genus Scutellaria. Our study not only provides significant insight into the evolution of specific flavone biosynthetic pathways in the mint family, Lamiaceae, but also will facilitate the development of tools for enhancing bioactive productivity by metabolic engineering in microbes or by molecular breeding in plants. The reference genome of S. baicalensis is also useful for improving the genome assemblies for other members of the mint family and offers an important foundation for decoding the synthetic pathways of bioactive compounds in medicinal plants.
Salicylic acid (SA) is an important phytohormone mediating both local and systemic defense responses in plants. Despite over half a century of research, how plants biosynthesize SA remains unresolved. In Arabidopsis, a major part of SA is derived from isochorismate, a key intermediate produced by the isochorismate synthase, which is reminiscent of SA biosynthesis in bacteria. Whereas bacteria employ an isochorismate pyruvate lyase (IPL) that catalyzes the turnover of isochorismate to pyruvate and SA, plants do not contain an IPL ortholog and generate SA from isochorismate through an unknown mechanism. Combining genetic and biochemical approaches, we delineated the SA biosynthetic pathway downstream of isochorismate in Arabidopsis. We found that PBS3, a GH3 acyl adenylase-family enzyme important for SA accumulation, catalyzes ATP-and Mg 2+ -dependent conjugation of L-glutamate primarily to the 8-carboxyl of isochorismate and yields the key SA biosynthetic intermediate, isochorismoyl-glutamate A. Moreover, we discovered that EPS1, a BAHD acyltransferase-family protein with a previously implicated role in SA accumulation upon pathogen attack, harbors a noncanonical active site and an unprecedented isochorismoyl-glutamate A pyruvoylglutamate lyase activity that produces SA from the isochorismoyl-glutamate A substrate. Together, PBS3 and EPS1 form a two-step metabolic pathway to produce SA from isochorismate in Arabidopsis, which is distinct from how SA is biosynthesized in bacteria. This study closes a major knowledge gap in plant SA metabolism and would help develop new strategies for engineering disease resistance in crop plants.
Glucose as a source of energy is centrally important to our understanding of life. We investigated the cell division–quiescence behavior of the fission yeast Schizosaccharomyces pombe under a wide range of glucose concentrations (0–111 mm). The mode of S. pombe cell division under a microfluidic perfusion system was surprisingly normal under highly diluted glucose concentrations (5.6 mm, 1/20 of the standard medium, within human blood sugar levels). Division became stochastic, accompanied by a curious division-timing inheritance, in 2.2–4.4 mm glucose. A critical transition from division to quiescence occurred within a narrow range of concentrations (2.2–1.7 mm). Under starvation (1.1 mm) conditions, cells were mostly quiescent and only a small population of cells divided. Under fasting (0 mm) conditions, division was immediately arrested with a short chronological lifespan (16 h). When cells were first glucose starved prior to fasting, they possessed a substantially extended lifespan (∼14 days). We employed a quantitative metabolomic approach for S. pombe cell extracts, and identified specific metabolites (e.g. biotin, trehalose, ergothioneine, S-adenosyl methionine and CDP-choline), which increased or decreased at different glucose concentrations, whereas nucleotide triphosphates, such as ATP, maintained high concentrations even under starvation. Under starvation, the level of S-adenosyl methionine increased sharply, accompanied by an increase in methylated amino acids and nucleotides. Under fasting, cells rapidly lost antioxidant and energy compounds, such as glutathione and ATP, but, in fasting cells after starvation, these and other metabolites ensuring longevity remained abundant. Glucose-starved cells became resistant to 40 mm H2O2 as a result of the accumulation of antioxidant compounds.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.