In metabolomics, the objective is to identify differences in metabolite profiles between samples. A widely used tool in metabolomics investigations is gas chromatography-mass spectrometry (GC/MS). More than 400 compounds can be detected in a single analysis, if overlapping GC/MS peaks are deconvoluted. However, the deconvolution process is time-consuming and difficult to automate, and additional processing is needed in order to compare samples. Therefore, there is a need to improve and automate the data processing strategy for data generated in GC/MS-based metabolomics; if not, the processing step will be a major bottleneck for high-throughput analyses. Here we describe a new semiautomated strategy using a hierarchical multivariate curve resolution approach that processes all samples simultaneously. The presented strategy generates (after appropriate treatment, e.g., multivariate analysis) tables of all the detected metabolites that differ in relative concentrations between samples. The processing of 70 samples took similar time to that of the GC/TOFMS analyses of the samples. The strategy has been validated using two different sets of samples: a complex mixture of standard compounds and Arabidopsis samples.
In metabolomics, the purpose is to identify and quantify all the metabolites in a biological system. Combined gas chromatography and mass spectrometry (GC/MS) is one of the most commonly used techniques in metabolomics together with 1H NMR, and it has been shown that more than 300 compounds can be distinguished with GC/MS after deconvolution of overlapping peaks. To avoid having to deconvolute all analyzed samples prior to multivariate analysis of the data, we have developed a strategy for rapid comparison of nonprocessed MS data files. The method includes baseline correction, alignment, time window determinations, alternating regression, PLS-DA, and identification of retention time windows in the chromatograms that explain the differences between the samples. Use of alternating regression also gives interpretable loadings, which retain the information provided by m/z values that vary between the samples in each retention time window. The method has been applied to plant extracts derived from leaves of different developmental stages and plants subjected to small changes in day length. The data show that the new method can detect differences between the samples and that it gives results comparable to those obtained when deconvolution is applied prior to the multivariate analysis. We suggest that this method can be used for rapid comparison of large sets of GC/MS data, thereby applying time-consuming deconvolution only to parts of the chromatograms that contribute to explain the differences between the samples.
Analysis of the entire set of low molecular weight compounds (LMC), the metabolome, could provide deeper insights into mechanisms of disease and novel markers for diagnosis. In the investigation, we developed an extraction and derivatization protocol, using experimental design theory (design of experiment), for analyzing the human blood plasma metabolome by GC/MS. The protocol was optimized by evaluating the data for more than 500 resolved peaks using multivariate statistical tools including principal component analysis and partial least-squares projections to latent structures (PLS). The performance of five organic solvents (methanol, ethanol, acetonitrile, acetone, chloroform), singly and in combination, was investigated to optimize the LMC extraction. PLS analysis demonstrated that methanol extraction was particularly efficient and highly reproducible. The extraction and derivatization conditions were also optimized. Quantitative data for 32 endogenous compounds showed good precision and linearity. In addition, the determined amounts of eight selected compounds agreed well with analyses by independent methods in accredited laboratories, and most of the compounds could be detected at absolute levels of approximately 0.1 pmol injected, corresponding to plasma concentrations between 0.1 and 1 microM. The results suggest that the method could be usefully integrated into metabolomic studies for various purposes, e.g., for identifying biological markers related to diseases.
Amyotrophic lateral sclerosis (ALS) and Parkinson's disease (PD) are protein-aggregation diseases that lack clear molecular etiologies. Biomarkers could aid in diagnosis, prognosis, planning of care, drug target identification and stratification of patients into clinical trials. We sought to characterize shared and unique metabolite perturbations between ALS and PD and matched controls selected from patients with other diagnoses, including differential diagnoses to ALS or PD that visited our clinic for a lumbar puncture. Cerebrospinal fluid (CSF) and plasma from rigorously age-, sex- and sampling-date matched patients were analyzed on multiple platforms using gas chromatography (GC) and liquid chromatography (LC)-mass spectrometry (MS). We applied constrained randomization of run orders and orthogonal partial least squares projection to latent structure-effect projections (OPLS-EP) to capitalize upon the study design. The combined platforms identified 144 CSF and 196 plasma metabolites with diverse molecular properties. Creatine was found to be increased and creatinine decreased in CSF of ALS patients compared to matched controls. Glucose was increased in CSF of ALS patients and α-hydroxybutyrate was increased in CSF and plasma of ALS patients compared to matched controls. Leucine, isoleucine and ketoleucine were increased in CSF of both ALS and PD. Together, these studies, in conjunction with earlier studies, suggest alterations in energy utilization pathways and have identified and further validated perturbed metabolites to be used in panels of biomarkers for the diagnosis of ALS and PD.
BackgroundMetabolites are not only the catalytic products of enzymatic reactions but also the active regulators or the ultimate phenotype of metabolic homeostasis in highly complex cellular processes. The modes of regulation at the metabolome level can be revealed by metabolic networks. We investigated the metabolic network between wild-type and 2 mutant (methionine-over accumulation 1 [mto1] and transparent testa4 [tt4]) plants regarding the alteration of metabolite accumulation in Arabidopsis thaliana.ResultsIn the GC-TOF/MS analysis, we acquired quantitative information regarding over 170 metabolites, which has been analyzed by a novel score (ZMC, z-score of metabolite correlation) describing a characteristic metabolite in terms of correlation. Although the 2 mutants revealed no apparent morphological abnormalities, the overall correlation values in mto1 were much lower than those of the wild-type and tt4 plants, indicating the loss of overall network stability due to the uncontrolled accumulation of methionine. In the tt4 mutant, a new correlation between malate and sinapate was observed although the levels of malate, sinapate, and sinapoylmalate remain unchanged, suggesting an adaptive reconfiguration of the network. Gene-expression correlations presumably responsible for these metabolic networks were determined using the metabolite correlations as clues.ConclusionTwo Arabidopsis mutants, mto1 and tt4, exhibited the following changes in entire metabolome networks: the overall loss of metabolic stability (mto1) or the generation of a metabolic network of a backup pathway for the lost physiological functions (tt4). The expansion of metabolite correlation to gene-expression correlation provides detailed insights into the systemic understanding of the plant cellular process regarding metabolome and transcriptome.
A GGGGCC-repeat expansion in C9orf72 is the most common genetic cause of amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) among Caucasians. However, little is known about the variability of the GGGGCC expansion in different tissues and whether this correlates with the observed phenotype. Here, we used Southern blotting to estimate the size of hexanucleotide expansions in C9orf72 in neural and non-neural tissues from 18 autopsied ALS and FTD patients with repeat expansion in blood. Digitalization of the Southern blot images allowed comparison of repeat number, smear distribution and expansion band intensity between tissues and between patients. We found marked intra-individual variation of repeat number between tissues, whereas there was less variation within each tissue group. In two patients, the size variation between tissues was extreme, with repeat numbers below 100 in all studied non-neural tissues, whereas expansions in neural tissues were 20-40 times greater and in the same size range observed in neural tissues of the other 16 patients. The expansion pattern in different tissues could not distinguish between diagnostic groups and no correlation was found between expansion size in frontal lobe and occurrence of cognitive impairment. In ALS patients, a less number of repeats in the cerebellum and parietal lobe correlated with earlier age of onset and a larger number of repeats in the parietal lobe correlated with a more rapid progression. In 43 other individuals without repeat expansion in blood, we find that repeat sizes up to 15 are stable, as no size variation between blood, brain and spinal cord was found.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.