A major purpose of exploratory metabolic profiling is for the identification of molecular species that are statistically associated with specific biological or medical outcomes; unfortunately, the structure elucidation process of unknowns is often a major bottleneck in this process. We present here new holistic strategies that combine different statistical spectroscopic and analytical techniques to improve and simplify the process of metabolite identification. We exemplify these strategies using study data collected as part of a dietary intervention to improve health and which elicits a relatively subtle suite of changes from complex molecular profiles. We identify three new dietary biomarkers related to the consumption of peas (N-methyl nicotinic acid), apples (rhamnitol), and onions (N-acetyl-S-(1Z)-propenyl-cysteine-sulfoxide) that can be used to enhance dietary assessment and assess adherence to diet. As part of the strategy, we introduce a new probabilistic statistical spectroscopy tool, RED-STORM (Resolution EnhanceD SubseT Optimization by Reference Matching), that uses 2D J-resolved 1H NMR spectra for enhanced information recovery using the Bayesian paradigm to extract a subset of spectra with similar spectral signatures to a reference. RED-STORM provided new information for subsequent experiments (e.g., 2D-NMR spectroscopy, solid-phase extraction, liquid chromatography prefaced mass spectrometry) used to ultimately identify an unknown compound. In summary, we illustrate the benefit of acquiring J-resolved experiments alongside conventional 1D 1H NMR as part of routine metabolic profiling in large data sets and show that application of complementary statistical and analytical techniques for the identification of unknown metabolites can be used to save valuable time and resources.
Caenorhabditis elegans and its cognate bacterial diet comprise a reliable, widespread model to study diet and microbiota effects on host physiology. Nonetheless, how diet influences the rate at which neurons die remains largely unknown. A number of models have been used in C. elegans as surrogates for neurodegeneration. One of these is a C. elegans strain expressing a neurotoxic allele of the mechanosensory abnormality protein 4 (MEC-4d) degenerin/epithelial Na + (DEG/ENaC) channel, which causes the progressive degeneration of the touch receptor neurons (TRNs). Using this model, our study evaluated the effect of various dietary bacteria on neurodegeneration dynamics. Although degeneration of TRNs was steady and completed at adulthood in the strain routinely used for C. elegans maintenance (Escherichia coli OP50), it was significantly reduced in environmental and other laboratory bacterial strains. Strikingly, neuroprotection reached more than 40% in the E. coli HT115 strain. HT115 protection was long lasting well into old age of animals and was not restricted to the TRNs. Small amounts of HT115 on OP50 bacteria as well as UV-killed HT115 were still sufficient to produce neuroprotection. Early growth of worms in HT115 protected neurons from degeneration during later growth in OP50. HT115 diet promoted the nuclear translocation of DAF-16 (ortholog of the FOXO family of transcription factors), a phenomenon previously reported to underlie neuroprotection caused by downregulation of the insulin receptor in this system. Moreover, a daf-16 loss-of-function mutation abolishes HT115-driven neuroprotection. Comparative genomics, transcriptomics, and metabolomics approaches pinpointed the neurotransmitter γ-aminobutyric acid (GABA) and lactate as metabolites differentially produced between E. coli HT115 and OP50. HT115 mutant lacking glutamate decarboxylase enzyme genes (gad), which catalyze the conversion of GABA from glutamate, lost the ability to produce GABA and also to stop neurodegeneration. Moreover, in situ GABA supplementation or heterologous expression of glutamate decarboxylase in E. coli OP50 conferred neuroprotective activity to this strain. Specific C. elegans GABA transporters and receptors were required for full PLOS BIOLOGY PLOS Biology | https://doi.
The identification of metabolites in complex biological matrices is a challenging task in 1D 1 H NMR based metabolomic studies. Statistical TOtal Correlation Spectroscopy (STOCSY) has emerged for aiding the structural elucidation by revealing the peaks that present high correlation to a driver peak of interest (which would likely belong to the same molecule). However, in these studies the signals from metabolites are normally present as a mixture of overlapping resonances, limiting the performance of STOCSY. 2D 1 H homonuclear J-resolved spectra (JRES), in its usual tilted and symmetrized processed form, were projected and STOCSY was applied on these 1D projections (p-JRES-STOCSY) as an alternative to avoid the overlap issue, but this approach suffers in cases where the signals are very close. In addition, STOCSY was applied to JRES spectra (also tilted) to identify correlated multiplets, although the overlap issue in itself was not addressed directly and the subsequent search in databases is complicated in cases of higher order coupling. With these limitations in mind, in the present work we propose a new methodology based on the application of STOCSY on a set of nontilted JRES spectra, detecting peaks that would overlap in 1D spectra of the same sample set. COrrelation COmparison Analysis for Peak Overlap Detection (COCOA-POD) is able to reconstruct projected 1D STOCSY traces that result in more suitable database queries, as all peaks are summed at their f2 resonances instead of the resonance corresponding to the multiplet center in the tilted JRES (the peak dispersion and resolution enhancement gained are not sacrificed by the projection). Besides improving database queries with better peak lists obtained from the projections of the 2D STOCSY analysis, the overlap region is examined and the multiplet itself is analyzed from the correlation trace at 45° to obtain a cleaner multiplet profile, free from contributions from uncorrelated neighboring peaks.
HYL1 is a double-stranded RNA binding protein involved in microRNA processing in plants. HYL1 enhances the efficiency and precision of the RNase III protein DCL1 and participates in microRNA strand selection. In this work, we dissect the contributions of the domains of HYL1 to the binding of RNA targets. We found that the first domain is the main contributor to RNA binding. Mapping of the interaction regions by nuclear magnetic resonance on the structure of HYL1 RNA-binding domains showed that the difference in binding capabilities can be traced to sequence divergence in β2-β3 loop. The possible role of each domain is discussed in light of previous experimental data.
Dicer-like ribonuclease III enzymes are involved in different paths related to RNA silencing in plants. Little is known about the structural aspects of these processes. Here we present a structural characterization of the second double-stranded RNA binding domain (dsRBD) of DCL1, which is presumed to participate in pri-micro-RNA recognition and subcellular localization of this protein. We determined the solution structure and found that it has a canonical fold but bears some variation with respect to other homologous domains. We also found that this domain binds both double-stranded RNA and double-stranded DNA, in contrast to most dsRBDs. Our characterization shows that this domain likely has functions other than substrate recognition and binding.
DCL1 is the ribonuclease that carries out miRNA biogenesis in plants. The enzyme has two tandem double stranded RNA binding domains (dsRBDs) in its C-terminus. Here we show that the first of these domains binds precursor RNA fragments when isolated and cooperates with the second domain in the recognition of substrate RNA. Remarkably, despite showing RNA binding activity, this domain is intrinsically disordered. We found that it acquires a folded conformation when bound to its substrate, being the first report of a complete dsRBD folding upon binding. The free unfolded form shows tendency to adopt folded conformations, and goes through an unfolded bound state prior to the folding event. The significance of these results is discussed by comparison with the behavior of other dsRBDs.
NMR-based metabolomics requires proper identification of metabolites to draw conclusions from the system under study. Normally, multivariate data analysis is performed using 1D 1H NMR spectra, and identification of peaks (and then compounds) relevant to the classification is accomplished using database queries as a first step. 1D 1H NMR spectra of complex mixtures often suffer from peak overlap. To overcome this issue, several studies employed the projections of the (tilted and symmetrized) 2D 1H J-resolved (JRES) spectra, p-JRES, which are similar to 1D 1H decoupled spectra. Nonetheless, there are no public databases available that allow searching for chemical shift spectral data for multiplets. We present the Chemical Shift Multiplet Database (CSMDB), built utilizing JRES spectra obtained from the Birmingham Metabolite Library. The CSMDB provides scoring accounting for both matched and unmatched peaks from a query list and the database hits. This input list is generated from a projection of a 2D statistical correlation analysis on the JRES spectra, p-(JRES-STOCSY), being able to compare the multiplets for the matched peaks, in essence, the f1 traces from the JRES-STOCSY spectrum and from the database hit. The inspection of the unmatched peaks for the database hit allows the retrieval of peaks in the query list that have a decreased correlation coefficient due to low intensities. The CSMDB is coupled to “ConQuer ABC”, which permits the assessment of biological correlation by means of consecutive queries with the unmatched peaks in the first and subsequent queries.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.