Analysis of a single analyte by mass spectrometry can result in the detection of more than 100 degenerate peaks. These degenerate peaks complicate spectral interpretation and are challenging to annotate. In mass spectrometry-based metabolomics, this degeneracy leads to inflated false discovery rates, data sets containing an order of magnitude more features than analytes, and an inefficient use of resources during data analysis. Although software has been introduced to annotate spectral degeneracy, current approaches are unable to represent several important classes of peak relationships. These include heterodimers and higher complex adducts, distal fragments, relationships between peaks in different polarities, and complex adducts between features and background peaks. Here we outline sources of peak degeneracy in mass spectra that are not annotated by current approaches and introduce a software package called mz.unity to detect these relationships in accurate mass data. Using mz.unity, we find that data sets contain many more complex relationships than we anticipated. Examples include the adduct of glutamate and nicotinamide adenine dinucleotide (NAD), fragments of NAD detected in the same or opposite polarities, and the adduct of glutamate and a background peak. Further, the complex relationships we identify show that several assumptions commonly made when interpreting mass spectral degeneracy do not hold in general. These contributions provide new tools and insight to aid in the annotation of complex spectral relationships and provide a foundation for improved data set identification. Mz.unity is an R package and is freely available at https://github.com/nathaniel-mahieu/mz.unity as well as our laboratory Web site http://pattilab.wustl.edu/software/ .
Motivation: Current informatic techniques for processing raw chromatography/mass spectrometry data break down under several common, non-ideal conditions. Importantly, hydrophilic liquid interaction chromatography (a key separation technology for metabolomics) produces data which are especially challenging to process. We identify three critical points of failure in current informatic workflows: compound specific drift, integration region variance, and naive missing value imputation. We implement the Warpgroup algorithm to address these challenges. Results: Warpgroup adds peak subregion detection, consensus integration bound detection, and intelligent missing value imputation steps to the conventional informatic workflow. When compared with the conventional workflow, Warpgroup made major improvements to the processed data. The coefficient of variation for peaks detected in replicate injections of a complex Escherichia Coli extract were halved (a reduction of 19%). Integration regions across samples were much more robust. Additionally, many signals lost by the conventional workflow were 'rescued' by the Warpgroup refinement, thereby resulting in greater analyte coverage in the processed data. Availability and implementation: Warpgroup is an open source R package available on GitHub at github.com/nathaniel-mahieu/warpgroup. The package includes example data and XCMS compatibility wrappers for ease of use.
Existing hydrophilic interaction liquid chromatography (HILIC) methods, considered individually, each exhibit poor chromatographic performance for a substantial fraction of polar metabolites. In addition to limiting metabolome coverage, such deficiencies also complicate automated data processing. Here we show that some of these analytical challenges can be addressed for the ZIC-pHILIC, a zwitterionic stationary phase commonly used in metabolomics, with the addition of trace levels of phosphate. Specifically, micromolar phosphate extended metabolome coverage by hundreds of credentialed features, improved peak shapes, and reduced peak-detection errors during informatic processing. Although the addition of high levels of phosphate (millimolar) as a HILIC mobile phase buffer has been explored previously, such concentrations interfere with mass spectrometric (MS) detection. We show that using phosphate as a trace additive at micromolar concentrations improves analysis by electrospray MS, increasing signal for a diverse set of polar standards. Given the small amount of phosphate needed, comparable chromatographic improvements were also achieved by direct addition of phosphate to the sample during reconstitution. Our results suggest that defects in ZIC-pHILIC performance are predominantly driven by electrostatic interactions, which can be modulated by phosphate. These findings constitute both a methodological improvement for untargeted metabolomics and an advance in our understanding of the mechanisms limiting HILIC coverage.
Although it is common in untargeted metabolomics to apply reversed-phase liquid chromatography (RPLC) and hydrophilic interaction liquid chromatography (HILIC) methods that have been systematically optimized for lipids and central carbon metabolites, here we show that these established protocols provide poor coverage of semipolar metabolites because of inadequate retention. Our objective was to develop an RPLC approach that improved detection of these metabolites without sacrificing lipid coverage. We initially evaluated columns recently released by Waters under the CORTECS line by analyzing 47 small-molecule standards that evenly span the nonpolar and semipolar ranges. An RPLC method commonly used in untargeted metabolomics was considered a benchmarking reference. We found that highly nonpolar and semipolar metabolites cannot be reliably profiled with any single method because of retention and solubility limitations of the injection solvent. Instead, we optimized a multiplexed approach using the CORTECS T3 column to analyze semipolar compounds and the CORTECS C column to analyze lipids. Strikingly, we determined that combining these methods allowed detection of 41 of the total 47 standards, whereas our reference RPLC method detected only 10 of the 47 standards. We then applied credentialing to compare method performance at the comprehensive scale. The tandem method showed more than a fivefold increase in credentialing coverage relative to our RPLC benchmark. Our results demonstrate that comprehensive coverage of metabolites amenable to reversed-phase separation necessitates two reconstitution solvents and chromatographic methods. Thus, we suggest complementing HILIC methods with a dual T3 and C RPLC approach to increase coverage of semipolar metabolites and lipids for untargeted metabolomics. Graphical abstract Analysis of semipolar and nonpolar metabolites necessitates two reversed-phase chromatography (RPLC) methods, which extend metabolome coverage more than fivefold for untargeted profiling. HILIC hydrophilic interaction liquid chromatography.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.