Abstract:Liquid chromatography-mass spectrometry (LC-MS)-based proteomics studies of large sample cohorts can easily require from months to years to complete. Acquiring consistent, high-quality data in such large-scale studies is challenging because of normal variations in instrumentation performance over time, as well as artifacts introduced by the samples themselves, such as those because of collection, storage and processing. Existing quality control methods for proteomics data primarily focus on post-hoc analysis t… Show more
“…See Supplementary Table S1 for demographic, serology, OGTT, and other information/parameters. Samples from all subjects were divided into four equal aliquots that were individually subjected to proteomics, metabolomics, lipidomics, and transcriptomics (miRNAs) analyses (i.e., parallel quadra-omics) that were performed in collaboration with the Miami Integrative Metabolomics Research Center (lipidomics) [ 58 ], Pacific Northwest National Laboratories (proteomics) [ 59 ], Ocean Ridge Biosciences (transcriptomics) [ 60 ], and the Stedman Metabolomics Laboratory at Duke University Medical Center (metabolomics) [ 61 ].…”
Background: Biomarkers are crucial for detecting early type-1 diabetes (T1D) and preventing significant β-cell loss before the onset of clinical symptoms. Here, we present proof-of-concept studies to demonstrate the potential for identifying integrated biomarker signature(s) of T1D using parallel multi-omics. Methods: Blood from human subjects at high risk for T1D (and healthy controls; n = 4 + 4) was subjected to parallel unlabeled proteomics, metabolomics, lipidomics, and transcriptomics. The integrated dataset was analyzed using Ingenuity Pathway Analysis (IPA) software for disturbances in the at-risk subjects compared to controls. Results: The final quadra-omics dataset contained 2292 proteins, 328 miRNAs, 75 metabolites, and 41 lipids that were detected in all samples without exception. Disease/function enrichment analyses consistently indicated increased activation, proliferation, and migration of CD4 T-lymphocytes and macrophages. Integrated molecular network predictions highlighted central involvement and activation of NF-κB, TGF-β, VEGF, arachidonic acid, and arginase, and inhibition of miRNA Let-7a-5p. IPA-predicted candidate biomarkers were used to construct a putative integrated signature containing several miRNAs and metabolite/lipid features in the at-risk subjects. Conclusions: Preliminary parallel quadra-omics provided a comprehensive picture of disturbances in high-risk T1D subjects and highlighted the potential for identifying associated integrated biomarker signatures. With further development and validation in larger cohorts, parallel multi-omics could ultimately facilitate the classification of T1D progressors from non-progressors.
“…See Supplementary Table S1 for demographic, serology, OGTT, and other information/parameters. Samples from all subjects were divided into four equal aliquots that were individually subjected to proteomics, metabolomics, lipidomics, and transcriptomics (miRNAs) analyses (i.e., parallel quadra-omics) that were performed in collaboration with the Miami Integrative Metabolomics Research Center (lipidomics) [ 58 ], Pacific Northwest National Laboratories (proteomics) [ 59 ], Ocean Ridge Biosciences (transcriptomics) [ 60 ], and the Stedman Metabolomics Laboratory at Duke University Medical Center (metabolomics) [ 61 ].…”
Background: Biomarkers are crucial for detecting early type-1 diabetes (T1D) and preventing significant β-cell loss before the onset of clinical symptoms. Here, we present proof-of-concept studies to demonstrate the potential for identifying integrated biomarker signature(s) of T1D using parallel multi-omics. Methods: Blood from human subjects at high risk for T1D (and healthy controls; n = 4 + 4) was subjected to parallel unlabeled proteomics, metabolomics, lipidomics, and transcriptomics. The integrated dataset was analyzed using Ingenuity Pathway Analysis (IPA) software for disturbances in the at-risk subjects compared to controls. Results: The final quadra-omics dataset contained 2292 proteins, 328 miRNAs, 75 metabolites, and 41 lipids that were detected in all samples without exception. Disease/function enrichment analyses consistently indicated increased activation, proliferation, and migration of CD4 T-lymphocytes and macrophages. Integrated molecular network predictions highlighted central involvement and activation of NF-κB, TGF-β, VEGF, arachidonic acid, and arginase, and inhibition of miRNA Let-7a-5p. IPA-predicted candidate biomarkers were used to construct a putative integrated signature containing several miRNAs and metabolite/lipid features in the at-risk subjects. Conclusions: Preliminary parallel quadra-omics provided a comprehensive picture of disturbances in high-risk T1D subjects and highlighted the potential for identifying associated integrated biomarker signatures. With further development and validation in larger cohorts, parallel multi-omics could ultimately facilitate the classification of T1D progressors from non-progressors.
“…In every step of a data analysis procedure, it is advisable to identify outliers (Bittremieux, Meysman, Martens, Valkenborg, & Laukens, 2016;Kauffmann & Huber, 2010;Norton, Vaquero-Garcia, Lahens, Grant, & Barash, 2018;Stanfill et al, 2018), including samples or features with an abnormal number of missing values, samples that display substantially different distributions in their quantitative features, or samples that do not cluster with the rest of their group. Such cases need to be handled carefully; if the outlying nature of a sample cannot be corrected (through appropriate missing data imputation and data normalization, or after correction of mis-annotated samples), the offending sample might be better removed completely for the subsequent data analysis.…”
Section: Data Cleaning and Normalization For Molecular Networkmentioning
Network Medicine applies network science approaches to investigate disease pathogenesis. Many different analytical methods have been used to infer relevant molecular networks, including protein-protein interaction networks, correlation-based networks, gene regulatory networks, and Bayesian networks. Network Medicine applies these integrated approaches to Omics Big Data (including genetics, epigenetics, transcriptomics, metabolomics, and proteomics) using computational biology tools and, thereby, has the potential to provide improvements in the diagnosis, prognosis, and treatment of complex diseases. We discuss briefly the types of molecular data that are used in molecular network analyses, survey the analytical methods for inferring molecular networks, and review efforts to validate and visualize molecular networks. Successful applications of molecular network analysis have been reported in pulmonary arterial hypertension, coronary heart disease, diabetes mellitus, chronic lung diseases, and drug development. Important knowledge gaps in Network Medicine include incompleteness of the molecular interactome, challenges in identifying key genes within genetic association regions, and limited applications to human diseases.
“…For experiments that contain many samples, and where the time between data acquisition and data analysis is long, this QC/QA step should not be delayed; rather QC/QA should happen immediately upon data acquisition to give feedback as soon as possible to the instrument operator 28 . To fill this need in targeted proteomics studies, we created Q4SRM which can analyze the heavy labeled reference peptides in an LC-SRM-MS data file within one minute.…”
Targeted proteomics experiments based on selected reaction monitoring (SRM) have gained wide adoption in clinical biomarker, cellular modeling and numerous other biological experiments due to their highly accurate and reproducible quantification. The quantitative accuracy in targeted proteomics experiments is reliant on the stable-isotope, heavy-labeled peptide standards which are spiked into a sample and used as a reference when calculating the abundance of endogenous peptides. Therefore, the quality of measurement for these standards is a critical factor in determining whether data acquisition was successful. With improved MS instrumentation that enables the monitoring of hundreds of peptides in hundreds to thousands of samples, quality assessment is increasingly important and cannot be performed manually. We present Q4SRM, a software tool that rapidly checks the signal from all heavy labeled peptides and flags those that fail quality control metrics. Using four metrics, the tool detects problems both with individual SRM transitions and the collective group of transitions that monitor a single 2 peptide. The program's speed enables its use at the point of data acquisition and can be ideally run immediately upon the completion of an LC-SRM-MS analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.