Label-free quantification (LFQ) and isobaric labeling quantification (ILQ) are among the most popular protein quantification workflows in discovery proteomics. Here, we compared the TMT SPS/MS3 10-plex workflow to a label free single shot data-independent acquisition (DIA) workflow on a controlled sample set. The sample set consisted of ten samples derived from 10 biological replicates of mouse cerebelli spiked with the UPS2 protein standard in five different concentrations. For a fair comparison, we matched the instrument time for the two workflows. The LC–MS data were acquired at two facilities to assess interlaboratory reproducibility. Both methods resulted in a high proteome coverage (>5000 proteins) with low missing values on protein level (<2%). The TMT workflow led to 15–20% more identified proteins and a slightly better quantitative precision, whereas the quantitative accuracy was better for the DIA method. The quantitative performance was benchmarked by the number of true positives (UPS2 proteins) within the top 100 candidates. TMT and DIA showed a similar performance. The quantitative performance of the DIA data stayed in a similar range when searching the spectra against a fasta database directly, instead of using a project-specific library. Our experiments also demonstrated that both workflows are readily transferrable between facilities.
Tandem mass tag (TMT) is a multiplexing technology widely-used in proteomic research. It enables relative quantification of proteins from multiple biological samples in a single mass spectrometry run with high efficiency and high throughput. However, experiments often require more biological replicates or conditions than can be accommodated by a single run, and involve multiple TMT mixtures and multiple runs. Such larger-scale experiments combine sources of biological and technical variation in patterns that are complex, unique to TMT-based workflows, and challenging for the downstream statistical analysis. These patterns cannot be adequately characterized by statistical methods designed for other technologies, such as label-free proteomics or transcriptomics. This manuscript proposes a general statistical approach for relative protein quantification in mass spectrometry-based experiments with TMT labeling. It is applicable to experiments with multiple conditions, multiple biological replicate runs and multiple technical replicate runs, and unbalanced designs. It is based on a flexible family of linear mixed-effects models that handle complex patterns of technical artifacts and missing values. The approach is implemented in MSstatsTMT, a freely available open-source R/Bioconductor package compatible with data processing tools such as Proteome Discoverer, MaxQuant, OpenMS and SpectroMine. Evaluation on a controlled mixture, simulated datasets, and three biological investigations with diverse designs demonstrated that MSstatsTMT balanced the sensitivity and the specificity of detecting differentially abundant proteins, in particular in large-scale experiments with multiple biological mixtures.
Assembling peptides identified from tandem mass spectra into a list of proteins, referred to as protein inference, is a critical step in proteomics research. Due to the existence of degenerate peptides and 'one-hit wonders', it is very difficult to determine which proteins are present in the sample. In this paper, we review existing protein inference methods and classify them according to the source of peptide identifications and the principle of algorithms. It is hoped that the readers will gain a good understanding of the current development in this field after reading this review and come up with new protein inference algorithms.
MassIVE.quant is a repository infrastructure and data resource for reproducible quantitative mass spectrometry-based proteomics, which is compatible with all mass spectrometry data acquisition types and computational analysis tools. A branch structure enables MassIVE.quant to systematically store raw experimental data, metadata of the experimental design, scripts of the quantitative analysis workflow, intermediate input and output files, as well as alternative reanalyses of the same dataset.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.