AlphaPeptDeep: A modular deep learning framework to predict peptide properties for proteomics

Zeng, Wen‐Feng; Zhou, Xie-Xuan; Willems, Sander; Ammar, Constantin; Wahle, Maria; Bludau, Isabell; Voytik, Eugenia; Strauss, Maximilian T.; Mann, Matthias

doi:10.1101/2022.07.14.499992

Cited by 5 publications

(8 citation statements)

References 57 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Support for more DL models may also benefit HLA rescoring, where models such as Prosit have been trained on nonspecific peptides [47]. Finally, transfer learning and fine-tuning implemented in pDeep3 [40] and AlphaPeptDeep [41] may help to analyze MS/MS spectra acquired using different fragmentation mechanisms, or when identifying peptides containing rare PTMs.…”

Section: Discussionmentioning

confidence: 99%

“…More recently, however, a wave of deep learning (DL) models have been trained to predict the physicochemical properties of peptides and MS/MS spectra [37][38][39][40][41][42]. By training on millions of available peptides, these models can learn general rules to make accurate predictions for new peptides, assuming they are not vastly different from those on which the models were trained.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

MSBooster: Improving Peptide Identification Rates using Deep Learning-Based Features

Yang

Teo

et al. 2022

Preprint

View full text Add to dashboard Cite

Peptide identification in liquid chromatography-tandem mass spectrometry (LC-MS/MS) experiments relies on computational algorithms for matching acquired MS/MS spectra against sequences of candidate peptides using database search tools, such as MSFragger. Here, we present a new tool, MSBooster, for rescoring peptide-to-spectrum matches using additional features incorporating deep learning-based predictions of peptide properties, such as LC retention time, ion mobility, and MS/MS spectra. We demonstrate the utility of MSBooster, in tandem with MSFragger and Percolator, in several different workflows, including nonspecific searches (immunopeptidomics), direct identification of peptides from data independent acquisition data, single-cell proteomics, and data generated on an ion mobility separation-enabled timsTOF MS platform. MSBooster is fast, robust, and fully integrated into the widely used FragPipe computational platform.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

MSBooster: Improving Peptide Identification Rates using Deep Learning-Based Features

Yang

Teo

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Three different strategies are commonly used: experimental libraries, typically acquired by DDA; pseudospectra-based libraries extracted by directDIA as introduced by DIA-Umpire 16 and implemented in Spectronaut; and libraries in which fragment intensities are predicted by deep learning 14,17 . In connection with the latter approach, we recently introduced a deep learning based framework called AlphaPeptDeep, which predicts spectral libraries tailored for different MS platforms, only based on a database file of the proteome in FASTA format or just a peptide list as input 18 . It contains the PeptDeep-HLA model which makes use of the inherent similarity of immunopeptides present within one person based on their HLA type.…”

Section: Resultsmentioning

confidence: 99%

“…As part of our workflow, we have also implemented Data Independent Acquisition (DIA) to expand the depth of the immunopeptidomic data. To tackle the challenge of creating a suitable search space for immunopeptidomics, we employed personalized HLA peptide libraries 18 . This considerably reduces the number of potential 9mers to 12mers in a human FASTA to be searched, increasing the number of significant identifications.…”

Section: Discussion and Outlookmentioning

confidence: 99%

The potential of plasma HLA peptides beyond neoepitopes

Wahle,

Thielert,

Zwiebel

et al. 2023

Preprint

Self Cite

View full text Add to dashboard Cite

Distinction of non-self from self is the major task of the immune system. Immunopeptidomics studies the peptide repertoire presented by the human leukocyte antigen (HLA) protein, usually on tissues. However, HLA peptides are also bound to plasma soluble HLA (sHLA), but little is known about their origin and potential for biomarker discovery in this readily available biofluid. Currently, immunopeptidomics is hampered by complex workflows and limited sensitivity, generally requiring several mL of plasma for the detection of hundreds of HLA peptides. Here, we take advantage of recent improvements in the throughput and sensitivity of mass spectrometry (MS)-based proteomics to develop a highly-sensitive, automated and economical workflow for HLA peptide analysis, termed Immunopeptidomics by Biotinylated Antibodies and Streptavidin (IMBAS). IMBAS-MS quantifies more than 5,000 HLA class I peptides from only 200 uL of plasma, in just 30 minutes. Our technology revealed that the plasma immunopeptidome of healthy donors is remarkably stable throughout a year and strongly correlated between individuals with overlapping HLA types. Immunopeptides originating from diverse tissues, including the brain, are proportionately represented. We conclude that sHLAs are a promising avenue for immunology and precision oncology.

show abstract

“…Frequently used software tools such as DIA-NN (2,3) or Spectronaut (4) match peptides from a library into each of the DIA runs. These libraries were traditionally acquired experimentally with deep data dependent acquisition (DDA)-based measurements of the proteome of interest, but are now often generated directly from the DIA data (5) or in silico from the entire proteome using deep learning (6)(7)(8)(9)(10)(11). Given these advantages very deep and quantitatively accurate data sets can now routinely be generated by DIA.…”

Section: Introductionmentioning

confidence: 99%

Synchro-PASEF allows precursor-specific fragment ion extraction and interference removal in data-independent acquisition

Skowronek

Krohs

Lubeck

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

Data-independent acquisition (DIA) methods have become increasingly popular in mass spectrometry (MS)-based proteomics because they enable continuous acquisition of fragment spectra for all precursors simultaneously. However, these advantages come with the challenge of correctly reconstructing the precursor-fragment relationships in these highly convoluted spectra for reliable identification and quantification. Here we introduce a scan mode for the combination of trapped ion mobility spectrometry (TIMS) with parallel accumulation - serial fragmentation (PASEF) that seamlessly and continuously follows the natural shape of the ion cloud in ion mobility and peptide precursor mass dimensions. Termed synchro-PASEF, it increases the detected fragment ion current several-fold at sub-second cycle times. Consecutive quadrupole selection windows move synchronously through the mass and ion mobility range, defining precursor-quadrupole relationships. In this process, the quadrupole slices through the peptide precursors, which separates fragment ion signals of each precursor into adjacent synchro-PASEF scans. This precisely defines precursor - fragment relationships in ion mobility and mass dimensions and effectively deconvolutes the DIA fragment space. Importantly, the partitioned parts of the fragment ion transitions provide a further dimension of specificity via a lock and key mechanism. This is also advantageous for quantification, where signals from interfering precursors in the DIA selection window do not affect all partitions of the fragment ion, allowing to retain only the specific parts for quantification. Overall, we establish the defining features of synchro-PASEF and explore its potential for proteomic analyses.

show abstract

AlphaPeptDeep: A modular deep learning framework to predict peptide properties for proteomics

Cited by 5 publications

References 57 publications

MSBooster: Improving Peptide Identification Rates using Deep Learning-Based Features

MSBooster: Improving Peptide Identification Rates using Deep Learning-Based Features

The potential of plasma HLA peptides beyond neoepitopes

Synchro-PASEF allows precursor-specific fragment ion extraction and interference removal in data-independent acquisition

Contact Info

Product

Resources

About