2020
DOI: 10.1101/2020.05.19.102285
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep learning the collisional cross sections of the peptide universe from a million training samples

Abstract: The size and shape of peptide ions in the gas phase are an under-explored dimension for mass spectrometry-based proteomics. To explore the nature and utility of the entire peptide collisional cross section (CCS) space, we measure more than a million data points from whole-proteome digests of five organisms with trapped ion mobility spectrometry (TIMS) and parallel accumulation -serial fragmentation (PASEF). The scale and precision (CV <1%) of our data is sufficient to train a deep recurrent neural network that… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
7
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(9 citation statements)
references
References 73 publications
2
7
0
Order By: Relevance
“…3C). To achieve high data completeness between single-cell measurements, we next replaced dda by diaPASEF, in which fragment level matching is further supported by ion mobility data 35 . We found that combining diaPASEF scan repetitions further improved protein identification numbers (Fig.…”
Section: Single-cell Protein Extraction Coupled To Low Flow Chromatogmentioning
confidence: 99%
See 1 more Smart Citation
“…3C). To achieve high data completeness between single-cell measurements, we next replaced dda by diaPASEF, in which fragment level matching is further supported by ion mobility data 35 . We found that combining diaPASEF scan repetitions further improved protein identification numbers (Fig.…”
Section: Single-cell Protein Extraction Coupled To Low Flow Chromatogmentioning
confidence: 99%
“…Chemical noise is widely distributed as a result of its heterogeneous nature and the ten-fold increased peak capacity due to TIMS (Fig. 1a, b) 24 . These precursors can be fragmented in a highly sensitive manner, either in data dependent (ddaPASEF) or data independent (diaPASEF) mode, resulting in very high ion utilization and data completeness 25 .…”
Section: Noise-reduced Quantitative Mass Spectramentioning
confidence: 99%
“…At the time of our work, Meier et al concurrently developed a deep learning CCS prediction model using 570,000 unique combinations of sequence, charge state (2+, 3+ and 4+), including peptides with oxidized methionine. 38 However, many machine learning approaches often operate as "black boxes", providing limited information on the underlying separation mechanisms. Meier et al have demonstrated the contributions of the 20 amino acid residues and the qualitative trends for hydrophobicity, Pro content, and position of His.…”
Section: Introductionmentioning
confidence: 99%
“…Meier et al have demonstrated the contributions of the 20 amino acid residues and the qualitative trends for hydrophobicity, Pro content, and position of His. 38 They highlighted the difficulty to model the observed physicochemical properties along with sequence dependent features directly with the linear sequences and our work here is able to address such difficulties as well as investigate finer composition and position dependent features that are novel to our approach. Therefore, a semiempirical Sequence-Specific Retention Calculator (SSRCalc) approach based on positiondependent correction coefficients was applied in this work to establish a Sequence-Specific Ion mobility Calculator (SSICalc).…”
Section: Introductionmentioning
confidence: 99%
“…Numeric results have confirmed that for these data, features from deep representation networks outperform those from conventional feature engineering strategies as the latter may not be able to exploit the intrinsic joint distribution of the signals in the high-dimension feature space towards the designated target of learning 35 . For DIA data analysis 36 , deep learning has also been recently introduced for the prediction of fragment intensities, retention time and ion mobility [37][38][39][40][41] , and de novo sequencing 42 .…”
mentioning
confidence: 99%