2021
DOI: 10.1093/bioinformatics/btab311
|View full text |Cite|
|
Sign up to set email alerts
|

On the feasibility of deep learning applications using raw mass spectrometry data

Abstract: Summary In recent years, SWATH-MS has become the proteomic method of choice for data-independent–acquisition, as it enables high proteome coverage, accuracy and reproducibility. However, data analysis is convoluted and requires prior information and expert curation. Furthermore, as quantification is limited to a small set of peptides, potentially important biological information may be discarded. Here we demonstrate that deep learning can be used to learn discriminative features directly from… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 10 publications
(16 citation statements)
references
References 30 publications
0
15
0
Order By: Relevance
“…By utilizing a larger data set for initially training on a similar “out of domain” task of classifying photos of animals, the model may perform better once fine-tuned for the new “domain” of classifying photos of plants. Transfer learning has already been utilized for MSI, sample classification, , RT prediction, and spectra refinement . However, the repurposing and retraining of domain/task-specific models for MS data as starting points remain limited, although some groups have developed domain/task specific transfer learning approaches from the ground up .…”
Section: Future Directionsmentioning
confidence: 99%
“…By utilizing a larger data set for initially training on a similar “out of domain” task of classifying photos of animals, the model may perform better once fine-tuned for the new “domain” of classifying photos of plants. Transfer learning has already been utilized for MSI, sample classification, , RT prediction, and spectra refinement . However, the repurposing and retraining of domain/task-specific models for MS data as starting points remain limited, although some groups have developed domain/task specific transfer learning approaches from the ground up .…”
Section: Future Directionsmentioning
confidence: 99%
“…The terms ML and DL are often used interchangeably (and not entirely incorrectly so) but there is an important difference: ML requires the user to define features in the data, whereas DL selects these by itself, therefore making the results more objective and allowing for higher throughput [161]. DL has already found its way to the field of proteomics [40,[162][163][164], is suggested to be a valuable tool in the integration of omics data [38,165] and has shown its merit in MS imaging [166]. Furthermore, the increase in ML algorithm performance eventually levels off for increasing data set size, whereas DL algorithm performance keeps improving, thereby surpassing ML algorithm performance.…”
Section: Deep Learningmentioning
confidence: 99%
“…As the most prominent example, convolutional neural networks (CNNs) have been applied to classify proteomic samples by converting their peptide profile into a heatmap-like image, distinguishing conditions based on their spatial peak patterns (4)(5)(6)(7)(8). To this end, CNNs rely on binning data of proximate signals to achieve a uniform grid input, and to employ pre-trained networks (4), which aligns individual samples that suffer from variabilities in the measurement (6). Nevertheless, this requires complex pre-processing, and it is unclear to what extent binning affects the data and may cause loss of information.…”
Section: Introductionmentioning
confidence: 99%