2019
DOI: 10.1101/849331
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A machine learning strategy that leverages large datasets to boost statistical power in small-scale experiments

Abstract: 1Machine learning methods have proven invaluable for increasing the sensitivity of peptide de-2 tection in proteomics experiments. Most modern tools, such as Percolator and PeptideProphet, 3 use semi-supervised algorithms to learn models directly from the datasets that they analyze. 4 Although these methods are effective for many proteomics experiments, we suspected that they 5 may be suboptimal for experiments of smaller scale. In this work, we found that the power and 6 consistency of Percolator results w… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 40 publications
0
2
0
Order By: Relevance
“…In previous studies of coral skeletal proteomes which detailed differences between solubility fractions, the instruments used were low resolution, low mass accuracy, which tend to result in a lower percent identification of the data. Additionally, most search engines and FDR-calculating algorithms struggle with very small datasets [66,67]. Our use of Byonic helps to alleviate statistical limitations that can result in false negative results.…”
Section: Discussionmentioning
confidence: 99%
“…In previous studies of coral skeletal proteomes which detailed differences between solubility fractions, the instruments used were low resolution, low mass accuracy, which tend to result in a lower percent identification of the data. Additionally, most search engines and FDR-calculating algorithms struggle with very small datasets [66,67]. Our use of Byonic helps to alleviate statistical limitations that can result in false negative results.…”
Section: Discussionmentioning
confidence: 99%
“…In previous studies of coral skeletal proteomes which detailed differences between solubility fractions, the instruments used were low resolution, low mass accuracy, which tend to result in a lower percent identification of the data. Additionally, most search engines and FDR-calculating algorithms struggle with very small datasets [63, 64]. Our use of Byonic helps to alleviate statistical limitations that can result in false negative results.…”
Section: Discussionmentioning
confidence: 99%