2007
DOI: 10.1021/pr0605320
|View full text |Cite
|
Sign up to set email alerts
|

Estimating the Statistical Significance of Peptide Identifications from Shotgun Proteomics Experiments

Abstract: We present a wrapper-based approach to estimate and control the false discovery rate for peptide identifications using the outputs from multiple commercially available MS/MS search engines. Features of the approach include the flexibility to combine output from multiple search engines with sequence and spectral derived features in a flexible classification model to produce a score associated with correct peptide identifications. This classification model score from a reversed database search is taken as the nu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
89
0

Year Published

2008
2008
2018
2018

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 62 publications
(89 citation statements)
references
References 38 publications
0
89
0
Order By: Relevance
“…1,2 Peptide assignments are typically accomplished using automated database searching algorithms (e.g., SEQUEST, 2 MASCOT 3 , X!Tandem 4 ) that compare tandem mass spectra with in silico generated model spectra derived from candidate peptide sequences, using scoring schemes to determine relative confidence levels. [5][6][7] low FDRs (e.g.,<1%) have been obtained from conventional precision LC-MS/MS data, 5 the accuracy of such estimates is uncertain. The effectiveness of the identification process decreases as the size of the peptide candidates increases, 8 and thus proteome coverage is decreased if the FDR is to be held constant.…”
Section: Introductionmentioning
confidence: 99%
“…1,2 Peptide assignments are typically accomplished using automated database searching algorithms (e.g., SEQUEST, 2 MASCOT 3 , X!Tandem 4 ) that compare tandem mass spectra with in silico generated model spectra derived from candidate peptide sequences, using scoring schemes to determine relative confidence levels. [5][6][7] low FDRs (e.g.,<1%) have been obtained from conventional precision LC-MS/MS data, 5 the accuracy of such estimates is uncertain. The effectiveness of the identification process decreases as the size of the peptide candidates increases, 8 and thus proteome coverage is decreased if the FDR is to be held constant.…”
Section: Introductionmentioning
confidence: 99%
“…Parameters were set as follows: a mass tolerance of 2 Da for precursors and 0.7 Da for fragment ions, two missed cleavage sites allowed for trypsin, carbamidomethyl cysteine as fixed modification, and oxidized methionine as optional modification. The q-value represents peptide false identification rate and was calculated by a recently published method [18] which incorporates Sequest and X!Tandem results in addition to a number of other relevant factors such as Δ [M+H]+ and charge state. Protein identifications were assigned to one of four groups as discussed previously [19] with priority 1 identifications (IDs) regarded as high-confidence and requiring ≥2 unique peptides and a q-value ≤0.1.…”
Section: Methodsmentioning
confidence: 99%
“…Data collection was performed in the "Triple-Play" mode (MS scan, Zoom scan, and MS/MS scan). Acquired data was filtered and analyzed by a previously published proprietary algorithm developed by Higgs et al [27,28].…”
Section: Liquid Chromatography-tandem Mass Spectrometry (Lc/ms/ Ms)mentioning
confidence: 99%
“…Peptides with peptide ID confidence <75% were filtered out of this study. The estimation of the confidence levels, which is based on a random forest recursive partition supervised learning algorithm was described previously [28].…”
Section: Protein Identification Quantification and Statistical Analmentioning
confidence: 99%
See 1 more Smart Citation