Accurate prediction of peptide fragment ion mass spectra is one of the critical factors to guarantee confident peptide identification by protein sequence database search in bottom-up proteomics. In an attempt to accurately and comprehensively predict this type of mass spectra, a framework named MS(2)PBPI is proposed. MS(2)PBPI first extracts fragment ions from large-scale MS/MS spectra data sets according to the peptide fragmentation pathways and uses binary trees to divide the obtained bulky data into tens to more than 1000 regions. For each adequate region, stochastic gradient boosting tree regression model is constructed. By constructing hundreds of these models, MS(2)PBPI is able to predict MS/MS spectra for unmodified and modified peptides with reasonable accuracy. Moreover, high consistency between predicted and experimental MS/MS spectra derived from different ion trap instruments with low and high resolving power is achieved. MS(2)PBPI outperforms existing algorithms MassAnalyzer and PeptideART.
Scrambled ions have become the focus of recent investigations of peptide fragmentation. Here, an investigation of more than 390,000 high quality CID mass spectra is presented to explore the extent of scrambled ions in mass spectra and the possible fragmentation rules during scramble reactions. For the former, scrambled ions generally make up more than 10 % of mass spectra in number, although the abundances are less than 0.1 of the base peak. For the latter, relatively preferential re-opening sites were found for aliphatic residues Ala, Ile, Leu, and other residues such as Met, Gln, Ser, Phe, and Thr, whereas disfavored sites were found for basic residues Arg, Lys, and His, and Trp for both scrambled b and a ions. Similar preferential order in re-opening reaction was found in the reaction of losing internal residues when cleavage occurs at C-terminal side of 20 residues. However, when cleavage occurs at N-terminal side, Glu, Phe, and Trp become the most preferential sites. These results provide a deep insight into cleavage rules during scramble reactions for prediction of peptide mass spectra. Also, an additional investigation of whether scrambled ions could help discriminate false identifications from correct identifications was performed. Probing the number fraction of scrambled ions in falsely and correctly interpreted spectra and analyzing the correlation between scrambled ions and SEQUEST scores XCorr and Sp showed scrambled ions could at some extent help improve the discrimination in singly charged identifications, whereas no improvement was found for multiply charged results.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.