Abstract:Although protein identification by matching tandem mass spectra (MS/MS) against protein databases is a widespread tool in mass spectrometry, the question about reliability of such searches remains open. Absence of rigorous significance scores in MS/MS database search makes it difficult to discard random database hits and may lead to erroneous protein identification, particularly in the case of mutated or post-translationally modified peptides. This problem is especially important for high-throughput MS/MS proj… Show more
“…A second possibility is to interpret tandem mass spectra of peptides using specialized software that creates amino acid sequences de novo [21][22][23][24]. Although the software utilizes different computational principles, sequences of short peptides can be produced rapidly and accurately.…”
Section: Efforts Towards the Identification Of Proteins By Ms/ms And mentioning
confidence: 99%
“…However, the absence of a rigorous scoring system may lead to erroneous identifications as the correct sequence may be present in the list, but may not be ranked among the top hits. Even though it is difficult to use these sequences for cloning (where the requirement is that the sequences should be long, 100% accurate, and encode for low degeneracy primers), they can be successfully used for identifying proteins in a sequence database using various sequence similarity search algorithms [23][24][25].…”
Section: Efforts Towards the Identification Of Proteins By Ms/ms And mentioning
Due to the limited applicability of conventional protein identification methods to the proteomes of organisms with unsequenced genomes, researchers have developed approaches to identify proteins using mass spectrometry and sequence similarity database searches. Both the integration of mass spectrometry with bioinformatics and genomic sequencing drive the expanding organismal scope of proteomics.
“…A second possibility is to interpret tandem mass spectra of peptides using specialized software that creates amino acid sequences de novo [21][22][23][24]. Although the software utilizes different computational principles, sequences of short peptides can be produced rapidly and accurately.…”
Section: Efforts Towards the Identification Of Proteins By Ms/ms And mentioning
confidence: 99%
“…However, the absence of a rigorous scoring system may lead to erroneous identifications as the correct sequence may be present in the list, but may not be ranked among the top hits. Even though it is difficult to use these sequences for cloning (where the requirement is that the sequences should be long, 100% accurate, and encode for low degeneracy primers), they can be successfully used for identifying proteins in a sequence database using various sequence similarity search algorithms [23][24][25].…”
Section: Efforts Towards the Identification Of Proteins By Ms/ms And mentioning
Due to the limited applicability of conventional protein identification methods to the proteomes of organisms with unsequenced genomes, researchers have developed approaches to identify proteins using mass spectrometry and sequence similarity database searches. Both the integration of mass spectrometry with bioinformatics and genomic sequencing drive the expanding organismal scope of proteomics.
“…However, this solution, leading to an exponential number of possibilities, is of course too time consuming [8]. The other one, SpectralAlignment [1,9,10], is a dynamic programming algorithm that has been designed to identify peptides even in presence of modifications. This method works rather well for one or two modifications, but for a larger number of modifications, SpectralAlignment is not really sustainable [9].…”
Section: Introductionmentioning
confidence: 99%
“…The other one, SpectralAlignment [1,9,10], is a dynamic programming algorithm that has been designed to identify peptides even in presence of modifications. This method works rather well for one or two modifications, but for a larger number of modifications, SpectralAlignment is not really sustainable [9]. Spectra comparison has an essential advantage, namely the precision in the comparison, allowing information to be drawn even from spectra which used to be unexploitable with a de novo approach.…”
Abstract. We introduce a new algorithm for the mass spectrometric identification of proteins. Experimental spectra obtained by tandem MS/MS are directly compared to theoretical spectra generated from proteins of evolutionarily closely related organisms. This work is motivated by the need of a method that allows the identification of proteins of unsequenced species against a database containing proteins of related organisms. The idea is that matching spectra of unknown peptides to very similar MS/MS spectra generated from this database of annotated proteins can lead to annotate unknown proteins. This process is similar to ortholog annotation in protein sequence databases. The difficulty with such an approach is that two similar peptides, even with just one modification (i.e. insertion, deletion or substitution of one or several amino acid(s)) between them, usually generate very dissimilar spectra. In this paper, we present a new dynamic programming based algorithm: PacketSpectralAlignment. Our algorithm is tolerant to modifications and fully exploits two important properties that are usually not considered: the notion of inner symmetry, a relation linking pairs of spectrum peaks, and the notion of packet inside each spectrum to keep related peaks together. Our algorithm, PacketSpectralAlignment is then compared to SpectralAlignment [1] on a dataset of simulated spectra. Our tests show that PacketSpectralAlignment behaves better, in terms of results and execution time.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.