2012
DOI: 10.1074/mcp.m111.015768
|View full text |Cite
|
Sign up to set email alerts
|

Shotgun Protein Sequencing with Meta-contig Assembly

Abstract: Full-length de novo sequencing from tandem mass (MS/ MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
56
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 26 publications
(59 citation statements)
references
References 44 publications
(31 reference statements)
0
56
0
Order By: Relevance
“…These methods may well be effective for simple spectra, derived from short peptides, yet they are still inadequate for the analysis of spectra derived from larger peptides and proteins. This is not to say that de-novo sequencing has no future in Top-Down proteomics; though still in development, multiple publications have presented some promise for Top-Down proteomics, either through a combination of Bottom-Up and Top-Down data[147,148] or through a constrained approach [149] that requires a sequence "anchor" from which the de-novo algorithm works.Despite its limitations Top-Down proteomics, either in its more "primitive form" or the modern one, provided an immensely important finding about venom biology: the understanding that venom production and secretion is not monolithic and uniform, but rather a complex mixture whose composition is susceptible to biochemical and behavioral modulation.Studies have hinted of this phenomenon in scorpions [17, 103], snake [150] and cone snails [16, 123, 151]. It is still…”
mentioning
confidence: 99%
“…These methods may well be effective for simple spectra, derived from short peptides, yet they are still inadequate for the analysis of spectra derived from larger peptides and proteins. This is not to say that de-novo sequencing has no future in Top-Down proteomics; though still in development, multiple publications have presented some promise for Top-Down proteomics, either through a combination of Bottom-Up and Top-Down data[147,148] or through a constrained approach [149] that requires a sequence "anchor" from which the de-novo algorithm works.Despite its limitations Top-Down proteomics, either in its more "primitive form" or the modern one, provided an immensely important finding about venom biology: the understanding that venom production and secretion is not monolithic and uniform, but rather a complex mixture whose composition is susceptible to biochemical and behavioral modulation.Studies have hinted of this phenomenon in scorpions [17, 103], snake [150] and cone snails [16, 123, 151]. It is still…”
mentioning
confidence: 99%
“…Remaining peaks not predicted to be y-ions were converted to charge one by a previously described MS/MS deconvolution tool (38). Deconvoluted DTA spectra that originated from identified MS/MS scans were then paired with the MSGF ϩ peptide IDs and passed to PepNovo ϩ for training.…”
Section: Lc-ms/ms-samplesmentioning
confidence: 99%
“…Yet, recent advances in de novo sequencing have demonstrated 97-99% sequencing accuracy (percent of amino acids in matched peptides that are correct) at nearly the same level of coverage (percent of amino acids in target peptides that were matched) as that of database search for small mixtures of target proteins (Guthals et al, 2012a(Guthals et al, , 2013. At the heart of this approach is the pairing of spectra from overlapping peptides (i.e., peptides that have overlapping sequences) to construct spectral networks (Bandeira et al, 2004;Guthals et al, 2012b) where a node represents an individual spectrum [or a consensus spectrum from a clustered set of spectra from the same precursor (Frank et al, 2008)] and edges denote pairs of spectra from peptides with overlapping sequences.…”
Section: Introductionmentioning
confidence: 99%
“…At the heart of this approach is the pairing of spectra from overlapping peptides (i.e., peptides that have overlapping sequences) to construct spectral networks (Bandeira et al, 2004;Guthals et al, 2012b) where a node represents an individual spectrum [or a consensus spectrum from a clustered set of spectra from the same precursor (Frank et al, 2008)] and edges denote pairs of spectra from peptides with overlapping sequences. It is then shown that de novo sequences assembled by simultaneous interpretation of multiple spectra from overlapping peptides are much more accurate than individual per-spectrum interpretations (Guthals et al, 2012a(Guthals et al, , 2013. Use of multiple enzyme digestions and strong cation exchange (SCX) (Edelmann, 2011) fractionation is becoming more common in MS/MS protocols to generate broader coverage of protein sequences and yield wider distributions of overlapping peptides, but current statistical methods still ignore the peptide sequence overlaps and separately compute the significance of individual peptides matched to individual spectra (Swaney et al, 2010).…”
Section: Introductionmentioning
confidence: 99%