2014
DOI: 10.1038/nmeth.3144

Proteogenomics: concepts, applications and computational strategies

Abstract: Proteogenomics is an area of research at the interface of proteomics and genomics. In this approach, customized protein sequence databases generated using genomic and transcriptomic information are used to help identify novel peptides (not present in reference protein sequence databases) from mass spectrometry-based proteomic data; in turn, the proteomic data can be used to provide protein-level evidence of gene expression and to help refine gene models. In recent years, owing to the emergence of next generati…
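As a rough illustration of the database-construction idea in the abstract, the following minimal Python sketch translates a nucleotide sequence in its three forward reading frames, digests the resulting open reading frames in silico with trypsin-like rules, and reports peptides that are absent from a reference protein set. The function names (`translate`, `tryptic_digest`, `novel_peptides`), the minimum peptide length, and the restriction to forward frames are assumptions made for this example; they are not taken from the paper, and real pipelines additionally rely on a spectrum search engine and false discovery rate control.

```python
import re

# Standard genetic code, built from the canonical TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, frame=0):
    """Translate a DNA string starting at the given offset (0, 1 or 2)."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    # Codons with ambiguous bases are translated as 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def tryptic_digest(protein):
    """Cleave after K or R unless the next residue is P (no missed cleavages)."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", protein) if p]


def novel_peptides(dna, reference_proteins, min_len=6):
    """Return peptides from the forward-frame translations of `dna` that do
    not occur among the tryptic peptides of the reference proteins."""
    reference = set()
    for protein in reference_proteins:
        reference.update(tryptic_digest(protein))

    candidates = set()
    for frame in range(3):
        # Stop codons ('*') delimit open reading frames.
        for orf in translate(dna, frame).split("*"):
            for peptide in tryptic_digest(orf):
                if len(peptide) >= min_len:
                    candidates.add(peptide)
    return candidates - reference


if __name__ == "__main__":
    # Toy sequence encoding M-A-A-A-A-K-G-G-G-G-G-R in frame 0.
    dna = "ATGGCTGCTGCTGCTAAAGGTGGTGGTGGTGGTCGT"
    print(sorted(novel_peptides(dna, ["MAAAAK"])))
```

In practice, the candidate peptides would be written to a FASTA file and searched against the mass spectra alongside the reference database, rather than compared by string matching alone.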

citations
Cited by 673 publications
(741 citation statements)
references
References 101 publications
“…On the other hand, the original raw MS files are stored in separate repositories (e.g. PeptideAtlas, PRIDE PRoteomics IDEntifications (PRIDE), CHORUS) and more recently, the SWATHAtlas database (82,140). Ideally, this information should be integrated and centralized within the same database.…”
Section: Identification and Quantification of MHC-associated Peptides (mentioning)
confidence: 99%
“…However, due to the unconventional sequence population of the databases in proteogenomics, and their often extensive size, the estimation of false positive rates by traditional approaches can be impaired 83, 94. In the near future, it is therefore expected that the creation of such sequence databases will be coupled to ribosome profiling data, to discern the exact start of translation of putative proteins 95.…”
Section: Introduction (mentioning)
confidence: 99%
“…Proteogenomics is defined as the use of proteomic data, often derived from MS, to improve and refine genome annotation (10,11). Pioneering work by Yates et al (12) and Jaffe et al (13) opened new avenues to high-throughput gene annotation.…”
mentioning
confidence: 99%
“…Pioneering work by Yates et al (12) and Jaffe et al (13) opened new avenues to high-throughput gene annotation. In recent years, proteogenomics has emerged as a promising and indispensable approach to genome annotation (10,11). It has been applied for genome annotation including identification of novel genes, correction and validation of predicted genes in various organisms (14-21).…”
mentioning
confidence: 99%