2019
DOI: 10.3791/59589
|View full text |Cite
|
Sign up to set email alerts
|

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames

Abstract: Genome annotation is central to today's proteomic research as it draws the outlines of the proteomic landscape. Traditional models of open reading frame (ORF) annotation impose two arbitrary criteria: a minimum length of 100 codons and a single ORF per transcript. However, a growing number of studies report expression of proteins from allegedly non-coding regions, challenging the accuracy of current genome annotations. These novel proteins were found encoded either within non-coding RNAs, 5' or 3' untranslated… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
9
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6
2

Relationship

3
5

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 35 publications
0
9
0
Order By: Relevance
“…Some identified proteins and interactions are found to be different from the initial study because a different methodology was applied in the data reuse. This is a consequence of using a larger size database, including both RefProts and AltProts, then forcing the utilization of Proteome Discoverer in place of MaxQuant, following the recommendations of the OpenProt developers [ 18 , 28 ]. However, strong FDR filter is used, a unique peptide is verified for each identified protein, and a cutoff threshold sample/control of 2 is applied to define an interactor.…”
Section: Resultsmentioning
confidence: 99%
“…Some identified proteins and interactions are found to be different from the initial study because a different methodology was applied in the data reuse. This is a consequence of using a larger size database, including both RefProts and AltProts, then forcing the utilization of Proteome Discoverer in place of MaxQuant, following the recommendations of the OpenProt developers [ 18 , 28 ]. However, strong FDR filter is used, a unique peptide is verified for each identified protein, and a cutoff threshold sample/control of 2 is applied to define an interactor.…”
Section: Resultsmentioning
confidence: 99%
“…Some identified proteins and interactions are found to be different from the initial study because a different methodology was applied in the data reuse. This is a consequence of using a larger size database including both RefProts and AltProts, then forcing the utilization of Proteome Discoverer in place of Maxquant, following the recommendations of the OpenProt developers [18,26]. However, strong FDR filter is used, unique peptide is verified for each identified protein, and a cutoff threshold sample/control of 2 is applied to define an interactor.…”
Section: Resultsmentioning
confidence: 99%
“…In addition to already annotated proteins, the OpenProt database includes all predicted altProts and novel isoforms. Since large databases result in a large increase of false positive rates (Jeong et al , 2012; Nesvizhskii, 2014), this effect is balanced using an FDR of 0.001% as previously described (Brunet et al , 2020; Brunet & Roucou, 2019) (PMID: 32780568, 31033953). The protein library contained a non redundant list of all reference proteins from Uniprot (release 2019_03_01), Ensembl (GRCh38.95), and RefSeq (GRCh38.p12) (134477 proteins) in addition to all alternative protein (488956 proteins) and novel isoforms (68612 proteins) predictions from OpenProt 1.6.…”
Section: Methodsmentioning
confidence: 99%