2014
DOI: 10.1002/pmic.201300078
|View full text |Cite
|
Sign up to set email alerts
|

SpliceProt: A protein sequence repository of predicted human splice variants

Abstract: The mechanism of alternative splicing in the transcriptome may increase the proteome diversity in eukaryotes. In proteomics, several studies aim to use protein sequence repositories to annotate MS experiments or to detect differentially expressed proteins. However, the available protein sequence repositories are not designed to fully detect protein isoforms derived from mRNA splice variants. To foster knowledge for the field, here we introduce SpliceProt, a new protein sequence repository of transcriptome expe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 16 publications
(20 citation statements)
references
References 27 publications
0
20
0
Order By: Relevance
“…We anticipate that our inability to detect additional splice variants was due to a low sample abundance issue in addition to the fact that some splice junctions will simply not be amenable to detection using trypsin. This approach is helpful in identifying splice variants in both large-scale and single protein analyses as a companion to searches that make use of important splice variant databases [20, 21]. …”
Section: Resultsmentioning
confidence: 99%
“…We anticipate that our inability to detect additional splice variants was due to a low sample abundance issue in addition to the fact that some splice junctions will simply not be amenable to detection using trypsin. This approach is helpful in identifying splice variants in both large-scale and single protein analyses as a companion to searches that make use of important splice variant databases [20, 21]. …”
Section: Resultsmentioning
confidence: 99%
“…Following a similar strategy of concatenating peptide sequences to reference sequence databases, Tavares et al [9] designed a customized database (SpliceProt) comprising a reference database (Uniprot/SwissProt) and peptide sequences from dbEST, RefSeq, and RNA-Seq data. Using the same proteomic data and statistical assignment presented by Sheynkman et al [8], the usage of the SpliceProt sequence repository yielded the identification of 54 novel peptides that were not present in the UniprotKB/TrEMBL data set.…”
Section: Database Design Using Rna-seq To Identify Novel Splice Variantsmentioning
confidence: 99%
“…For example, RNA-Seq enabled the discovery of new potential splice isoforms [8,9] and the analysis of transcript expression levels [10] in different cell types and conditions. The methods of sequencing and the assembly of reads have been compared [11] and improved over the past few years, promoting new perspectives in transcriptomics [12], including noncoding RNAs (reviewed in [13]).…”
Section: Introductionmentioning
confidence: 99%
“…Beyond standard proteomic databases, several specialized databases have been created that focus on a particular variation type, such as SNPs (6167), splice variants (6870), or chimeric transcripts (47, 71). In many cases, these databases were designed so that the relevant amino acid sequences corresponding to the variation could be downloaded in a format amenable to MS database searching.…”
Section: Proteogenomic Database Constructionmentioning
confidence: 99%
“…To identify protein isoforms, the generic protein database must include the relevant isoform sequences. In the case where an isoform is not in the database, one can use a comprehensive splice database such as ECGene or SpliceProt (68, 70) to identify peptides corresponding to novel isoforms specific to a sample or relevant to a disease (6870, 105, 127). However, though the number and size of splice-specific databases is high, RNA-Seq data collected on hundreds of sample types have shown that the catalog of human isoforms is still incomplete (128).…”
Section: Complexity Of the Human Proteomementioning
confidence: 99%