2015
DOI: 10.1021/acs.jproteome.5b00490
|View full text |Cite
|
Sign up to set email alerts
|

PPLine: An Automated Pipeline for SNP, SAP, and Splice Variant Detection in the Context of Proteogenomics

Abstract: The fundamental mission of the Chromosome-Centric Human Proteome Project (C-HPP) is the research of human proteome diversity, including rare variants. Liver tissues, HepG2 cells, and plasma were selected as one of the major objects for C-HPP studies. The proteogenomic approach, a recently introduced technique, is a powerful method for predicting and validating proteoforms coming from alternative splicing, mutations, and transcript editing. We developed PPLine, a Python-based proteogenomic pipeline providing au… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
35
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 67 publications
(38 citation statements)
references
References 57 publications
0
35
0
Order By: Relevance
“…Processing of transcriptomic data was performed using PPLine toolkit [8] including read preprocessing (trimmomatic), mapping (STAR) and counting (HTSeq-count). The further analysis was done with R programming language (R core Team).…”
Section: Methodsmentioning
confidence: 99%
“…Processing of transcriptomic data was performed using PPLine toolkit [8] including read preprocessing (trimmomatic), mapping (STAR) and counting (HTSeq-count). The further analysis was done with R programming language (R core Team).…”
Section: Methodsmentioning
confidence: 99%
“…The results revealed 2172 and 149 differentially expressed splicesoforms respectively including RAC1, OSBPL3, MKI67, and SYK . PPLine is a python‐based proteogenomic pipeline assisting discovery of SAPs, INDELs, and ASVs from transcriptome and exome sequence data, besides facilitating the annotation and filtration of SNPs and the prediction of proteotypic peptides …”
Section: Current Development In Enabling Technologiesmentioning
confidence: 99%
“… Datasets to map the canonical and spliced forms of missing proteins of Chr18 in the liver tissue ( a ) and in the HepG2 cell line ( b ). PE: protein evidence according to neXtProt; DB: information from several mass-spectrometry databases on protein detection in the biosample; Chr18 HPP transcriptomic data [ 19 , 20 , 26 ]; CF: level of expression of canonical form; S2–S7: levels of expression of the splice forms. Colored boxes represent the quantitative value assigned to the descriptor.…”
Section: Figurementioning
confidence: 99%