2013
DOI: 10.1021/pr400820p
|View full text |Cite
|
Sign up to set email alerts
|

Tools to Covisualize and Coanalyze Proteomic Data with Genomes and Transcriptomes: Validation of Genes and Alternative mRNA Splicing

Abstract: Direct links between proteomic and genomic/transcriptomic data are not frequently made, partly because of lack of appropriate bioinformatics tools. To help address this, we have developed the PG Nexus pipeline. The PG Nexus allows users to covisualize peptides in the context of genomes or genomic contigs, along with RNA-seq reads. This is done in the Integrated Genome Viewer (IGV). A Results Analyzer reports the precise base position where LC-MS/MS-derived peptides cover genes or gene isoforms, on the chromoso… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
60
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 40 publications
(60 citation statements)
references
References 72 publications
0
60
0
Order By: Relevance
“…However, in both cases, DBs specific for the conditions studied are generated, requiring bioinformatics expertise and limiting the general applicability of the resource. The GFF file we provide can be very valuable for other proteogenomics software solutions like GenoSuite (Kumar et al 2013), PGP (Tovchigrechko et al 2014), and PG Nexus (Pang et al 2014), which allow users to search their data against a six-frame translation and later visualize identified peptides onto a genome sequence but lack integrated and consolidated annotations.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…However, in both cases, DBs specific for the conditions studied are generated, requiring bioinformatics expertise and limiting the general applicability of the resource. The GFF file we provide can be very valuable for other proteogenomics software solutions like GenoSuite (Kumar et al 2013), PGP (Tovchigrechko et al 2014), and PG Nexus (Pang et al 2014), which allow users to search their data against a six-frame translation and later visualize identified peptides onto a genome sequence but lack integrated and consolidated annotations.…”
Section: Discussionmentioning
confidence: 99%
“…However, they do not integrate different annotations of the same genome. PG Nexus (Pang et al 2014) uses the NCBI RefSeq annotation, a Glimmer ab initio prediction (Delcher et al 2007), and a six-frame translation against which peptides are searched with Mascot (Perkins et al 1999) and later visualized onto the genome. However, the annotations are not integrated and consolidated; the boundaries of novel ORFs still have to be discovered based on peptide evidence, which requires substantial manual effort.…”
Section: A General Integrative Proteogenomics Approachmentioning
confidence: 99%
See 2 more Smart Citations
“…It follows naturally that the ideal unified coordinate system for proteogenomics should remain genomic in nature. Indeed, effective tools that can map MS-based proteomics results onto genomic coordinates have recently become available (Peppy, 2 Proteogenomic Mapping Tool, 3 Pepline, 4 MS-Dictionary, 5 GappedDictionary, 6 IggyPep, 7 MSProGene, 8 ProteoAnnotator, 9 PGNexus, 10 and GalaxyP 11 ); however, these tools are usually couched in a relatively involved and comprehensive pipeline (e.g., the GalaxyP pipeline consists of up to 140 steps) and typically impose a specific mass-informatic 12 workflow on the practitioner, by, for example, requiring the generation of short peptide sequence tags (PSTs) or some complex form of de novo peptide sequencing followed by a lookup against the full six-frame translation of the genomic sequence. Our experience suggests that a more common scenario involves the production, by the genomic arm of the workflow, of a (liberally) predicted proteome (containing what is assumed to be a superset of the observable proteome) so as to leverage existing PSM search engines (such as Mascot, 13 Sequest, 14 X!Tandem 15 ) that require a straightforward representation of the predicted proteome (in the form of a FASTA file).…”
Section: Introductionmentioning
confidence: 99%