2009
DOI: 10.1101/gr.080531.108
|View full text |Cite
|
Sign up to set email alerts
|

The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes

Abstract: Effective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
453
2
1

Year Published

2009
2009
2016
2016

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 494 publications
(460 citation statements)
references
References 32 publications
2
453
2
1
Order By: Relevance
“…2 A ), using the SeqCap EZ Human Exome Library v2.0 (Roche NimbleGen, Madison, WI, USA),15 and DNA sequences determined using a 100‐bp paired‐end read protocol on an Illumina HiSeq2000 platform 15. A minimum of 20× vertical read depth was obtained for >88% of the coding exome, as specified by the consensus coding sequence (CCDS) project,16 in both individuals. Reads were aligned to the Human Sequence version 37d5 (hs37d5) reference genome using Stampy17 and variant calling of single nucleotide variants (SNVs) and short insertions and deletions (indels) was undertaken using Platypus (v0.5.1) 18.…”
Section: Methodsmentioning
confidence: 99%
“…2 A ), using the SeqCap EZ Human Exome Library v2.0 (Roche NimbleGen, Madison, WI, USA),15 and DNA sequences determined using a 100‐bp paired‐end read protocol on an Illumina HiSeq2000 platform 15. A minimum of 20× vertical read depth was obtained for >88% of the coding exome, as specified by the consensus coding sequence (CCDS) project,16 in both individuals. Reads were aligned to the Human Sequence version 37d5 (hs37d5) reference genome using Stampy17 and variant calling of single nucleotide variants (SNVs) and short insertions and deletions (indels) was undertaken using Platypus (v0.5.1) 18.…”
Section: Methodsmentioning
confidence: 99%
“…The extent of human-specific genic constraint was estimated for each of the 24 co-expression modules by using the genic protein-coding intolerance scores (RVIS) 34 . RVIS was only calculated for protein-coding genes that had at least one protein-coding transcript that was publically approved among the CCDS Release 9 database 74 , and that had ≥70% of their CCDS realestate adequately covered among the population database adopted in their original manuscript (ESP6500) 34 . This resulted in scores for 16,956 assessable CCDS release 9 genes, thus all RVIS comparisons are restricted to these 16,956 "assessable" genes.…”
Section: Gwas-enrichment Analysismentioning
confidence: 99%
“…We focussed on genes that have an unambiguous annotation of a coding sequence from the consensus coding sequence project (CCDS) 39 . We removed genes that overlapped with other genes or those whose tested coding sequences were < 100 bp in size.…”
Section: Replication Timingmentioning
confidence: 99%
“…We focussed on regions classified concordantly by all of the three gene definitions. Coding regions (coding) were obtained from the consensus CDS project 39 .…”
Section: Replication Timingmentioning
confidence: 99%