2019
DOI: 10.1002/bies.201900066

The Protein‐Coding Human Genome: Annotating High‐Hanging Fruits

Abstract: The major transcript variants of human protein-coding genes are annotated to a certain degree of accuracy combining manual curation, transcript data, and proteomics evidence. However, there is considerable disagreement on the annotation of about 2000 genes (they can be protein-coding, noncoding, or pseudogenes) and on the annotation of most of the predicted alternative transcripts. Pure transcriptome mapping approaches seem to be limited in discriminating functional expression from noise. These limitations have …

Citations: cited by 18 publications (12 citation statements)
References: 151 publications (134 reference statements)
“…Setting aside the controversy in cell type definition (110), our work already provides tools and best practices to achieve better reference annotations and to share the gene signatures that capture the knowledge about how they were derived, which is novel compared to most current studies. As the human reference genome, which does not ultimately reflect a human genome consensus (111), but serves many practical purposes (112), accelerated genomic research, such reference cell type annotations will accelerate our understanding of biological systems even though they reflect only a subset of a cell's characteristics.…”
Section: Discussion (mentioning)
confidence: 99%
“…However, using the GENCODE transcript set as a reference increased the sensitivity, with the sensitivity of coding transcripts of some samples going above 60% and the noncoding sensitivity of some samples reaching 43%. Previous studies have reported that some genuine transcripts are missing in the GENCODE annotation [25,42,47,48]. Therefore, we repeated the analyses using the hPSC filtered transcript set [42].…”
Section: Impacts Of Sequencing Depth On Human Pluripotent Stem Cell Transcript Assembly (mentioning)
confidence: 99%
“…The sequencing of the human genome and that of other species has made great strides in cataloguing and annotating protein-coding genes. Systematic proteomic approaches have validated the existence of proteins for almost 20,000 protein-coding genes 1 . Notwithstanding, nearly 10% of these genes still lack functional annotation 2 .…”
Section: Introduction (mentioning)
confidence: 99%