2021
DOI: 10.1038/s41467-021-21790-4
|View full text |Cite
|
Sign up to set email alerts
|

Prioritizing non-coding regions based on human genomic constraint and sequence context with deep learning

Abstract: Elucidating functionality in non-coding regions is a key challenge in human genomics. It has been shown that intolerance to variation of coding and proximal non-coding sequence is a strong predictor of human disease relevance. Here, we integrate intolerance to variation, functional genomic annotations and primary genomic sequence to build JARVIS: a comprehensive deep learning model to prioritize non-coding regions, outperforming other human lineage-specific scores. Despite being agnostic to evolutionary conser… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
49
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 51 publications
(52 citation statements)
references
References 49 publications
3
49
0
Order By: Relevance
“…For most regions, even nominal significance was not detected using burden testing indicating the likelihood of variants with bidirectional effects further complicating clinical interpretation. When burden signals were detected, observed effects were typically larger than common non-coding variants and less than rare coding variants, with the exception of LDLR , consistent with whole genome mutational constraint models 49, 50, 51 .…”
Section: Discussionsupporting
confidence: 60%
“…For most regions, even nominal significance was not detected using burden testing indicating the likelihood of variants with bidirectional effects further complicating clinical interpretation. When burden signals were detected, observed effects were typically larger than common non-coding variants and less than rare coding variants, with the exception of LDLR , consistent with whole genome mutational constraint models 49, 50, 51 .…”
Section: Discussionsupporting
confidence: 60%
“…We compared our metric with other four genome-wide predictive scores – Orion 13 , CDTS 14 , gwRVIS 17 , and JARVIS 17 ) – in their correlation with experimental measurements on 11 enhancers tested by MAVE 77 . Each predictive score was downloaded from the original study, lifted over to GRCh38 (for Orion), and applied to score enhancers by taking the average over corresponding genomic regions.…”
Section: Methodsmentioning
confidence: 99%
“…Quantifying the depletion of natural variation in human populations provides a powerful approach to identify variants of large effect [1][2][3][4][5][6][7][8] . Since variants causing severe early-onset disorders are under selective pressure, they are observed less often than functionally neutral variants.…”
Section: Mainmentioning
confidence: 99%
“…Since variants causing severe early-onset disorders are under selective pressure, they are observed less often than functionally neutral variants. Such depletion of genetic variation (constraint) has been shown to provide strong evidence to prioritise diseaseassociated genes [1][2][3] , identify critical regions within genes 4,5 , and investigate the effect of non-coding variants [6][7][8] .…”
Section: Mainmentioning
confidence: 99%