2019
DOI: 10.1101/784439
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Evaluating the informativeness of deep learning annotations for human complex diseases

Abstract: Deep learning models have shown great promise in predicting genome-wide regulatory effects from DNA sequence, but their informativeness for human complex diseases and traits is not fully understood. Here, we evaluate the disease informativeness of two types of deep learning annotations: (1) variant-level annotations (based on the reference allele), assessing whether they are more informative for complex disease than the underlying experimental data used to train the predictive models; and (2) allelic-effect an… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
18
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(19 citation statements)
references
References 79 publications
1
18
0
Order By: Relevance
“…We conditioned all analyses on a "baseline-LD-deep model" defined by 86 coding, conserved, regulatory and LD-related annotations from the baseline-LD model (v2.1) 34,35 and 14 additional jointly significant annotations from ref. 40 : 1 non-tissue-specific allelic-effect Basenji annotation, 3 Roadmap and 5 ChromHMM annotations and 5 other annotations (100 annotations total) ( Table ?? and Table ??…”
Section: Introductionmentioning
confidence: 99%
“…We conditioned all analyses on a "baseline-LD-deep model" defined by 86 coding, conserved, regulatory and LD-related annotations from the baseline-LD model (v2.1) 34,35 and 14 additional jointly significant annotations from ref. 40 : 1 non-tissue-specific allelic-effect Basenji annotation, 3 Roadmap and 5 ChromHMM annotations and 5 other annotations (100 annotations total) ( Table ?? and Table ??…”
Section: Introductionmentioning
confidence: 99%
“…Basenji employs a Poisson likelihood model to analyze 130kb of human reference sequence around each SNP using dilated convolutional layers. Our previous work 16 focused on unsigned (absolute) allelic-effect annotations for DNase and three histone marks, H3K27ac, H3K4me1 and H3K4me3 (associated with active enhancers and promoters). Here, we integrate signed allelic-effect annotations with other types of data -fine-mapped SNPs, SNPs linked to genes, and gene expression -to generate more disease-informative unsigned annotations.…”
Section: Overview Of Methodsmentioning
confidence: 99%
“…The boosted allelic-effect annotations derived from DeepBoost (DeepSEA∆-boosted and Basenji∆-boosted; we use ∆ to denote allelic-effect annotations) were only moderately correlated with allelic-effect annotations derived from a simple maximum of published deep learning annotations across 4 chromatin marks (DNase, H3K27ac, H3K4me1 and H3K4me3) and 27 blood cell types, as in ref. 16 (DeepSEA∆-published and Basenji∆-published) (average r = 0.35) ( Figure S2). We broadly investigated which features of the DeepSEA and Basenji models contributed the most to the DeepSEA∆-boosted and Basenji∆-boosted annotations by applying Shapley Additive Explanation (SHAP) 36 , a widely used tool for interpreting machine-learning models.…”
Section: Deepboost Deep Learning Annotations Restricted To Snps Implimentioning
confidence: 99%
See 2 more Smart Citations