2019
DOI: 10.1038/s41467-018-08270-y
|View full text |Cite
|
Sign up to set email alerts
|

Biological relevance of computationally predicted pathogenicity of noncoding variants

Abstract: Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessmen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

5
47
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 47 publications
(52 citation statements)
references
References 75 publications
5
47
0
Order By: Relevance
“…Even though most eQTL span 100 or more polymorphisms in a credible interval, the general assumption is that prioritizing variants according to functional criteria and evolutionary conservation, using scores, such as CADD or LINSIGHT, reduces the search space to fewer than ten candidates. However, given that these variants are in tight linkage disequilibrium with similar frequencies [10], if they have similar functional scores, then it is possible that the observed univariate eQTL effect is actually due to the summation of two or smaller contributing effects. Under this scenario, the power to detect multiple causal variants is also reduced.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Even though most eQTL span 100 or more polymorphisms in a credible interval, the general assumption is that prioritizing variants according to functional criteria and evolutionary conservation, using scores, such as CADD or LINSIGHT, reduces the search space to fewer than ten candidates. However, given that these variants are in tight linkage disequilibrium with similar frequencies [10], if they have similar functional scores, then it is possible that the observed univariate eQTL effect is actually due to the summation of two or smaller contributing effects. Under this scenario, the power to detect multiple causal variants is also reduced.…”
Section: Discussionmentioning
confidence: 99%
“…Third, the majority of the risk loci are located in non-coding regions of genes [7,8], where they exert their function through regulation of gene expression. Tools for predicting the function of such causal variants generally have low predictive value [9,10].…”
Section: Introductionmentioning
confidence: 99%
“…Third, our S-LDSC analyses are inherently focused on common variants, but deep learning models have also shown promise in prioritizing rare pathogenic variants 4,8,51 . The value of deep learning models for prioritizing rare pathogenic variants has been questioned in a recent analysis focusing on Human Gene Mutation Database (HGMD) variants 52 , meriting further investigation. Fourth, we focused here on deep learning models trained using human data, but models trained using data from other species may also be informative for human disease 24,53 .…”
Section: Discussionmentioning
confidence: 99%
“…Although the distribution of CADD scores was significantly higher for the reported-peak variants, suggesting elevated likelihood that they are pathogenic ( Figure 4A for CAGE, and Figure 4B for FHS), the magnitude of the effect is small relative to the variance in CADD scores. Correspondingly, the positive predictive value for each SNP is low and functional discrimination of primary and secondary signals by this measure is poor (see also Liu et al 2019). Potential causal variants defined by fine-mapping also have only a slightly elevated probability of locating within regulatory enhancers in the human genome as defined by the deltaSVM score.…”
Section: Biological Annotation Of Detected Multiple Eqtlsmentioning
confidence: 99%