2014
DOI: 10.1038/nmeth.2832
|View full text |Cite
|
Sign up to set email alerts
|

Functional annotation of noncoding sequence variants

Abstract: Identifying functionally relevant variants against the background of ubiquitous genetic variation is a major challenge in human genetics. For variants that fall in protein-coding regions our understanding of the genetic code and splicing allow us to identify likely candidates, but interpreting variants that fall outside of genic regions is more difficult. Here we present a new tool, GWAVA, which supports prioritisation of non-coding variants by integrating a range of annotations.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

8
555
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 510 publications
(577 citation statements)
references
References 25 publications
(32 reference statements)
8
555
0
Order By: Relevance
“…Among these features, H3K36me3 is associated with actively transcribed genes, and H3K4me3 is a hallmark of actively transcribed protein-coding promoters in eukaryotes (51). These findings support the fact that conserved and regulatory elements are critical to the formation and functionality of pathogenic variants in the non-coding genome (52). The area under the ROC curve was 0.89, which outperformed two well-known tools CADD and funSeq2 (14), however, more stringent comparison must be conducted to obtain a final conclusion.…”
Section: Discussionsupporting
confidence: 55%
“…Among these features, H3K36me3 is associated with actively transcribed genes, and H3K4me3 is a hallmark of actively transcribed protein-coding promoters in eukaryotes (51). These findings support the fact that conserved and regulatory elements are critical to the formation and functionality of pathogenic variants in the non-coding genome (52). The area under the ROC curve was 0.89, which outperformed two well-known tools CADD and funSeq2 (14), however, more stringent comparison must be conducted to obtain a final conclusion.…”
Section: Discussionsupporting
confidence: 55%
“…Abbreviations are as follows: BMI, body mass index; WHR, waist to hip ratio; WaistBMIadj, waist circumference adjusted for BMI; HipBMIadj, hip circumference adjusted for BMI; WHRBMIadj, waist to hip ratio adjusted for BMI; TFM, total fat mass; TLM, total lean mass; TRFM, trunk fat mass. from the two fine-mapping methods, two functional prediction scores (Genome Wide Annotation of Variants 54 [GWAVA] and GERP scores), and eQTL analysis (Figures 3 and S28). Of the 30 regions, 6 were fine-mapped to a coding variant (5 missense and 1 synonymous) and 9 were fine-mapped to a variant that was identified as an eQTL.…”
Section: Fine-mappingmentioning
confidence: 99%
“…We used several bioinformatics approaches (LDlink,30 RegulomeDB,31 Genome‐Wide Annotation of Variants [GWAVA],32 and Data‐driven Expression Prioritized Integration for Complex Traits [DEPICT]33) to search and annotate SNPs in the regions containing genome‐wide significant SNPs. Publicly available reference haplotypes from Phase 3 (Version 5) of the 1000 Genomes Project (1000G)34 were used to calculate population‐specific measures of linkage disequilibrium (LD)30 in whites.…”
Section: Methodsmentioning
confidence: 99%
“…RegulomeDB31 integrates the RoadMap Epigenomics and ENCODE projects to identify variants which have potential or demonstrated regulatory function, and predicts potential mechanisms of functional involvement. We used a GWAVA32 score to identify loci with likely functional non‐coding variants. We used an unmatched GWAVA score threshold ≥0.35, GWAVA transcription start site (TSS) score threshold ≥0.45, and region GWAVA score threshold ≥0.55 to mark potentially functional loci.…”
Section: Methodsmentioning
confidence: 99%