2014
DOI: 10.1002/gepi.21797
|View full text |Cite
|
Sign up to set email alerts
|

Estimating Genome‐Wide Significance for Whole‐Genome Sequencing Studies

Abstract: Although a standard genome-wide significance level has been accepted for the testing of association between common genetic variants and disease, the era of whole-genome sequencing (WGS) requires a new threshold. The allele frequency spectrum of sequence-identified variants is very different from common variants, and the identified rare genetic variation is usually jointly analyzed in a series of genomic windows or regions. In nearby or overlapping windows, these test statistics will be correlated, and the degr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

2
69
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 78 publications
(71 citation statements)
references
References 15 publications
2
69
0
Order By: Relevance
“…The study design differences between 1KG and GoNL may have also impacted the number of variants available for simulation and testing, though both projects had sufficient depth of coverage to detect the vast majority of variants with frequency >0.5% (Abecasis et al., , Auton et al., ; Francioli et al., ). Despite these drawbacks, our estimate for genome‐wide significance in Europeans is consistent with previously published work that examined the joint multiple testing burden for single‐variant and set‐based (i.e., genic burden testing) approaches ( P ∼ 5.85–8.43 × 10 −9 ; Xu et al., ), where single variants likely contributed far more to the overall testing burden than did genic set tests. Further, our null simulations likely yield more precise estimates of genome‐wide significance than methods that prune SNPs on the basis of their LD (Fadista et al., ), as it is difficult to know which threshold will accurately identify all possible independent tests.…”
Section: Discussionsupporting
confidence: 88%
See 1 more Smart Citation
“…The study design differences between 1KG and GoNL may have also impacted the number of variants available for simulation and testing, though both projects had sufficient depth of coverage to detect the vast majority of variants with frequency >0.5% (Abecasis et al., , Auton et al., ; Francioli et al., ). Despite these drawbacks, our estimate for genome‐wide significance in Europeans is consistent with previously published work that examined the joint multiple testing burden for single‐variant and set‐based (i.e., genic burden testing) approaches ( P ∼ 5.85–8.43 × 10 −9 ; Xu et al., ), where single variants likely contributed far more to the overall testing burden than did genic set tests. Further, our null simulations likely yield more precise estimates of genome‐wide significance than methods that prune SNPs on the basis of their LD (Fadista et al., ), as it is difficult to know which threshold will accurately identify all possible independent tests.…”
Section: Discussionsupporting
confidence: 88%
“…We also do not address genic burden testing approaches, designed to test a collection of rare variants in the same gene or window in association to disease (Lee et al., ). A genome‐wide significance estimate already exists for these tests ( P ∼ 2.68–7.32 ×10 −8 ; Xu et al., ). Though burden tests may improve power through variant aggregation, it remains difficult to know which variants exactly should be included in such a test.…”
Section: Discussionmentioning
confidence: 99%
“…By randomly generating case‐control phenotypes for the 1000 Genomes data, Kanai, Tanaka and Okada () also obtained genomewide significance thresholds that are more liberal than ours. On the other hand, by using the WGS data from chromosome 3 in the UK10K project and randomly assigning normally distributed phenotypes and then extrapolating from the length of chromosome 3 to the length of the whole genome, Xu et al () obtained genomewide significance thresholds that are slightly more stringent than ours.…”
Section: Discussionmentioning
confidence: 64%
“…We tested bi-allelic single-nucleotide variants (SNVs) with MAF ≥ 0.5% for association, declaring genome-wide statistical significance at P ≤ 1.2 × 10 −8 (accounting for all independent SNVs above this MAF threshold; Supplementary Methods) 18 . The sequence kernel association test (SKAT) was used to assess association of regions containing SNVs with MAF ≤ 5% and ≤ 1%.…”
mentioning
confidence: 99%