2019
DOI: 10.1016/j.ajhg.2019.10.008
|View full text |Cite
|
Sign up to set email alerts
|

A Fast and Accurate Method for Genome-wide Scale Phenome-wide G × E Analysis and Its Application to UK Biobank

Abstract: The etiology of most complex diseases involves genetic variants, environmental factors, and gene-environment interaction (G 3 E) effects. Compared with marginal genetic association studies, G 3 E analysis requires more samples and detailed measure of environmental exposures, and this limits the possible discoveries. Large-scale population-based biobanks with detailed phenotypic and environmental information, such as UK-Biobank, can be ideal resources for identifying G 3 E effects. However, due to the large com… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
23
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

3
5

Authors

Journals

citations
Cited by 22 publications
(23 citation statements)
references
References 53 publications
0
23
0
Order By: Relevance
“…For example, statistical power for testing GEI with a binary environmental exposure depends on the minor allele counts in both exposed and unexposed groups. Although computationally efficient GEI tests for biobank‐scale studies have been developed recently in the context of single‐variant tests on unrelated individuals (Bi et al, 2019), critical methodological bottlenecks still exist to expand the sample size and scope of rare variant GEI analyses in large biobank‐scale sequencing studies. To increase power for rare variants, various set‐based methods have been developed to collapse variants in a particular gene or functional region to investigate how variants in a set affect a phenotype synergistically (Chen et al, 2019; Lee, Wu, & Lin, 2012; Pan, Kim, Zhang, Shen, & Wei, 2014; Sun, Zheng, & Hsu, 2013), and to demonstrate whether genetic associations with the phenotype are modified by environment factors in GEI studies (Chen, Meigs, & Dupuis, 2014; Lin et al, 2016; Su, Di, & Hsu, 2017).…”
Section: Introductionmentioning
confidence: 99%
“…For example, statistical power for testing GEI with a binary environmental exposure depends on the minor allele counts in both exposed and unexposed groups. Although computationally efficient GEI tests for biobank‐scale studies have been developed recently in the context of single‐variant tests on unrelated individuals (Bi et al, 2019), critical methodological bottlenecks still exist to expand the sample size and scope of rare variant GEI analyses in large biobank‐scale sequencing studies. To increase power for rare variants, various set‐based methods have been developed to collapse variants in a particular gene or functional region to investigate how variants in a set affect a phenotype synergistically (Chen et al, 2019; Lee, Wu, & Lin, 2012; Pan, Kim, Zhang, Shen, & Wei, 2014; Sun, Zheng, & Hsu, 2013), and to demonstrate whether genetic associations with the phenotype are modified by environment factors in GEI studies (Chen, Meigs, & Dupuis, 2014; Lin et al, 2016; Su, Di, & Hsu, 2017).…”
Section: Introductionmentioning
confidence: 99%
“…This strategy can also be applied to other cases. Recently, scalable methods were proposed for a genome-wide G × E analysis ( Bi et al, 2019 ; Wang et al, 2020 ). When testing G × E effect, the null model should include marginal genetic effect.…”
Section: Statistical and Computational Challenges In Biobank Data Analysis And Approaches To Addressing Themmentioning
confidence: 99%
“…However, the matrix projection approach might be inaccurate if the marginal genetic effect is large. To balance the computational efficiency and accuracy, SPAGE ( Bi et al, 2019 ) uses a hybrid strategy as follows. If the marginal genetic effect is small or moderate (e.g., p value > 5e-3), the matrix projection is used.…”
Section: Statistical and Computational Challenges In Biobank Data Analysis And Approaches To Addressing Themmentioning
confidence: 99%
“…8,10 Another resource-efficient approach, fastGWA, is to use sparse GRM to adjust for the sample relatedness. 11 For binary phenotype analysis, unbalanced case-control ratio can result in inflated type I error rates and saddlepoint approximation (SPA) has been demonstrated to be more accurate for single-variant analysis 7,8 , region-based analysis 12,13 , and gene-environment interaction analysis 14 . Similarly, the sample size distribution in ordinal categorical data could also be highly unbalanced, that is, the sample size in one category could be dozens of times more than the that in other categories.…”
Section: Mainmentioning
confidence: 99%