2020
DOI: 10.1186/s12859-020-3368-2
|View full text |Cite
|
Sign up to set email alerts
|

GenEpi: gene-based epistasis discovery using machine learning

Abstract: Background: Genome-wide association studies (GWAS) provide a powerful means to identify associations between genetic variants and phenotypes. However, GWAS techniques for detecting epistasis, the interactions between genetic variants associated with phenotypes, are still limited. We believe that developing an efficient and effective GWAS method to detect epistasis will be a key for discovering sophisticated pathogenesis, which is especially important for complex diseases such as Alzheimer's disease (AD). Resul… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 26 publications
(27 citation statements)
references
References 44 publications
0
27
0
Order By: Relevance
“… 41 Values across all studies were assessed to be below this threshold. The study with the highest EPV of 9.43 was Chang et al 42 The lowest EPV, 0.0018, was found for Wei et al 29 …”
Section: Resultsmentioning
confidence: 93%
“… 41 Values across all studies were assessed to be below this threshold. The study with the highest EPV of 9.43 was Chang et al 42 The lowest EPV, 0.0018, was found for Wei et al 29 …”
Section: Resultsmentioning
confidence: 93%
“…To identify joint effects of SNPs on PD risk, we used a recently developed computational package called GenEpi [ 3 ], which applies a gene-based machine learning approach to discover pair-wise epistasis associated with a phenotype. In GenEpi, the first step is to group genetic variants by a set of loci (i.e., genes) in the genome using gene information available in the UCSC human genome annotation database [ 14 ] followed by dimensionality reduction of genetic features in each locus using LD which involves grouping of features into LD blocks using a given r 2 and D ’ threshold and selection of the features with the largest MAF to represent each block.…”
Section: Methodsmentioning
confidence: 99%
“…The genetic resolution of those large genotyped datasets can be increased via imputation of unobserved common and rare variants with advanced genotype imputation software which use new dense whole genome sequence (WGS)-based haplotype reference panels [ 2 ]. In addition, new methodologies have been recently developed using machine learning approaches to efficiently discover joint genetic effects of variants contributing towards complex disease risk by tackling the challenges of traditional GWAS, i.e., low statistical power to detect conditional and interaction effects due to the high dimensionality and multiple-test burden [ 3 ].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Epistatic interactions have been proposed as a potential source of genetic variation [18,19]. This leads to the third misconception that higher-order gene-gene interactions, which could, in principle, be detected by machine learning algorithms [20,21], would explain a lot of genetic variation. However, data and theory show that it is almost certainly not true and that additive variance is expected to remain the main contributor of genetic variation, even in the presence of high-order interactions [8,13].…”
Section: Persistent Misconceptionsmentioning
confidence: 99%