2022
DOI: 10.1093/bib/bbac022
|View full text |Cite
|
Sign up to set email alerts
|

Deep learning-based identification of genetic variants: application to Alzheimer’s disease classification

Abstract: Deep learning is a promising tool that uses nonlinear transformations to extract features from high-dimensional data. Deep learning is challenging in genome-wide association studies (GWAS) with high-dimensional genomic data. Here we propose a novel three-step approach (SWAT-CNN) for identification of genetic variants using deep learning to identify phenotype-related single nucleotide polymorphisms (SNPs) that can be applied to develop accurate disease classification models. In the first step, we divided the wh… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
32
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 37 publications
(32 citation statements)
references
References 76 publications
0
32
0
Order By: Relevance
“…Based on our XGBoost algorithm model, the results are quite di erent when applied to di erent datasets. e important features correspond to the following AAL regions, i.e., superior parietal gyrus (59), parahippocampal gyrus (40), right cerebellum 7b (102), left cerebellum 6 (99), left postcentral gyrus (57), right inferior frontal gyrus, opercular part (12), right middle temporal gyrus (86), left middle temporal gyrus (85), left temporal pole: middle temporal gyrus (87), left cerebellum 10 (107), vermis 6 (112), left cerebellum 9 (105), right amygdala (42), and left supramarginal gyrus (63). We counted the top 20 features in ve SHAP algorithm graphs.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Based on our XGBoost algorithm model, the results are quite di erent when applied to di erent datasets. e important features correspond to the following AAL regions, i.e., superior parietal gyrus (59), parahippocampal gyrus (40), right cerebellum 7b (102), left cerebellum 6 (99), left postcentral gyrus (57), right inferior frontal gyrus, opercular part (12), right middle temporal gyrus (86), left middle temporal gyrus (85), left temporal pole: middle temporal gyrus (87), left cerebellum 10 (107), vermis 6 (112), left cerebellum 9 (105), right amygdala (42), and left supramarginal gyrus (63). We counted the top 20 features in ve SHAP algorithm graphs.…”
Section: Discussionmentioning
confidence: 99%
“…e resulting accuracy by the XGBoost algorithm is 0.76 ± 0.01, and the AUC is 0.86 ± 0.01. Jo et al [59] proposed a novel three-step approach (SWAT-CNN) for the identi cation of genetic variants using deep learning to identify phenotyperelated single nucleotide polymorphisms (SNPs) that can be applied to develop accurate disease classi cation models, and the AUC of this model is 0.82. A machine learning framework proposed in this paper for MCI detection achieved an accuracy of 65.14% when using the mPerAF dataset.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Classification of cases and controls for certain diseases is also performed using DNA methylation data. Examples of machine learning applications using epigenetic data include classification of coronary heart disease, neurodevelopmental syndromes, schizophrenia, Alzheimer's disease, psychiatric disorders and others [33][34][35][36][37][38][39].…”
Section: Introduction 1backgroundmentioning
confidence: 99%
“…Different approaches [2][3][4] like polygenic risk score (PRS) and wide-range of linear models have been proposed for risk prediction of complex diseases based on the genotype-phenotype associations for variants identified by GWAS. More recently, with increased data availability, nonlinear methods like deep-learning 5 have been considered for constructing prediction models 6 .…”
Section: Introductionmentioning
confidence: 99%