2018
DOI: 10.1002/ajmg.b.32638
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning in schizophrenia genomics, a case‐control study using 5,090 exomes

Abstract: Our hypothesis is that machine learning (ML) analysis of whole exome sequencing (WES) data can be used to identify individuals at high risk for schizophrenia (SCZ). This study applies ML to WES data from 2,545 individuals with SCZ and 2,545 unaffected individuals, accessed via the database of genotypes and phenotypes (dbGaP). Single nucleotide variants and small insertions and deletions were annotated by ANNOVAR using the reference genome hg19/GRCh37. Rare (predicted functional) variants with a minor allele fr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 34 publications
(31 citation statements)
references
References 98 publications
0
24
0
Order By: Relevance
“…Furthermore, stratifying depression using genetic data remains a key goal within the psychiatric genetics community [47] and should lead to improved classification of mental health conditions and more efficacious treatment for patients. Machine learning [48, 49] and polygenic risk score [6, 50] approaches offer possible methods for stratification in mental health. In the current study, we used BUHMBOX [14] to identify whether traits that were genetically correlated with depression were correlated due to a subgroup, i.e.…”
Section: Discussionmentioning
confidence: 99%
“…Furthermore, stratifying depression using genetic data remains a key goal within the psychiatric genetics community [47] and should lead to improved classification of mental health conditions and more efficacious treatment for patients. Machine learning [48, 49] and polygenic risk score [6, 50] approaches offer possible methods for stratification in mental health. In the current study, we used BUHMBOX [14] to identify whether traits that were genetically correlated with depression were correlated due to a subgroup, i.e.…”
Section: Discussionmentioning
confidence: 99%
“…Many powerful ML algorithms render themselves uninterpretable, making it difficult to understand their decision-making process. Trying to balance interpretability with model performance, we used a more interpretable state of the art ML algorithm: regularized gradient boosted machine (GBM) (XGBoost implementation) [32], which we also demonstrated as an effective algorithm in our previous study [23].…”
Section: Algorithm Selectionmentioning
confidence: 99%
“…Our group recently applied supervised ML to rare, predicted functional variants from whole-exome sequencing (WES) data of a SCZ case-control dataset (n = 5090). 70% of the data was used to train the ML algorithm and 30% (n = 1526) to evaluate its performance, showing encouraging results (86% accuracy, AUC: 0.95) [23]. Studies based on supervised learning, like the one just mentioned, are focused on learning from input-to-output labeled data where a model is trained to learn the best function or map from input variables of data instances to their labels.…”
Section: Introductionmentioning
confidence: 99%
“…Machine learning is one of the moist important application of artificial intelligence which have ability to detect, analysis and produce results related to disease [14][15][16][17]. The beauty of machine learning process is deep learning.…”
Section: Machine Learningmentioning
confidence: 99%