Background Genome-wide association studies (GWASs) in Parkinson's disease (PD) have increased the scope of biological knowledge about the disease over the past decade. We sought to use the largest aggregate of GWAS data to identify novel risk loci and gain further insight into disease etiology. Methods We performed the largest meta-GWAS of PD to date, involving the analysis of 7.8M SNPs in 37.7K cases, 18.6K UK Biobank proxy-cases (having a first-degree relative with PD), and 1.4M controls. We carried out a meta-analysis of these GWAS data to nominate novel loci. We then evaluated heritable risk estimates and predictive models using these data. We also utilized large gene expression and methylation resources to examine possible functional consequences as well as tissue, cell type, and biological pathway enrichments for the identified risk factors. Additionally, we examined shared genetic risk between PD and other phenotypes of interest via genetic correlations followed by Mendelian randomization. Findings We identified 90 independent genome-wide significant risk signals across 78 genomic regions, including 38 novel independent risk signals in 37 loci. These 90 variants explained 16-36% of the heritable risk of PD depending on prevalence. Integrating methylation and expression data within a Mendelian randomization framework identified putatively associated genes at 70 risk signals underlying GWAS loci for follow-up functional studies. Tissue-specific expression enrichment analyses suggested PD loci were heavily brain-enriched, with specific neuronal cell types being implicated from single-cell data. We found significant genetic correlations with brain volumes, smoking status, and educational attainment. Mendelian randomization between cognitive performance and PD risk showed a robust association.
Interpretation These data provide the most comprehensive understanding of the genetic architecture of PD to date by revealing many additional PD risk loci, providing a biological context for these risk factors, and demonstrating that a considerable genetic component of this disease remains unidentified. Funding See supplemental materials (Text S2).
… lead to earlier detection and refined diagnostics, which may help improve clinical trials (4). The generation of copious amounts of public summary statistics created by this effort relating to both the GWAS and subsequent analyses of gene expression and methylation patterns may be of use to investigators planning follow-up functional studies in stem cells or other cellular screens, allowing them to prioritize targets more efficiently using our data as additional evidence. We hope our findings may have some downstream clinical impact in the future such as improved patient stratification for clinical trials and genetically informed drug targets.
We conducted a meta-analysis of Parkinson’s disease genome-wide association studies using a common set of 7,893,274 variants across 13,708 cases and 95,282 controls. Twenty-six loci were identified as genome-wide significant; these and six additional previously reported loci were then tested in an independent set of 5,353 cases and 5,551 controls. Of the 32 tested SNPs, 24 replicated, including 6 novel loci. Conditional analyses within loci show that four loci, including GBA, GAK/DGKQ, SNCA, and HLA, contain a secondary independent risk variant. In total, we identified and replicated 28 independent risk variants for Parkinson’s disease across 24 loci. While the effect of each individual locus is small, a risk profile analysis revealed a substantial cumulative risk in a comparison of the highest versus lowest quintiles of genetic risk (OR = 3.31, 95% CI: 2.55, 4.30; p-value = 2×10⁻¹⁶). We also show that 6 risk loci are associated with proximal gene expression or DNA methylation.
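As a rough illustration of how per-study association results for a single variant can be pooled and checked against a genome-wide threshold, here is a minimal sketch of a fixed-effect, inverse-variance-weighted meta-analysis. The abstract does not state which meta-analysis model was used, so the weighting scheme, the 5×10⁻⁸ threshold, and all effect sizes below are assumptions chosen for illustration only.

```python
import math

# Conventional genome-wide significance threshold (assumption; not stated in the text).
GENOME_WIDE_P = 5e-8

def fixed_effect_meta(betas, ses):
    """Pool per-study log odds ratios using inverse-variance weights."""
    weights = [1.0 / se ** 2 for se in ses]
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
    pooled_se = math.sqrt(1.0 / sum(weights))
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal null.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p

# Hypothetical per-study estimates for one variant (log odds ratio, standard error).
study_betas = [0.20, 0.25, 0.18]
study_ses = [0.05, 0.06, 0.04]

beta, se, z, p = fixed_effect_meta(study_betas, study_ses)
print(f"pooled OR = {math.exp(beta):.3f}, z = {z:.2f}, p = {p:.2e}")
print("genome-wide significant:", p < GENOME_WIDE_P)
```

The pooled standard error is smaller than any single study's, which is why combining cohorts lets small per-locus effects reach genome-wide significance.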
We performed the largest genome-wide association study of PD to date, involving the analysis of 7.8M SNPs in 37.7K cases, 18.6K UK Biobank proxy-cases, and 1.4M controls. We identified 90 independent genome-wide significant signals across 78 loci, including 38 independent risk signals in 37 novel loci. These variants explained 26-36% of the heritable risk of PD. Tests of causality within a Mendelian randomization framework identified putatively causal genes for 70 risk signals. Tissue expression enrichment analysis suggested that signatures of PD loci were heavily brain-enriched, consistent with specific neuronal cell types being implicated from single cell expression data. We found significant genetic correlations with brain volumes, smoking status, and educational attainment. In sum, these data provide the most comprehensive understanding of the genetic architecture of PD to date by revealing many additional PD risk loci, providing a biological context for these risk factors, and demonstrating that a considerable genetic component of this disease remains unidentified.
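To make the idea of a Mendelian randomization estimate concrete, the sketch below combines per-variant Wald ratios with inverse-variance weights. This is a generic textbook estimator, not necessarily the procedure used in the study, and the instrument effect sizes are hypothetical values invented for the example.

```python
import math

def wald_ratio(beta_exposure, beta_outcome, se_outcome):
    """Per-variant causal estimate: outcome effect divided by exposure effect."""
    estimate = beta_outcome / beta_exposure
    # First-order approximation that ignores uncertainty in the exposure effect.
    se = se_outcome / abs(beta_exposure)
    return estimate, se

def ivw_estimate(instruments):
    """Inverse-variance-weighted average of the per-variant Wald ratios."""
    ratios = [wald_ratio(bx, by, sy) for bx, by, sy in instruments]
    weights = [1.0 / se ** 2 for _, se in ratios]
    estimate = sum(w * r for w, (r, _) in zip(weights, ratios)) / sum(weights)
    se = math.sqrt(1.0 / sum(weights))
    return estimate, se

# Hypothetical instruments: (effect on exposure, effect on outcome, SE of outcome effect).
instruments = [
    (0.10, 0.030, 0.010),
    (0.20, 0.050, 0.012),
    (0.15, 0.045, 0.011),
]

mr_estimate, mr_se = ivw_estimate(instruments)
print(f"IVW estimate = {mr_estimate:.3f} (SE {mr_se:.3f})")
```

Variants with stronger effects on the exposure yield more precise ratios and therefore receive more weight in the combined estimate.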
Background Accurate diagnosis and early detection of complex disease have the potential to be of enormous benefit to clinical trialists, patients, and researchers alike. We sought to create a non-invasive, low-cost, and accurate classification model for diagnosing Parkinson’s disease risk to serve as a basis for future disease prediction studies in prospective longitudinal cohorts. Methods We developed a simple disease classifying model within 367 patients with Parkinson’s disease and phenotypically typical imaging data and 165 controls without neurological disease from the Parkinson’s Progression Markers Initiative (PPMI) study. Olfactory function, genetic risk, family history of PD, age, and gender were algorithmically selected as significant contributors to our classifying model. This model was developed using the PPMI study, then tested in 825 patients with Parkinson’s disease and 261 controls from five independent studies with varying recruitment strategies and designs, including the Parkinson’s Disease Biomarkers Program (PDBP), Parkinson’s Associated Risk Study (PARS), 23andMe, Longitudinal and Biomarker Study in PD (LABS-PD), and Morris K. Udall Parkinson’s Disease Research Center of Excellence (Penn-Udall). Findings Our initial model correctly distinguished patients with Parkinson’s disease from controls with an area under the curve (AUC) of 0.923 (95% CI = 0.900 – 0.946) and high sensitivity (0.834, 95% CI = 0.711 – 0.883) and specificity (0.903, 95% CI = 0.824 – 0.946) in PPMI at its optimal AUC threshold (0.655). The model is also well calibrated: all Hosmer-Lemeshow simulations suggest that, when participants are parsed into random subgroups, the observed data mirror the expected data, demonstrating that our model is robust and fits well. Likewise, external validation shows excellent classification of PD, with AUCs of 0.894 in PDBP, 0.998 in PARS, 0.955 in 23andMe, 0.929 in LABS-PD, and 0.939 in Penn-Udall.
Additionally, participants with scans without evidence of dopaminergic deficit (SWEDD) whom our model classifies as PD convert to typical PD within one year more often than would be expected by chance: 4 of 17 SWEDD participants classified as PD converted during brief follow-up, whereas only 1 of 38 SWEDD participants not classified as PD converted (test of proportions, p-value = 0.003). Interpretation This model may serve as a basis for future investigations into the classification, prediction, and treatment of Parkinson’s disease, particularly those attempting to identify prodromal or preclinical etiologically typical PD cases in prospective cohorts for efficient interventional and biomarker studies. Funding Please see the acknowledgements and funding section at the end of the manuscript.
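The sensitivity, specificity, and AUC figures quoted above can be understood through a small worked sketch. It applies the reported 0.655 probability threshold to a handful of hypothetical model scores and computes the AUC as the fraction of case-control pairs ranked correctly. The scores and labels are invented for illustration and are not study data.

```python
def sensitivity_specificity(scores, labels, threshold):
    """Classify as PD when score >= threshold, then compare with true labels."""
    tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
    fn = sum(1 for s, y in zip(scores, labels) if y == 1 and s < threshold)
    tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < threshold)
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)

def auc(scores, labels):
    """Probability that a random case scores higher than a random control."""
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    # Ties between a case and a control count as half a correct ranking.
    wins = sum(
        1.0 if c > k else 0.5 if c == k else 0.0
        for c in cases
        for k in controls
    )
    return wins / (len(cases) * len(controls))

# Hypothetical predicted probabilities; label 1 = PD case, 0 = control.
scores = [0.95, 0.80, 0.70, 0.60, 0.40, 0.66, 0.30, 0.20, 0.10, 0.50]
labels = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]

THRESHOLD = 0.655  # threshold reported in the abstract for PPMI
sens, spec = sensitivity_specificity(scores, labels, THRESHOLD)
model_auc = auc(scores, labels)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}, AUC = {model_auc:.2f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas the AUC summarizes ranking quality across all thresholds, which is why the abstract reports both.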