Joint analysis of multiple traits can result in the identification of associations not found through the analysis of each trait in isolation. Studies of neuropsychiatric disorders and congenital heart disease (CHD) which use de novo mutations (DNMs) from parent-offspring trios have reported multiple putatively causal genes. However, a joint analysis method designed to integrate DNMs from multiple studies has yet to be implemented. We here introduce multiple-trait TADA (mTADA) which jointly analyzes two traits using DNMs from non-overlapping family samples. We first demonstrate that mTADA is able to leverage genetic overlaps to increase the statistical power of risk-gene identification. We then apply mTADA to large datasets of >13,000 trios for five neuropsychiatric disorders and CHD. We report additional risk genes for schizophrenia, epileptic encephalopathies and CHD. We outline some shared and specific biological information of intellectual disability and CHD by conducting systems biology analyses of genes prioritized by mTADA.
for the Cooperative Studies Program (CSP) #572 and Million Veteran Program (MVP)IMPORTANCE Serious mental illnesses, including schizophrenia, bipolar disorder, and depression, are heritable, highly multifactorial disorders and major causes of disability worldwide.OBJECTIVE To benchmark the penetrance of current neuropsychiatric polygenic risk scores (PRSs) in the Veterans Health Administration health care system and to explore associations between PRS and broad categories of human disease via phenome-wide association studies.DESIGN, SETTING, AND PARTICIPANTS Extensive Veterans Health Administration's electronic health records were assessed from October 1999 to January 2021, and an embedded cohort of 9378 individuals with confirmed diagnoses of schizophrenia or bipolar 1 disorder were found. The performance of schizophrenia, bipolar disorder, and major depression PRSs were compared in participants of African or European ancestry in the Million Veteran Program (approximately 400 000 individuals), and associations between PRSs and 1650 disease categories based on ICD-9/10 billing codes were explored. Last, genomic structural equation modeling was applied to derive novel PRSs indexing common and disorder-specific genetic factors. Analysis took place from January 2021 to January 2022. MAIN OUTCOMES AND MEASURESDiagnoses based on in-person structured clinical interviews were compared with ICD-9/10 billing codes. PRSs were constructed using summary statistics from genome-wide association studies of schizophrenia, bipolar disorder, and major depression. RESULTSOf 707 299 enrolled study participants, 459 667 were genotyped at the time of writing; 84 806 were of broadly African ancestry (mean [SD] age, 58 [12.1] years) and 314 909 were of broadly European ancestry (mean [SD] age, 66.4 [13.5] years). Among 9378 individuals with confirmed diagnoses of schizophrenia or bipolar 1 disorder, 8962 (95.6%) were correctly identified using ICD-9/10 codes (2 or more). Among those of European ancestry, PRSs were robustly associated with having received a diagnosis of schizophrenia (odds ratio [OR], 1.81 [95% CI, 1.76-1.87]; P < 10 −257 ) or bipolar disorder (OR, 1.42 [95% CI,; P < 10 −295 ). Corresponding effect sizes in participants of African ancestry were considerably smaller for schizophrenia (OR, 1.35 [95% CI,; P < 10 −38 ) and bipolar disorder (OR, 1.16 [95% CI,; P < 10 −10 ). Neuropsychiatric PRSs were associated with increased risk for a range of psychiatric and physical health problems. CONCLUSIONS AND RELEVANCEUsing diagnoses confirmed by in-person structured clinical interviews and current neuropsychiatric PRSs, the validity of an electronic health records-based phenotyping approach in US veterans was demonstrated, highlighting the potential of PRSs for disentangling biological and mediated pleiotropy.
Motivation: Rare variant-based analyses are beginning to identify risk genes for neuropsychiatric disorders and other diseases. However, the identified genes only account for a fraction of predicted causal genes. Recent studies have shown that rare damaging variants are significantly enriched in specific gene-sets. Methods which are able to jointly model rare variants and gene-sets to identify enriched gene-sets and use these enriched gene-sets to prioritize additional risk genes could improve understanding of the genetic architecture of diseases. Results: We propose DECO (Integrated analysis of de novo mutations, rare case/control variants and omics information via gene-sets), an integrated method for rare-variant and gene-set analysis. The method can (i) test the enrichment of gene-sets directly within the statistical model, and (ii) use enriched gene-sets to rank existing genes and prioritize additional risk genes for tested disorders. In simulations, DECO performs better than a homologous method that uses only variant data. To demonstrate the application of the proposed protocol, we have applied this approach to rare-variant datasets of schizophrenia. Compared with a method which only uses variant information, DECO is able to prioritize additional risk genes. Availability: DECO can be used to analyze rare-variants and biological pathways or cell types for any disease. The package is available on Github https://github.com/hoangtn/DECO.
Schizophrenia (SCZ) is a chronic mental illness and among the most debilitating conditions encountered in medical practice. A recent landmark SCZ study of the protein-coding regions of the genome identified a causal role for ten genes and a concentration of rare variant signals in evolutionarily constrained genes1. This recent study—and most other large-scale human genetics studies—was mainly composed of individuals of European (EUR) ancestry, and the generalizability of the findings in non-EUR populations remains unclear. To address this gap, we designed a custom sequencing panel of 161 genes selected based on the current knowledge of SCZ genetics and sequenced a new cohort of 11,580 SCZ cases and 10,555 controls of diverse ancestries. Replicating earlier work, we found that cases carried a significantly higher burden of rare protein-truncating variants (PTVs) among evolutionarily constrained genes (odds ratio = 1.48; P = 5.4 × 10−6). In meta-analyses with existing datasets totaling up to 35,828 cases and 107,877 controls, this excess burden was largely consistent across five ancestral populations. Two genes (SRRM2 and AKAP11) were newly implicated as SCZ risk genes, and one gene (PCLO) was identified as shared by individuals with SCZ and those with autism. Overall, our results lend robust support to the rare allelic spectrum of the genetic architecture of SCZ being conserved across diverse human populations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.