The Trans-Omics for Precision Medicine (TOPMed) program seeks to elucidate the genetic architecture and disease biology of heart, lung, blood, and sleep disorders, with the ultimate goal of improving diagnosis, treatment, and prevention. The initial phases of the program focus on whole genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here, we describe TOPMed goals and design as well as resources and early insights from the sequence data. The resources include a variant browser, a genotype imputation panel, and sharing of genomic and phenotypic data via dbGaP. In 53,581 TOPMed samples, >400 million single-nucleotide and insertion/deletion variants were detected by alignment with the reference genome. Additional novel variants are detectable through assembly of unmapped reads and customized analysis in highly variable loci. Among the >400 million variants detected, 97% have frequency <1% and 46% are singletons. These rare variants provide insights into mutational processes and recent human evolutionary history. The nearly complete catalog of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and non-coding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and extends the reach of nearly all genome-wide association studies to include variants down to ~0.01% in frequency.
The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.
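The frequency classes quoted above (singletons, and variants with frequency below 1%) can be illustrated with a minimal Python sketch. This is not the TOPMed pipeline: the function name `summarize_frequency_classes`, its parameters, and the toy allele counts are hypothetical, and the sketch assumes diploid autosomal genotypes, so that the number of sampled chromosomes is twice the number of individuals.

```python
from typing import Dict, Iterable


def summarize_frequency_classes(
    allele_counts: Iterable[int],
    n_samples: int,
    rare_threshold: float = 0.01,
) -> Dict[str, float]:
    """Summarize singleton and rare-variant fractions from allele counts.

    allele_counts: alternate-allele count per variant (hypothetical input).
    n_samples: number of diploid individuals (assumption: autosomes only).
    rare_threshold: allele-frequency cutoff for "rare" (1% by default).
    """
    total_alleles = 2 * n_samples  # two chromosomes per diploid individual
    # Keep only variants actually observed in the sample.
    observed = [ac for ac in allele_counts if ac > 0]
    if not observed:
        raise ValueError("no observed variants")

    n_variants = len(observed)
    # A singleton is an allele seen exactly once in the whole sample.
    n_singletons = sum(1 for ac in observed if ac == 1)
    # A rare variant has allele frequency strictly below the threshold.
    n_rare = sum(1 for ac in observed if ac / total_alleles < rare_threshold)

    return {
        "n_variants": n_variants,
        "singleton_fraction": n_singletons / n_variants,
        "rare_fraction": n_rare / n_variants,
    }


# Toy data: six sites in 1,000 individuals, one of them monomorphic (count 0).
summary = summarize_frequency_classes([1, 1, 1, 5, 50, 0], n_samples=1000)
print(summary)
```

Applied to real call sets, the same counting yields the kind of statistics reported in the abstract, although the published figures also depend on variant filtering and on how related individuals are handled.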
By meta-analyzing the whole-exomes of 24,248 cases and 97,322 controls, we implicate ultra-rare coding variants (URVs) in ten genes as conferring substantial risk for schizophrenia (odds ratios 3–50, P < 2.14 × 10⁻⁶), and 32 genes at a FDR < 5%. These genes have the greatest expression in central nervous system neurons and have diverse molecular functions that include the formation, structure, and function of the synapse. The associations of NMDA receptor subunit GRIN2A and AMPA receptor subunit GRIA3 provide support for the dysfunction of the glutamatergic system as a mechanistic hypothesis in the pathogenesis of schizophrenia. We find significant evidence for an overlap of rare variant risk between schizophrenia, autism spectrum disorders (ASD), and severe neurodevelopmental disorders (DD/ID), supporting a neurodevelopmental etiology for schizophrenia. We show that protein-truncating variants in GRIN2A, TRIO, and CACNA1G confer risk for schizophrenia whereas specific missense mutations in these genes confer risk for DD/ID. Nevertheless, few of the strongly associated schizophrenia genes appear to confer risk for DD/ID. We demonstrate that genes prioritized from common variant analyses of schizophrenia are enriched in rare variant risk, suggesting that common and rare genetic risk factors at least partially converge on the same underlying pathogenic biological processes. Even after excluding significantly associated genes, schizophrenia cases still carry a substantial excess of URVs, implying that more schizophrenia risk genes await discovery using this approach.
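To make the odds ratios quoted above concrete, the following Python sketch computes a carrier odds ratio and an approximate 95% confidence interval (Woolf's log-based method) from a 2×2 table. It is an illustration only and not the meta-analytic method used in the study. The function name `carrier_odds_ratio` and the carrier counts (20 and 8) are hypothetical; only the case and control totals come from the abstract.

```python
import math
from typing import Tuple


def carrier_odds_ratio(
    case_carriers: int,
    n_cases: int,
    control_carriers: int,
    n_controls: int,
) -> Tuple[float, float, float]:
    """Return (odds ratio, lower 95% bound, upper 95% bound).

    Uses the simple cross-product odds ratio and Woolf's approximation
    for the confidence interval; all four table cells must be positive.
    """
    a = case_carriers                    # cases carrying a variant
    b = n_cases - case_carriers          # cases without a variant
    c = control_carriers                 # controls carrying a variant
    d = n_controls - control_carriers    # controls without a variant
    if min(a, b, c, d) <= 0:
        raise ValueError("all cells of the 2x2 table must be positive")

    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under Woolf's approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - 1.96 * se_log_or)
    upper = math.exp(math.log(odds_ratio) + 1.96 * se_log_or)
    return odds_ratio, lower, upper


# Totals from the abstract; the carrier counts are hypothetical.
or_, lo, hi = carrier_odds_ratio(20, 24_248, 8, 97_322)
print(f"OR = {or_:.2f} (95% CI {lo:.2f} to {hi:.2f})")
```

With ultra-rare variants, carrier counts per gene are small, so an exact test is generally preferable to this normal approximation; the sketch only shows how an odds ratio in the reported range arises from carrier counts.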
Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury1–4. These substances are used across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries5. Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction.
Parkinson’s disease (PD), with its characteristic loss of nigrostriatal dopaminergic neurons and deposition of α-synuclein in neurons, is often considered a neuronal disorder. However, in recent years substantial evidence has emerged to implicate glial cell types, such as astrocytes and microglia. In this study, we used stratified LD score regression and expression-weighted cell-type enrichment together with several brain-related and cell-type-specific genomic annotations to connect human genomic PD findings to specific brain cell types. We found that PD heritability attributable to common variation does not enrich in global and regional brain annotations or brain-related cell-type-specific annotations. Likewise, we found no enrichment of PD susceptibility genes in brain-related cell types. In contrast, we demonstrated a significant enrichment of PD heritability in a curated lysosomal gene set highly expressed in astrocytic, microglial, and oligodendrocyte subtypes, and in LoF-intolerant genes, which were found highly expressed in almost all tested cellular subtypes. Our results suggest that PD risk loci do not lie in specific cell types or individual brain regions, but rather in global cellular processes detectable across several cell types.
Neuronal intranuclear inclusion disease (NIID) is a clinically heterogeneous neurodegenerative condition characterized by pathological intranuclear eosinophilic inclusions. A CGG repeat expansion in NOTCH2NLC was recently identified to be associated with NIID in patients of Japanese descent. We screened pathologically confirmed European NIID cases and cases of neurodegenerative disease with intranuclear inclusions, and applied in silico‐based screening using whole‐genome sequencing data from 20 536 participants in the 100 000 Genomes Project. We identified a single European case harbouring the pathogenic repeat expansion with a distinct haplotype structure. Thus, we propose new diagnostic criteria as European NIID represents a distinct disease entity from East Asian cases.