The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7×) or exomes (high read depth, 80×) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results.
Modifying the length of the Huntington's disease (HD) CAG repeat, the major determinant of age of disease onset, is an attractive therapeutic approach. To explore this we are investigating mechanisms of intergenerational and somatic HD CAG repeat instability. Here, we have crossed HD CAG knock-in mice onto backgrounds deficient in mismatch repair genes, Msh3 and Msh6, to discern the effects on CAG repeat size and disease pathogenesis. We find that different mechanisms predominate in inherited and somatic instability, with Msh6 protecting against intergenerational contractions and Msh3 required both for increasing CAG length and for enhancing an early disease phenotype in striatum. Therefore, attempts to decrease inherited repeat size may entail a full understanding of Msh6 complexes, while attempts to block the age-dependent increases in CAG size in striatal neurons and to slow the disease process will require a full elucidation of Msh3 complexes and their function in CAG repeat instability.
Genome-wide association studies (GWAS) have identified >250 loci for body mass index (BMI), implicating pathways related to neuronal biology. Most GWAS loci represent clusters of common, non-coding variants from which pinpointing causal genes remains challenging. Here, we combined data from 718,734 individuals to discover rare and low-frequency (MAF < 5%) coding variants associated with BMI. We identified 14 coding variants in 13 genes, of which eight were in genes (ZBTB7B, ACHE, RAPGEF3, RAB21, ZFHX3, ENTPD6, ZFR2, ZNF169) newly implicated in human obesity, two were in genes (MC4R, KSR2) previously observed in extreme obesity, and two were variants in GIPR. Effect sizes of rare variants are ~10 times larger than those of common variants, with the largest effect observed in carriers of an MC4R stop-codon variant (p.Tyr35Ter, MAF = 0.01%), who weigh ~7 kg more than non-carriers. Pathway analyses confirmed enrichment of neuronal genes and provide new evidence for adipocyte and energy expenditure biology, widening the potential of genetically supported therapeutic targets to treat obesity.
The age of onset of Huntington's disease (HD) is determined primarily by the length of the HD CAG repeat mutation, but is also influenced by other modifying factors. Delineating these modifiers is a critical step towards developing validated therapeutic targets in HD patients. The HD CAG repeat is somatically unstable, undergoing progressive length increases over time, particularly in brain regions that are the targets of neurodegeneration. Here, we have explored the hypothesis that somatic instability of the HD CAG repeat is itself a modifier of disease. Using small-pool PCR, we quantified somatic instability in the cortex region of the brain from a cohort of HD individuals exhibiting phenotypic extremes of young and old disease onset as predicted by the length of their constitutive HD CAG repeat lengths. After accounting for constitutive repeat length, somatic instability was found to be a significant predictor of onset age, with larger repeat length gains associated with earlier disease onset. These data are consistent with the hypothesis that somatic HD CAG repeat length expansions in target tissues contribute to the HD pathogenic process, and support pursuing factors that modify somatic instability as viable therapeutic targets.
Summary Kinase suppressor of Ras 2 (KSR2) is an intracellular scaffolding protein involved in multiple signaling pathways. Targeted deletion of Ksr2 leads to obesity in mice, suggesting a role in energy homeostasis. We explored the role of KSR2 in humans by sequencing 2,101 individuals with severe early-onset obesity and 1,536 controls. We identified multiple rare variants in KSR2 that disrupt signaling through the Raf-MEK-ERK pathway and impair cellular fatty acid oxidation and glucose oxidation in transfected cells; effects that can be ameliorated by the commonly prescribed antidiabetic drug, metformin. Mutation carriers exhibit hyperphagia in childhood, low heart rate, reduced basal metabolic rate and severe insulin resistance. These data establish KSR2 as an important regulator of energy intake, energy expenditure, and substrate utilization in humans. Modulation of KSR2-mediated effects may represent a novel therapeutic strategy for obesity and type 2 diabetes.
The variation in weight within a shared environment is largely attributable to genetic factors. Whilst many genes/loci confer susceptibility to obesity, little is known about the genetic architecture of healthy thinness. Here, we characterise the heritability of thinness, which we found was comparable to that of severe obesity (h² = 28.07% vs 32.33%, respectively), although with incomplete genetic overlap (r = -0.49, 95% CI [-0.17, -0.82], p = 0.003). In a genome-wide association analysis of thinness (n = 1,471) vs severe obesity (n = 1,456), we identified 10 loci previously associated with obesity, and demonstrate enrichment for established BMI-associated loci (p_binomial = 3.05 × 10−5). Simulation analyses showed that different association results between the extremes were likely in agreement with additive effects across the BMI distribution, suggesting different effects on thinness and obesity could be due to their different degrees of extremeness. In further analyses, we detected a novel obesity and BMI-associated locus at PKHD1 (rs2784243, obese vs. thin p = 5.99 × 10−6, obese vs. controls p = 2.13 × 10−6, p_BMI = 2.3 × 10−13), associations at loci recently discovered with much larger sample sizes (e.g. FAM150B and PRDM6-CEP120), and novel variants driving associations at previously established signals (e.g. rs205262 at the SNRPC/C6orf106 locus and rs112446794 at the PRDM6-CEP120 locus). Our ability to replicate loci found with much larger sample sizes demonstrates the value of clinical extremes and suggests that characterisation of the genetics of thinness may provide a more nuanced understanding of the genetic architecture of body weight regulation and may inform the identification of potential anti-obesity targets.
Background Even before the onset of age-related diseases, obesity might be a contributing factor to the cumulative burden of oxidative stress and chronic inflammation throughout the life course. Obesity may therefore contribute to accelerated shortening of telomeres. Consequently, obese persons are more likely to have shorter telomeres, but the association between body mass index (BMI) and leukocyte telomere length (TL) might differ across the life span and between ethnicities and sexes. Objective A collaborative cross-sectional meta-analysis of observational studies was conducted to investigate the associations between BMI and TL across the life span. Design Eighty-seven distinct study samples were included in the meta-analysis capturing data from 146,114 individuals. Study-specific age- and sex-adjusted regression coefficients were combined by using a random-effects model in which absolute [base pairs (bp)] and relative telomere to single-copy gene ratio (T/S ratio) TLs were regressed against BMI. Stratified analysis was performed by 3 age categories (“young”: 18–60 y; “middle”: 61–75 y; and “old”: >75 y), sex, and ethnicity. Results Each unit increase in BMI corresponded to a −3.99 bp (95% CI: −5.17, −2.81 bp) difference in TL in the total pooled sample; among young adults, each unit increase in BMI corresponded to a −7.67 bp (95% CI: −10.03, −5.31 bp) difference. Each unit increase in BMI corresponded to a −1.58 × 10−3 unit T/S ratio (0.16% decrease; 95% CI: −2.14 × 10−3, −1.01 × 10−3) difference in age- and sex-adjusted relative TL in the total pooled sample; among young adults, each unit increase in BMI corresponded to a −2.58 × 10−3 unit T/S ratio (0.26% decrease; 95% CI: −3.92 × 10−3, −1.25 × 10−3). The associations were predominantly for the white pooled population. No sex differences were observed. Conclusions A higher BMI is associated with shorter telomeres, especially in younger individuals. The presently observed difference is not negligible. Meta-analyses of longitudinal studies evaluating change in body weight alongside change in TL are warranted.
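The pooling step described in the Design section above, combining study-specific regression coefficients under a random-effects model, can be sketched as follows. This is a minimal illustration that assumes the DerSimonian-Laird estimator for the between-study variance; the abstract does not state which estimator was used. The function name `random_effects_pool` and the input values in the usage example are hypothetical and are not data from the study.

```python
import math


def random_effects_pool(betas, ses):
    """Pool per-study coefficients with a DerSimonian-Laird random-effects model.

    betas: per-study regression coefficients (e.g. bp of TL per unit BMI)
    ses:   their standard errors
    Returns (pooled estimate, pooled SE, tau^2, 95% CI as a tuple).
    """
    if len(betas) != len(ses) or len(betas) < 2:
        raise ValueError("need at least two studies with matching inputs")

    # Fixed-effect (inverse-variance) weights and estimate.
    w = [1.0 / se ** 2 for se in ses]
    sw = sum(w)
    fixed = sum(wi * b for wi, b in zip(w, betas)) / sw

    # Cochran's Q and the DerSimonian-Laird between-study variance tau^2.
    q = sum(wi * (b - fixed) ** 2 for wi, b in zip(w, betas))
    df = len(betas) - 1
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0

    # Random-effects weights add tau^2 to each study's sampling variance.
    w_re = [1.0 / (se ** 2 + tau2) for se in ses]
    sw_re = sum(w_re)
    pooled = sum(wi * b for wi, b in zip(w_re, betas)) / sw_re
    se_pooled = math.sqrt(1.0 / sw_re)
    ci = (pooled - 1.96 * se_pooled, pooled + 1.96 * se_pooled)
    return pooled, se_pooled, tau2, ci


if __name__ == "__main__":
    # Hypothetical per-study slopes and standard errors, for illustration only.
    est, se, tau2, ci = random_effects_pool([-5.0, -2.5, -4.0], [1.2, 0.9, 1.5])
    print(f"pooled={est:.2f} se={se:.2f} tau2={tau2:.2f} "
          f"ci=({ci[0]:.2f}, {ci[1]:.2f})")
```

When the studies agree closely, the estimated between-study variance is truncated at zero and the result coincides with the fixed-effect inverse-variance estimate; larger heterogeneity widens the pooled confidence interval.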
Deep sequence-based imputation can enhance the discovery power of genome-wide association studies by assessing previously unexplored variation across the common- and low-frequency spectra. We applied a hybrid whole-genome sequencing (WGS) and deep imputation approach to examine the broader allelic architecture of 12 anthropometric traits associated with height, body mass, and fat distribution in up to 267,616 individuals. We report 106 genome-wide significant signals that have not been previously identified, including 9 low-frequency variants pointing to functional candidates. Of the 106 signals, 6 are in genomic regions that have not been implicated with related traits before, 28 are independent signals at previously reported regions, and 72 represent previously reported signals for a different anthropometric trait. 71% of signals reside within genes and fine mapping resolves 23 signals to one or two likely causal variants. We confirm genetic overlap between human monogenic and polygenic anthropometric traits and find signal enrichment in cis expression QTLs in relevant tissues. Our results highlight the potential of WGS strategies to enhance biologically relevant discoveries across the frequency spectrum.