Joint modeling of a number of phenotypes using multivariate methods has often been neglected in genome-wide association studies and if used, replication has not been sought. Modern omics technologies allow characterization of functional phenomena using a large number of related phenotype measures, which can benefit from such joint analysis. Here, we report a multivariate genome-wide association studies of 23 immunoglobulin G (IgG) N-glycosylation phenotypes. In the discovery cohort, our multi-phenotype method uncovers ten genome-wide significant loci, of which five are novel (IGH, ELL2, HLA-B-C, AZI1, FUT6-FUT3). We convincingly replicate all novel loci via multivariate tests. We show that IgG N-glycosylation loci are strongly enriched for genes expressed in the immune system, in particular antibody-producing cells and B lymphocytes. We empirically demonstrate the efficacy of multivariate methods to discover novel, reproducible pleiotropic effects.
Aging populations face diminishing quality of life due to increased disease and morbidity. These challenges call for longevity research to focus on understanding the pathways controlling healthspan. We use the data from the UK Biobank (UKB) cohort and observe that the risks of major chronic diseases increased exponentially and double every eight years, i.e., at a rate compatible with the Gompertz mortality law. Assuming that aging drives the acceleration in morbidity rates, we build a risk model to predict the age at the end of healthspan depending on age, gender, and genetic background. Using the sub-population of 300,447 British individuals as a discovery cohort, we identify 12 loci associated with healthspan at the whole-genome significance level. We find strong genetic correlations between healthspan and all-cause mortality, life-history, and lifestyle traits. We thereby conclude that the healthspan offers a promising new way to interrogate the genetics of human longevity.
Chronic musculoskeletal pain affects all aspects of human life. However, mechanisms of its genetic control remain poorly understood. Genetic studies of pain are complicated by the high complexity and heterogeneity of pain phenotypes. Here, we apply principal component analysis to reduce phenotype heterogeneity of chronic musculoskeletal pain at four locations: the back, neck/shoulder, hip, and knee. Using matrices of genetic covariances, we constructed four genetically independent phenotypes (GIPs) with the leading GIP (GIP1) explaining 78.4% of the genetic variance of the analyzed conditions, and GIP2-4 explain progressively less. We identified and replicated five GIP1-associated loci and one GIP2associated locus and prioritized the most likely causal genes. For GIP1, we showed enrichment with multiple nervous system-related terms and genetic correlations with anthropometric, sociodemographic, psychiatric/personality traits and osteoarthritis. We suggest that GIP1 represents a biopsychological component of chronic musculoskeletal pain, related to physiological and psychological aspects and reflecting pain perception and processing.
ZIP8 is a metal transporter with a role in manganese (Mn) homeostasis. A common genetic variant in ZIP8 (rs13107325; A391T) ranks in the top 10 of pleiotropic SNPs identified in GWAS; A391T has associations with an increased risk of schizophrenia, obesity, Crohn’s disease, and reduced blood Mn. Here, we used CRISPR/ Cas9 -mediated knockin (KI) to generate a mouse model of ZIP8 A391T (Zip8 393T-KI mice). Recapitulating the SNP association with blood Mn, blood Mn was reduced in Zip8 393T-KI mice. There was restricted abnormal tissue Mn homeostasis, with decreases in liver and kidney Mn and a reciprocal increase in biliary Mn, providing in vivo evidence of hypomorphic Zip8 function. Upon challenge in a chemically induced colitis model, male Zip8 393T-KI mice exhibited enhanced disease susceptibility. ZIP8 391-Thr associated with reduced triantennary plasma N-glycan species in a population-based cohort to define a genotype-specific glycophenotype hypothesized to be linked to Mn-dependent glycosyltransferase activity. This glycophenotype was maintained in a cohort of patients with Crohn’s disease. These data and the pleiotropic disease associations with ZIP8 391-Thr suggest underappreciated roles of Mn homeostasis in complex human disease.
Glycosylation is a common post-translational modification of proteins. Glycosylation is associated with a number of human diseases. Defining genetic factors altering glycosylation may provide a basis for novel approaches to diagnostic and pharmaceutical applications. Here we report a genome-wide association study of the human blood plasma N-glycome composition in up to 3811 people measured by Ultra Performance Liquid Chromatography (UPLC) technology. Starting with the 36 original traits measured by UPLC, we computed an additional 77 derived traits leading to a total of 113 glycan traits. We studied associations between these traits and genetic polymorphisms located on human autosomes. We discovered and replicated 12 loci. This allowed us to demonstrate an overlap in genetic control between total plasma protein and IgG glycosylation. The majority of revealed loci contained genes that encode enzymes directly involved in glycosylation (FUT3/FUT6, FUT8, B3GAT1, ST6GAL1, B4GALT1, ST3GAL4, MGAT3 and MGAT5) and a known regulator of plasma protein fucosylation (HNF1A). However, we also found loci that could possibly ref lect other more complex aspects of glycosylation process. Functional genomic annotation suggested the role of several genes including DERL3, CHCHD10, TMEM121, IGH and IKZF1. The hypotheses we generated may serve as a starting point for further functional studies in this research area.
45Glycosylation is a common post-translational modification of proteins. It is known, that glycans 46 are directly involved in the pathophysiology of every major disease. Defining genetic factors 47 altering glycosylation may provide a basis for novel approaches to diagnostic and pharmaceutical 48 applications. Here, we report a genome-wide association study of the human blood plasma N-49 glycome composition in up to 3811 people. We discovered and replicated twelve loci. This 50 allowed us to demonstrate a clear overlap in genetic control between total plasma and IgG 51 glycosylation. Majority of loci contained genes that encode enzymes directly involved in 52 glycosylation (FUT3/FUT6, FUT8, B3GAT1, ST6GAL1, B4GALT1, ST3GAL4, MGAT3, and 53 MGAT5). We, however, also found loci that are likely to reflect other, more complex, aspects of 54 plasma glycosylation process. Functional genomic annotation suggested the role of DERL3, which 55 potentially highlights the role of glycoprotein degradation pathway, and such transcription factor 56 as IKZF1. 57 58 59Glycosylation -addition of carbohydrates to a substrate -is a common cotranslational and 60 posttranslational modification of proteins. that affects the physical properties of proteins 61 (solubility, conformation, folding, stability, trafficking, etc.) [1-4] as well as their biological 62 functions -from protein-protein interactions, interaction of proteins with receptors, to cell-cell, 63 cell-matrix, and host-pathogen interactions [2,3,5,6]. It has been estimated that more than half of 64 all proteins are glycosylated [7][8][9]. Given the fact that glycans participate in many biological 65 processes, it is therefore not surprising that molecular defects in protein glycosylation pathways 66 are increasingly recognized as direct causes of diseases, such as rheumatoid arthritis, 67 cardiometabolic disorders, cancer, variety of autoimmune diseases, type 2 diabetes, inflammatory 68 bowel disease and others [10][11][12][13][14][15][16][17]. More specifically, a variety of N-glycan structures are now 69 considered as disease markers and represent diagnostic as well as therapeutic targets [5,12,[18][19][20][21][22][23][24][25]. 70Defining the genetic control of protein glycosylation expands our knowledge about the regulation 71 of this fundamental biological process, and it may also shed new light onto how alterations in 72 glycosylation can lead to the development of complex human diseases [11]. 73
Post-translational modifications diversify protein functions and dynamically coordinate their signalling networks, influencing most aspects of cell physiology. Nevertheless, their genetic regulation or influence on complex traits is not fully understood. Here, we compare the genetic regulation of the same PTM of two proteins – glycosylation of transferrin and immunoglobulin G (IgG). By performing genome-wide association analysis of transferrin glycosylation, we identify 10 significantly associated loci, 9 of which were not reported previously. Comparing these with IgG glycosylation-associated genes, we note protein-specific associations with genes encoding glycosylation enzymes (transferrin - MGAT5, ST3GAL4, B3GAT1; IgG - MGAT3, ST6GAL1), as well as shared associations (FUT6, FUT8). Colocalisation analyses of the latter suggest that different causal variants in the FUT genes regulate fucosylation of the two proteins. Glycosylation of these proteins is thus genetically regulated by both shared and protein-specific mechanisms.
Human protein glycosylation is a complex process, and its in vivo regulation is poorly understood. Changes in glycosylation patterns are associated with many human diseases and conditions. Understanding the biological determinants of protein glycome provides a basis for future diagnostic and therapeutic applications. Genome-wide association studies (GWAS) allow to study biology via a hypothesis-free search of loci and genetic variants associated with a trait of interest. Sixteen loci were identified by three previous GWAS of human plasma proteome N-glycosylation. However, the possibility that some of these loci are false positives needs to be eliminated by replication studies, which have been limited so far. Here, we use the largest set of samples so far (4802 individuals) to replicate the previously identified loci. For all but one locus, the expected replication power exceeded 95%. Of the 16 loci reported previously, 15 were replicated in our study. For the remaining locus (near the KREMEN1 gene), the replication power was low, and hence, replication results were inconclusive. The very high replication rate highlights the general robustness of the GWAS findings as well as the high standards adopted by the community that studies genetic regulation of protein glycosylation. The 15 replicated loci present a good target for further functional studies. Among these, eight loci contain genes encoding glycosyltransferases: MGAT5, B3GAT1, FUT8, FUT6, ST6GAL1, B4GALT1, ST3GAL4 and MGAT3. The remaining seven loci offer starting points for further functional follow-up investigation into molecules and mechanisms that regulate human protein N-glycosylation in vivo.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.