SummaryBackgroundUnderstanding the genetic basis of airflow obstruction and smoking behaviour is key to determining the pathophysiology of chronic obstructive pulmonary disease (COPD). We used UK Biobank data to study the genetic causes of smoking behaviour and lung health.MethodsWe sampled individuals of European ancestry from UK Biobank, from the middle and extremes of the forced expiratory volume in 1 s (FEV1) distribution among heavy smokers (mean 35 pack-years) and never smokers. We developed a custom array for UK Biobank to provide optimum genome-wide coverage of common and low-frequency variants, dense coverage of genomic regions already implicated in lung health and disease, and to assay rare coding variants relevant to the UK population. We investigated whether there were shared genetic causes between different phenotypes defined by extremes of FEV1. We also looked for novel variants associated with extremes of FEV1 and smoking behaviour and assessed regions of the genome that had already shown evidence for a role in lung health and disease. We set genome-wide significance at p<5 × 10−8.FindingsUK Biobank participants were recruited from March 15, 2006, to July 7, 2010. Sample selection for the UK BiLEVE study started on Nov 22, 2012, and was completed on Dec 20, 2012. We selected 50 008 unique samples: 10 002 individuals with low FEV1, 10 000 with average FEV1, and 5002 with high FEV1 from each of the heavy smoker and never smoker groups. We noted a substantial sharing of genetic causes of low FEV1 between heavy smokers and never smokers (p=2·29 × 10−16) and between individuals with and without doctor-diagnosed asthma (p=6·06 × 10−11). We discovered six novel genome-wide significant signals of association with extremes of FEV1, including signals at four novel loci (KANSL1, TSEN54, TET2, and RBM19/TBX5) and independent signals at two previously reported loci (NPNT and HLA-DQB1/HLA-DQA2). These variants also showed association with COPD, including in individuals with no history of smoking. The number of copies of a 150 kb region containing the 5′ end of KANSL1, a gene that is important for epigenetic gene regulation, was associated with extremes of FEV1. We also discovered five new genome-wide significant signals for smoking behaviour, including a variant in NCAM1 (chromosome 11) and a variant on chromosome 2 (between TEX41 and PABPC1P2) that has a trans effect on expression of NCAM1 in brain tissue.InterpretationBy sampling from the extremes of the lung function distribution in UK Biobank, we identified novel genetic causes of lung function and smoking behaviour. These results provide new insight into the specific mechanisms underlying airflow obstruction, COPD, and tobacco addiction, and show substantial shared genetic architecture underlying airflow obstruction across individuals, irrespective of smoking behaviour and other airway disease.FundingMedical Research Council.
Genome-Wide Association Study (GWAS) meta-analyses have identified a strong association signal for lung function, which maps to a region on 4q24 containing two oppositely transcribed genes: glutathione S-transferase, C-terminal domain containing (GSTCD) and integrator complex subunit 12 (INTS12). Both genes were found to be expressed in a range of human airway cell types. The promoter regions and transcription start sites were determined in mRNA from human lung and a novel splice variant was identified for each gene. We obtained the following evidence for GSTCD and INTS12 co-regulation and expression: (i) correlated mRNA expression was observed both via Q-PCR and in a lung expression quantitative trait loci (eQTL) study, (ii) induction of both GSTCD and INTS12 mRNA expression in human airway smooth muscle cells was seen in response to TGFβ1, (iii) a lung eQTL study revealed that both GSTCD and INTS12 mRNA levels positively correlate with percent predicted FEV1, and (iv) FEV1 GWAS associated SNPs in 4q24 were found to act as an eQTL for INTS12 in a number of tissues. In fixed sections of human lung tissue, GSTCD protein expression was ubiquitous, whereas INTS12 expression was predominantly in epithelial cells and pneumocytes. During human fetal lung development, GSTCD protein expression was observed to be highest at the earlier pseudoglandular stage (10-12 weeks) compared with the later canalicular stage (17-19 weeks), whereas INTS12 expression levels did not alter throughout these stages. Knowledge of the transcriptional and translational regulation and expression of GSTCD and INTS12 provides important insights into the potential role of these genes in determining lung function. Future work is warranted to fully define the functions of INTS12 and GSTCD.
Genome wide association (GWA) studies have reproducibly identified signals on chromosome 4q24 associated with lung function and COPD. GSTCD (Glutathione S -transferase C-terminal domain containing) represents a candidate causal gene in this locus, however little is currently known about the function of this protein. We set out to further our understanding of the role of GSTCD in cell functions and homeostasis using multiple molecular and cellular approaches in airway relevant cells. Recombinant expression of human GSTCD in conjunction with a GST activity assay did not identify any enzymatic activity for two GSTCD isoforms questioning the assignment of this protein to this family of enzymes. Protein structure analyses identified a potential methyltransferase domain contained within GSTCD, with these enzymes linked to cell viability and apoptosis. Targeted knockdown (siRNA) of GSTCD in bronchial epithelial cells identified a role for GSTCD in cell viability as proliferation rates were not altered. To provide greater insight we completed transcriptomic analyses on cells with GSTCD expression knocked down and identified several differentially expressed genes including those implicated in airway biology; fibrosis e.g. TGFBR1 and inflammation e.g. IL6R . Pathway based transcriptomic analyses identified an over-representation of genes related to adipogenesis which may suggest additional functions for GSTCD. These findings identify potential additional functions for GSTCD in the context of airway biology beyond the hypothesised GST activity and warrant further investigation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.