Genome-wide association studies (GWAS) and candidate gene studies in ulcerative colitis (UC) have identified 18 susceptibility loci. We conducted a meta-analysis of 6 UC GWAS, comprising 6,687 cases and 19,718 controls, and followed up the top association signals in 9,628 cases and 12,917 controls. We identified 29 additional risk loci (P < 5×10−8), increasing the number of UC-associated loci to 47. After annotating associated regions using GRAIL, eQTL data and correlations with non-synonymous SNPs, we identified many candidate genes providing potentially important insights into disease pathogenesis, including IL1R2, IL8RA/B, IL7R, IL12B, DAP, PRDM1, JAK2, IRF5, GNA12 and LSP1. The total number of confirmed inflammatory bowel disease (IBD) risk loci is now 99, including a minimum of 28 shared association signals between Crohn’s disease (CD) and UC.
Ulcerative colitis is a chronic inflammatory disease of the colon that presents as diarrhea and gastrointestinal bleeding. We performed a genome-wide association study using DNA samples from 1,052 individuals with ulcerative colitis and pre-existing data from 2,571 controls, all of European ancestry. In an analysis that controlled for gender and population structure, ulcerative colitis loci attaining genome-wide significance and subsequent replication in two independent populations were identified on chromosomes 1p36 (rs6426833, combined P = 5.1×10−13, combined OR = 0.73) and 12q15 (rs1558744, combined P = 2.5×10−12, combined OR = 1.35). In addition, combined genome-wide significant evidence for association was found in a region spanning BTNL2 to HLA-DQB1 on chromosome 6p21 (rs2395185, combined P = 1.0×10−16, combined OR = 0.66) and at the IL23R locus on chromosome 1p31 (rs11209026, combined P = 1.3×10−8, combined OR = 0.56; rs10889677, combined P = 1.3×10−8, combined OR = 1.29).
Objectives: Genetic susceptibility is known to play a large part in the predisposition to the inflammatory bowel diseases (IBD) known as Crohn’s disease (CD) and ulcerative colitis (UC). The IL2/IL21 locus on 4q27 is known to be a common risk locus for inflammatory disease (shown in celiac disease, type 1 diabetes, rheumatoid arthritis, systemic lupus erythematosus and psoriasis), while the roles that IL2 and IL21 play in the immune response also make them attractive candidates for inflammatory bowel disease. Our objective was to test for association between the IL2/IL21 locus and the inflammatory bowel diseases. Methods: The four single nucleotide polymorphisms (SNPs) in the IL2/IL21 locus most strongly associated with celiac disease were genotyped in 1590 IBD cases and 929 controls from the Netherlands, and then replicated in a North American cohort (2387 cases and 1266 controls) and an Italian cohort (805 cases and 421 controls), yielding a total of 4782 cases (3194 UC, 1588 CD) and 2616 controls. Allelic association testing and a pooled analysis using a Cochran-Mantel-Haenszel test were performed. Results: All four SNPs were strongly associated with UC in all three cohorts and reached genome-wide significance in the pooled analysis (rs13151961 p = 1.35×10−10, rs13119723 p = 8.60×10−8, rs6840978 p = 3.07×10−8, rs6822844 p = 2.77×10−9). We also found a moderate association with CD in the pooled analysis (p value range 0.0016–9.86×10−5). Conclusions: We found a strong association for the IL2/IL21 locus with UC, which also confirms it as a general susceptibility locus for inflammatory disease.
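As an illustration of the pooled analysis mentioned in this abstract, a Cochran-Mantel-Haenszel test over per-cohort 2×2 allele-count tables might be sketched as follows. The function name `cmh_test`, the table layout, and the counts in the usage lines are assumptions made for illustration only; they are not the study's data or code, and this sketch omits the continuity correction that some implementations apply.

```python
import math


def cmh_test(strata):
    """Cochran-Mantel-Haenszel test across strata (e.g. cohorts).

    Each stratum is a tuple (a, b, c, d) of allele counts:
      a = test allele in cases,    b = other allele in cases,
      c = test allele in controls, d = other allele in controls.
    Returns (pooled Mantel-Haenszel odds ratio, chi-square statistic, p-value).
    """
    or_num = or_den = 0.0   # pieces of the Mantel-Haenszel odds ratio
    diff = var = 0.0        # sum of (observed - expected) and its variance
    for a, b, c, d in strata:
        n = a + b + c + d
        or_num += a * d / n
        or_den += b * c / n
        expected_a = (a + b) * (a + c) / n
        diff += a - expected_a
        var += (a + b) * (c + d) * (a + c) * (b + d) / (n * n * (n - 1))
    chi2 = diff * diff / var
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return or_num / or_den, chi2, p_value


# Hypothetical counts for two cohorts (NOT taken from the study).
pooled_or, chi2, p_value = cmh_test([(10, 20, 20, 10), (10, 20, 20, 10)])
print(pooled_or, chi2, p_value)
```

Stratifying by cohort in this way keeps differences in allele frequency between populations from being mistaken for a disease association, which is the usual reason for preferring it over simply summing the counts.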
Crohn’s disease (CD) is a complex genetic disorder for which more than 140 genes have been identified using genome-wide association studies (GWAS). However, the genetic architecture of the trait remains largely unknown. The recent development of machine learning (ML) approaches prompted us to apply them to classify healthy and diseased people according to their genomic information. The Immunochip dataset containing 18,227 CD patients and 34,050 healthy controls enrolled and genotyped by the international Inflammatory Bowel Disease genetic consortium (IIBDGC) was re-analyzed using a set of ML methods: penalized logistic regression (LR), gradient boosted trees (GBT) and artificial neural networks (NN). The main score used to compare the methods was the area under the ROC curve (AUC). An assessment of the impact of quality control (QC), imputation and coding methods on LR results showed that QC methods and imputation of missing genotypes may artificially increase the scores. In contrast, neither the patient/control ratio nor marker preselection or coding strategies significantly affected the results. LR methods, including Lasso, Ridge and ElasticNet, provided similar results with a maximum AUC of 0.80. GBT methods such as XGBoost, LightGBM and CatBoost, together with dense NN with one or more hidden layers, provided similar AUC values, suggesting limited epistatic effects in the genetic architecture of the trait. ML methods detected nearly all the genetic variants previously identified by GWAS among the best predictors, plus additional predictors with lower effects. The robustness and complementarity of the different methods were also studied. Compared to LR, non-linear models such as GBT or NN may provide robust complementary approaches to identify and classify genetic markers.
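The AUC used in this abstract to compare methods can be read as the probability that a randomly chosen case receives a higher predicted score than a randomly chosen control. A minimal sketch of that pairwise definition is given below; the function name `auc` and the toy labels and scores are assumptions for illustration, not the study's data or pipeline, and the quadratic loop is only practical for small inputs.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: 1 for cases, 0 for controls.
    scores: model outputs, higher meaning "more likely a case".
    Ties between a case and a control count as half a win.
    """
    case_scores = [s for y, s in zip(labels, scores) if y == 1]
    control_scores = [s for y, s in zip(labels, scores) if y == 0]
    if not case_scores or not control_scores:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for cs in case_scores:
        for ks in control_scores:
            if cs > ks:
                wins += 1.0
            elif cs == ks:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Toy example (hypothetical values): 3 of the 4 case/control pairs
# are ranked correctly.
print(auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1]))  # → 0.75
```

Under this reading, an AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation of cases from controls.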
The major histocompatibility complex (MHC) on chromosome 6p is an established risk locus for ulcerative colitis (UC) and Crohn’s disease (CD). We aimed to better define MHC association signals in UC and CD by combining data from dense single nucleotide polymorphism (SNP) genotyping and from imputation of classical HLA types, their constituent SNPs and corresponding amino acids in 562 UC, 611 CD, and 1,428 control subjects. Univariate and multivariate association analyses were performed, controlling for ancestry. In univariate analyses, absence of the rs9269955 C allele was strongly associated with risk for UC (P = 2.67×10−13). rs9269955 is a SNP in the codon for amino acid position 11 of HLA-DRβ1, located in the P6 pocket of the HLA-DR antigen binding cleft. This amino acid position was also the most significantly UC-associated amino acid in omnibus tests (P = 2.68×10−13). Multivariate modeling identified rs9269955-C and 13 other variants as best predicting UC versus control status. In contrast, there was only suggestive association evidence between the MHC and CD. Taken together, these data demonstrate that variation at amino acid 11 of HLA-DRβ1, in the P6 pocket of the HLA-DR complex antigen binding cleft, is a major determinant of chromosome 6p association with ulcerative colitis.