Crohn’s disease (CD) and ulcerative colitis (UC), the two common forms of inflammatory bowel disease (IBD), affect over 2.5 million people of European ancestry with rising prevalence in other populations1. Genome-wide association studies (GWAS) and subsequent meta-analyses of CD and UC2,3 as separate phenotypes implicated previously unsuspected mechanisms, such as autophagy4, in pathogenesis and showed that some IBD loci are shared with other inflammatory diseases5. Here we expand knowledge of relevant pathways by undertaking a meta-analysis of CD and UC genome-wide association scans, with validation of significant findings in more than 75,000 cases and controls. We identify 71 new associations, for a total of 163 IBD loci that meet genome-wide significance thresholds. Most loci contribute to both phenotypes, and both directional and balancing selection effects are evident. Many IBD loci are also implicated in other immune-mediated disorders, most notably with ankylosing spondylitis and psoriasis. We also observe striking overlap between susceptibility loci for IBD and mycobacterial infection. Gene co-expression network analysis emphasizes this relationship, with pathways shared between host responses to mycobacteria and those predisposing to IBD.
Crohn’s disease (CD), a form of inflammatory bowel disease, has a higher prevalence in Ashkenazi Jewish than in non-Jewish European populations. To define the role of non-synonymous mutations, we performed exome sequencing of Ashkenazi Jewish patients with CD, followed by array-based genotyping and association analysis in 2,066 CD cases and 3,633 healthy controls. We detected association signals in the LRRK2 gene that conferred CD risk (N2081D variant, P=9.5×10−10) or protection (N551K variant, tagging R1398H-associated haplotype, P=3.3×10−8). These variants affected CD age of onset, disease location, LRRK2 activity, and autophagy. Bayesian network analysis of CD patient intestinal tissue further implicated LRRK2 in CD pathogenesis. Analysis of the extended LRRK2 locus in 24,570 CD cases, patients with Parkinson’s disease (PD), and healthy controls revealed extensive pleiotropy, with similar genetic effects between CD and PD in both Ashkenazi Jewish and non-Jewish cohorts. The LRRK2 N2081D CD risk allele is located in the same kinase domain as G2019S, a mutation that is the major genetic cause of familial and sporadic PD. Like the G2019S mutation, the N2081D variant is associated with increased kinase activity, whereas neither N551K nor R1398H on the protective haplotype altered kinase activity. R1398H, but not N551K, increased GTPase activity, thereby deactivating LRRK2. The presence of shared LRRK2 alleles in CD and PD provides refined insight into disease mechanisms and may have major implications for the treatment of these two seemingly unrelated diseases.
The Ashkenazi Jewish (AJ) population is a genetic isolate close to European and Middle Eastern groups, with genetic diversity patterns conducive to disease mapping. Here we report high-depth sequencing of 128 complete genomes of AJ controls. Compared with European samples, our AJ panel has 47% more novel variants per genome and is eightfold more effective at filtering benign variants out of AJ clinical genomes. Our panel improves imputation accuracy for AJ SNP arrays by 28%, and covers at least one haplotype in E67% of any AJ genome with long, identical-bydescent segments. Reconstruction of recent AJ history from such segments confirms a recent bottleneck of merely E350 individuals. Modelling of ancient histories for AJ and European populations using their joint allele frequency spectrum determines AJ to be an even admixture of European and likely Middle Eastern origins. We date the split between the two ancestral populations to E12-25 Kyr, suggesting a predominantly Near Eastern source for the repopulation of Europe after the Last Glacial Maximum.
BACKGROUND & AIMS Genome-wide association studies (GWAS) have identified 200 inflammatory bowel disease (IBD) loci, but the genetic architecture of Crohn’s disease (CD) and ulcerative colitis (UC) remains incompletely defined. Here we aimed to identify novel associations between IBD and functional genetic variants using the Illumina ExomeChip. METHODS Genotyping was performed in 10,523 IBD cases and 5,726 non-IBD controls. 91,713 functional single nucleotide polymorphism (SNP) loci in coding regions were analyzed. A novel identified association was further replicated in two independent cohorts. We further examined the association of the identified SNP with microbiota from 338 mucosal lavage samples in the Mucosal Luminal Interface (MLI) cohort measured using 16S sequencing. RESULTS We identified an association between CD and a missense variant encoding alanine (Ala) or threonine (Thr) at position 391 in the zinc transporter solute carrier family 39, member 8 protein (SLC39A8 Ala391Thr, rs13107325) and replicated the association with CD in two replication cohorts (combined meta-analysis p=5.55×10−13). This variant has previously been associated with distinct phenotypes including obesity, lipid levels, blood pressure and schizophrenia. We subsequently determined that the CD-risk allele was associated with altered colonic mucosal microbiome composition in both healthy controls (p=0.009) and CD cases (p=0.0009). Moreover, microbes depleted in healthy carriers strongly overlap with those reduced in CD patients (p=9.24×10−16) and overweight individuals (p=6.73×10−16). CONCLUSIONS Our results suggest that an SLC39A8-dependent shift in the gut microbiome could explain its pleiotropic effects on multiple complex diseases including CD.
Background & Aims Defects in intestinal innate defense systems predispose patients to inflammatory bowel disease (IBD). Reactive oxygen species (ROS) generated by nicotinamide-adenine dinucleotide phosphate (NADPH) oxidases in the mucosal barrier maintain gut homeostasis and defend against pathogenic attack. We hypothesized that molecular genetic defects in intestinal NADPH oxidases might be present in children with IBD. Methods After targeted exome sequencing of epithelial NADPH oxidases NOX1 and DUOX2 on 209 children with very early onset inflammatory bowel disease (VEOIBD), the identified mutations were validated using Sanger Sequencing. A structural analysis of NOX1 and DUOX2 variants was performed by homology in silico modeling. The functional characterization included ROS generation in model cell lines and in in vivo transduced murine crypts, protein expression, intracellular localization, and cell-based infection studies with the enteric pathogens Campylobacter jejuni and enteropathogenic Escherichia coli. Results We identified missense mutations in NOX1 (c.988G>A, p.Pro330Ser; c.967G>A, p.Asp360Asn) and DUOX2 (c.4474G>A, p.Arg1211Cys; c.3631C>T, p.Arg1492Cys) in 5 of 209 VEOIBD patients. The NOX1 p.Asp360Asn variant was replicated in a male Ashkenazi Jewish ulcerative colitis cohort. All NOX1 and DUOX2 variants showed reduced ROS production compared with wild-type enzymes. Despite appropriate cellular localization and comparable pathogen-stimulated translocation of altered oxidases, cells harboring NOX1 or DUOX2 variants had defective host resistance to infection with C. jejuni. Conclusions This study identifies the first inactivating missense variants in NOX1 and DUOX2 associated with VEOIBD. Defective ROS production from intestinal epithelial cells constitutes a risk factor for developing VEOIBD.
Background & Aims Crohn’s disease (CD) has the highest prevalence in Ashkenazi Jewish populations. We sought to identify rare, CD-associated frameshift variants of high functional and statistical effects. Methods We performed exome-sequencing and array-based genotype analyses of 1477 Ashkenazi Jewish individuals with CD and 2614 Ashkenazi Jewish individuals without CD (controls). To validate our findings, we performed genotype analyses of an additional 1515 CD cases and 7052 controls for frameshift mutations in the colony stimulating factor 2 receptor beta common subunit gene (CSF2RB). Intestinal tissues and blood samples were collected from patients with CD; lamina propria leukocytes were isolated and expression of CSF2RB and GMCSF-responsive cells were defined by mass cytometry (CyTOF analysis). Variants of CSF2RB were transfected into HEK293 cells and expression and functions of gene products were compared. Results In the discovery cohort, we associated CD with a frameshift mutation in CSF2RB (P=8.52×10–4); the finding was validated in the replication cohort (combined P=3.42×10–6). Incubation of intestinal lamina propria leukocytes with GMCSF resulted in high levels of phosphorylation of STAT5 and lesser increases in phosphorylation of ERK and AKT. Cells co-transfected with full-length and mutant forms of CSF2RB had reduced pSTAT5 following stimulation with GMCSF, compared to cells transfected with control CSF2RB, indicating a dominant negative effect of the mutant gene. Monocytes from patients with CD who were heterozygous for the frameshift mutation (6% of CD cases analyzed) had reduced responses to GMCSF and markedly decreased activity of aldehyde dehydrogenase; activity of this enzyme has been associated with immune tolerance. Conclusions In a genetic analysis of Ashkenazi Jewish individuals, we associated CD with a frameshift mutation in CSF2RB. Intestinal monocytes from carriers of this mutation had reduced responses to GMCSF, providing an additional mechanism for alterations to the innate immune response in individuals with CD.
Although many feature selection methods for classification have been developed, there is a need to identify genes in high-dimensional data with censored survival outcomes. Traditional methods for gene selection in classification problems have several drawbacks. First, the majority of the gene selection approaches for classification are single-gene based. Second, many of the gene selection procedures are not embedded within the algorithm itself. The technique of random forests has been found to perform well in high dimensional data settings with survival outcomes. It also has an embedded feature to identify variables of importance. Therefore, it is an ideal candidate for gene selection in high dimensional data with survival outcomes. In this paper, we develop a novel method based on the random forests to identify a set of prognostic genes. We compare our method with several machine learning methods and various node split criteria using several real data sets. Our method performed well in both simulations and real data analysis. Additionally, we have shown the advantages of our approach over single-gene based approaches. Our method incorporates multivariate correlations in microarray data for survival outcomes. The described method allows us to best utilize the information available from microarray data with survival outcomes.
The Ashkenazi Jewish population has a several-fold higher prevalence of Crohn’s disease compared to non-Jewish European ancestry populations and has a unique genetic history. Haplotype association is critical to Crohn’s disease etiology in this population, most notably at NOD2, in which three causal, uncommon, and conditionally independent NOD2 variants reside on a shared background haplotype. We present an analysis of extended haplotypes which showed significantly greater association to Crohn’s disease in the Ashkenazi Jewish population compared to a non-Jewish population (145 haplotypes and no haplotypes with P-value < 10−3, respectively). Two haplotype regions, one each on chromosomes 16 and 21, conferred increased disease risk within established Crohn’s disease loci. We performed exome sequencing of 55 Ashkenazi Jewish individuals and follow-up genotyping focused on variants in these two regions. We observed Ashkenazi Jewish-specific nominal association at R755C in TRPM2 on chromosome 21. Within the chromosome 16 region, R642S of HEATR3 and rs9922362 of BRD7 showed genome-wide significance. Expression studies of HEATR3 demonstrated a positive role in NOD2-mediated NF-κB signaling. The BRD7 signal showed conditional dependence with only the downstream rare Crohn’s disease-causal variants in NOD2, but not with the background haplotype; this elaborates NOD2 as a key illustration of synthetic association.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.