This is a pre-copyedited, author-produced version of an article accepted for publication in Nature Genetics following peer review. The version of record "Deciphering colorectal cancer genetics through multi-omic analysis of 100,204 cases and 154,587 controls of European and east Asian ancestries"
Alcohol is a consistently identified risk factor for colon cancer. However, the molecular mechanism underlying its effect on normal colon crypt cells remains poorly understood. We employed RNA-sequencing to asses transcriptomic response to ethanol exposure (0.2% vol:vol) in 3D organoid lines derived from healthy colon (n = 34). Paired regression analysis identified 2,162 differentially expressed genes in response to ethanol. When stratified by colon location, a far greater number of differentially expressed genes were identified in organoids derived from the left versus right colon, many of which corresponded to cell-type specific markers. To test the hypothesis that the effects of ethanol treatment on colon organoid populations were in part due to differential cell composition, we incorporated external single cell RNA-sequencing data from normal colon biopsies to estimate cellular proportions following single cell deconvolution. We inferred cell-type-specific changes, and observed an increase in transit amplifying cells following ethanol exposure that was greater in organoids from the left than right colon, with a concomitant decrease in more differentiated cells. If this occurs in the colon following alcohol consumption, this would lead to an increased zone of cells in the lower crypt where conditions are optimal for cell division and the potential to develop mutations.
Mechanisms underlying aspirin chemoprevention of colorectal cancer remain unclear. Prior studies have been limited because of the inability of preclinical models to recapitulate human normal colon epithelium or cellular heterogeneity present in mucosal biopsies. To overcome some of these obstacles, we performed in vitro aspirin treatment of colon organoids derived from normal mucosal biopsies to reveal transcriptional networks relevant to aspirin chemoprevention. Colon organoids derived from 38 healthy individuals undergoing endoscopy were treated with 50 μmol/L aspirin or vehicle control for 72 hours and subjected to bulk RNA sequencing. Paired regression analysis using DESeq2 identified differentially expressed genes (DEG) associated with aspirin treatment. Cellular composition was determined using CIBERSORTx. Aspirin treatment was associated with 1,154 significant (q < 0.10) DEGs prior to deconvolution. We provide replication of these findings in an independent population-based RNA-sequencing dataset of mucosal biopsies (BarcUVa-Seq), where a significant enrichment for overlap of DEGs was observed (P < 2.2E−16). Single-cell deconvolution revealed changes in cell composition, including a decrease in transit-amplifying cells following aspirin treatment (P = 0.01). Following deconvolution, DEGs included novel putative targets for aspirin such as TRABD2A (q = 0.055), a negative regulator of Wnt signaling. Weighted gene co-expression network analysis identified 12 significant modules, including two that contained hubs for EGFR and PTGES2, the latter being previously implicated in aspirin chemoprevention. In summary, aspirin treatment of patient-derived colon organoids using physiologically relevant doses resulted in transcriptome-wide changes that reveal altered cell composition and improved understanding of transcriptional pathways, providing novel insight into its chemopreventive properties. Prevention Relevance: Numerous studies have highlighted a role for aspirin in colorectal cancer chemoprevention, though the mechanisms driving this association remain unclear. We addressed this by showing that aspirin treatment of normal colon organoids diminished the transit-amplifying cell population, inhibited prostaglandin synthesis, and dysregulated expression of novel genes implicated in colon tumorigenesis.
Colon biopsies from superficial mucosa Gene expression and alternative splicing DNA genotyping eQTLs and sQTLs Blood samples Colon Transcriptome Explorer Enrichment at regulatory regions and trait and diseaseassociated loci Differential expression between colon subsites …ACTGCATGCTACC… SUMMARYWe profiled gene expression and alternative splicing of nonneoplastic colon from biopsy specimens from 445 healthy individuals. We showed that single-nucleotide polymorphisms associated with these profiles are enriched in disease-associated loci, including colorectal cancer and inflammatory bowel disease. BACKGROUND & AIMS:The association of genetic variation with tissue-specific gene expression and alternative splicing guides functional characterization of complex trait-associated loci and may suggest novel genes implicated in disease. Here, our aims were as follows: (1) to generate reference profiles of colon mucosa gene expression and alternative splicing and compare them across colon subsites (ascending, transverse, and descending), (2) to identify expression and splicing quantitative trait loci (QTLs), (3) to find traits for which identified QTLs contribute to single-nucleotide polymorphism (SNP)based heritability, (4) to propose candidate effector genes, and (5) to provide a web-based visualization resource. METHODS:We collected colonic mucosal biopsy specimens from 485 healthy adults and performed bulk RNA sequencing. We performed genome-wide SNP genotyping from blood leukocytes. Statistical approaches and bioinformatics software were used for QTL identification and downstream analyses. RESULTS:We provided a complete quantification of gene expression and alternative splicing across colon subsites and described their differences. We identified thousands of expression and splicing QTLs and defined their enrichment at genome-wide regulatory regions. We found that part of the SNP-based heritability of diseases affecting colon tissue, such as colorectal cancer and inflammatory bowel disease, but also of diseases affecting other tissues, such as psychiatric conditions, can be explained by the identified QTLs. We provided candidate
Tobacco smoke and red/processed meats are well-known risk factors for colorectal cancer (CRC). Most research has focused on studies of normal colon biopsies in epidemiologic studies or treatment of CRC cell lines in vitro . These studies are often constrained by challenges with accuracy of self-report data or, in the case of CRC cell lines, small sample sizes and lack of relationship to normal tissue at risk. In an attempt to address some of these limitations, we performed a 24-hour treatment of a representative carcinogens cocktail in 37 independent organoid lines derived from normal colon biopsies. Machine learning algorithms were applied to bulk RNA-sequencing and revealed cellular composition changes in colon organoids. We identified 738 differentially expressed genes in response to carcinogens exposure. Network analysis identified significantly different modules of co-expression, that included genes related to MSI-H tumor biology, and genes previously implicated in CRC through genome-wide association studies. Our study helps to better define the molecular effects of representative carcinogens from smoking and red/processed meat in normal colon epithelial cells and in the etiology of the MSI-H subtype of CRC, and suggests an overlap between molecular mechanisms involved in inherited and environmental CRC risk.
INTRODUCTION: Colorectal cancer is a common malignancy that can be cured when detected early, but recurrence among survivors is a persistent risk. A field effect of cancer in the colon has been reported and could have implications for surveillance, but studies to date have been limited. A joint analysis of pooled transcriptomic data from all available bulk RNA-sequencing data sets of healthy, histologically normal tumor-adjacent, and tumor tissues was performed to provide an unbiased assessment of field effect. METHODS: A novel bulk RNA-sequencing data set from biopsies of nondiseased colon from screening colonoscopy along with published data sets from the Genomic Data Commons and Sequence Read Archive were considered for inclusion. Analyses were limited to samples with a quantified read depth of at least 10 million reads. Transcript abundance was estimated with Salmon, and downstream analysis was performed in R. RESULTS: A total of 1,139 samples were analyzed in 3 cohorts. The primary cohort consisted of 834 independent samples from 8 independent data sets, including 462 healthy, 61 tumor-adjacent, and 311 tumor samples. Tumor-adjacent gene expression was found to represent an intermediate state between healthy and tumor expression. Among differentially expressed genes in tumor-adjacent samples, 1,143 were expressed in patterns similar to tumor samples, and these genes were enriched for cancer-associated pathways. DISCUSSION: Novel insights into the field effect in colorectal cancer were generated in this mega-analysis of the colorectal transcriptome. Oncogenic features that might help explain metachronous lesions in cancer survivors and could be used for surveillance and risk stratification were identified.
Genome-wide association studies (GWAS) have identified single-nucleotide polymorphisms (SNPs) associated with glioma risk on 20q13.33, but the biological mechanisms underlying this association are unknown. We tested the hypothesis that a functional SNP on 20q13.33 impacted the activity of an enhancer, leading to an altered expression of nearby genes. To identify candidate functional SNPs, we identified all SNPs in linkage disequilibrium with the risk-associated SNP rs2297440 that mapped to putative enhancers. Putative enhancers containing candidate functional SNPs were tested for allele-specific effects in luciferase enhancer activity assays against glioblastoma multiforme (GBM) cell lines. An enhancer containing SNP rs3761124 exhibited allele-specific effects on activity. Deletion of this enhancer by CRISPR-Cas9 editing in GBM cell lines correlated with an altered expression of multiple genes, including STMN3, RTEL1, RTEL1-TNFRSF6B, GMEB2, and SRMS. Expression quantitative trait loci (eQTL) analyses using nondiseased brain samples, isocitrate dehydrogenase 1 (IDH1) wild-type glioma, and neurodevelopmental tissues showed STMN3 to be a consistent significant eQTL with rs3761124. RTEL1 and GMEB2 were also significant eQTLs in the context of early CNS development and/or in IDH1 wild-type glioma. We provide evidence that rs3761124 is a functional variant on 20q13.33 related to glioma/GBM risk that modulates the expression of STMN3 and potentially other genes across diverse cellular contexts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.