To further dissect the genetic architecture of colorectal cancer (CRC), we performed whole-genome sequencing of 1,439 cases and 720 controls, imputed discovered sequence variants and Haplotype Reference Consortium panel variants into genome-wide association study data, and tested for association in 34,869 cases and 29,051 controls. Findings were followed up in an additional 23,262 cases and 38,296 controls. We discovered a strongly protective 0.3% frequency variant signal at
CHD1
. In a combined meta-analysis of 125,478 individuals, we identified 40 new independent signals at
P
<5×10
−8
, bringing the number of known independent signals for CRC to approximately 100. New signals implicate lower-frequency variants, Krüppel-like factors, Hedgehog signaling, Hippo-YAP signaling, long noncoding RNAs, somatic drivers, and support a role of immune function. Heritability analyses suggest that CRC risk is highly polygenic, and larger, more comprehensive studies enabling rare variant analysis will improve understanding of underlying biology, and impact personalized screening strategies and drug development.
ObjectiveAn understanding of the etiologic heterogeneity of colorectal cancer (CRC) is critical for improving precision prevention, including individualized screening recommendations and the discovery of novel drug targets and repurposable drug candidates for chemoprevention. Known differences in molecular characteristics and environmental risk factors among tumors arising in different locations of the colorectum suggest partly distinct mechanisms of carcinogenesis. The extent to which the contribution of inherited genetic risk factors for CRC differs by anatomical subsite of the primary tumor has not been examined.DesignTo identify new anatomical subsite-specific risk loci, we performed genome-wide association study (GWAS) meta-analyses including data of 48 214 CRC cases and 64 159 controls of European ancestry. We characterised effect heterogeneity at CRC risk loci using multinomial modelling.ResultsWe identified 13 loci that reached genome-wide significance (p<5×10−8) and that were not reported by previous GWASs for overall CRC risk. Multiple lines of evidence support candidate genes at several of these loci. We detected substantial heterogeneity between anatomical subsites. Just over half (61) of 109 known and new risk variants showed no evidence for heterogeneity. In contrast, 22 variants showed association with distal CRC (including rectal cancer), but no evidence for association or an attenuated association with proximal CRC. For two loci, there was strong evidence for effects confined to proximal colon cancer.ConclusionGenetic architectures of proximal and distal CRC are partly distinct. Studies of risk factors and mechanisms of carcinogenesis, and precision prevention strategies should take into consideration the anatomical subsite of the tumour.
Colorectal cancer (CRC) shows aggregation in some families but no alterations in the known hereditary CRC genes. We aimed to identify new candidate genes which are potentially involved in germline predisposition to familial CRC. An integrated analysis of germline and tumor whole-exome sequencing data was performed in 18 unrelated CRC families. Deleterious single nucleotide variants (SNV), short insertions and deletions (indels), copy number variants (CNVs) and loss of heterozygosity (LOH) were assessed as candidates for first germline or second somatic hits. Candidate tumor suppressor genes were selected when alterations were detected in both germline and somatic DNA, fulfilling Knudson’s two-hit hypothesis. Somatic mutational profiling and signature analysis were also performed. A series of germline-somatic variant pairs were detected. In all cases, the first hit was presented as a rare SNV/indel, whereas the second hit was either a different SNV (3 genes) or LOH affecting the same gene (141 genes). BRCA2, BLM, ERCC2, RECQL, REV3L and RIF1 were among the most promising candidate genes for germline CRC predisposition. The identification of new candidate genes involved in familial CRC could be achieved by our integrated analysis. Further functional studies and replication in additional cohorts are required to confirm the selected candidates.
The serrated polyposis syndrome (SPS) is the most common and yet underdiagnosed colorectal polyposis syndrome. It is characterized by multiple and/or large colonic serrated polyps and a higher associated risk for colorectal cancer (CRC). The main objective of this study was to identify new candidate genes involved in the germline predisposition to SPS/CRC. Thirty-nine SPS patients from 16 families (≥2 patients per family) were recruited without alterations in well-known hereditary CRC genes, and germline and somatic whole-exome sequencing were performed. Germline rare variants with plausible pathogenicity, located in genes involved in cancer development, senescence and epigenetic regulation were selected. Somatic mutational profiling and signature analysis was pursued in one sample per family, when possible. After data filtering, ANXA10, ASXL1, CFTR, DOT1L, HIC1, INO80, KLF3, MCM3AP, MCM8, PDLIM2, POLD1, TP53BP1, WNK2 and WRN were highlighted as the more promising candidate genes for SPS germline predisposition with potentially pathogenic variants shared within families. Somatic analysis characterized mutational profiles in advanced serrated polyps/tumors, revealing a high proportion of hypermutated samples, with a prevalence of clock-like mutational signatures in most samples and the presence of DNA mismatch repair-defective signatures in some cases. In conclusion, we identified new candidate genes to be involved in familial SPS. Further functional studies and replication in additional cohorts are required to confirm the selected candidates.
Colorectal cancer (CRC) is a complex disorder for which the majority of the underlying germline predisposition factors remain still unidentified. Here, we combined whole‐exome sequencing (WES) and linkage analysis in families with multiple relatives affected by CRC to identify candidate genes harboring rare variants with potential high‐penetrance effects. Forty‐seven affected subjects from 18 extended CRC families underwent WES. Genome‐wide linkage analysis was performed under linear and exponential models. Suggestive linkage peaks were identified on chromosomes 1q22–q24.2 (maxSNP = rs2134095; LODlinear = 2.38, LODexp = 2.196), 7q31.2–q34 (maxSNP = rs6953296; LODlinear = 2.197, LODexp = 2.149) and 10q21.2–q23.1 (maxSNP = rs1904589; LODlinear = 1.445, LODexp = 2.195). These linkage signals were replicated in 10 independent sets of random markers from each of these regions. To assess the contribution of rare variants predicted to be pathogenic, we performed a family‐based segregation test with 89 rare variants predicted to be deleterious from 78 genes under the linkage intervals. This analysis showed significant segregation of rare variants with CRC in 18 genes (weighted p‐value > 0.0028). Protein network analysis and functional evaluation were used to suggest a plausible candidate gene for germline CRC predisposition. Etiologic rare variants implicated in cancer germline predisposition may be identified by combining traditional linkage with WES data. This approach can be used with already available NGS data from families with several sequenced members to further identify candidate genes involved germline predisposition to disease. This approach resulted in one candidate gene associated with increased risk of CRC but needs evidence from further studies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.