Major international projects are now underway aimed at creating a comprehensive catalog of all genes responsible for the initiation and progression of cancer. These studies involve sequencing of matched tumor–normal samples followed by mathematical analysis to identify those genes in which mutations occur more frequently than expected by random chance. Here, we describe a fundamental problem with cancer genome studies: as the sample size increases, the list of putatively significant genes produced by current analytical methods burgeons into the hundreds. The list includes many implausible genes (such as those encoding olfactory receptors and the muscle protein titin), suggesting extensive false positive findings that overshadow true driver events. Here, we show that this problem stems largely from mutational heterogeneity and provide a novel analytical methodology, MutSigCV, for resolving the problem. We apply MutSigCV to exome sequences from 3,083 tumor-normal pairs and discover extraordinary variation in (i) mutation frequency and spectrum within cancer types, which shed light on mutational processes and disease etiology, and (ii) mutation frequency across the genome, which is strongly correlated with DNA replication timing and also with transcriptional activity. By incorporating mutational heterogeneity into the analyses, MutSigCV is able to eliminate most of the apparent artefactual findings and allow true cancer genes to rise to attention.
The incidence of esophageal adenocarcinoma (EAC) has risen 600% over the last 30 years. With a five-year survival rate of 15%, identification of new therapeutic targets for EAC is greatly important. We analyze the mutation spectra from whole exome sequencing of 149 EAC tumors/normal pairs, 15 of which have also been subjected to whole genome sequencing. We identify a mutational signature defined by a high prevalence of A to C transversions at AA dinucleotides. Statistical analysis of exome data identified significantly mutated 26 genes. Of these genes, four (TP53, CDKN2A, SMAD4, and PIK3CA) have been previously implicated in EAC. The novel significantly mutated genes include chromatin modifying factors and candidate contributors: SPG20, TLR4, ELMO1, and DOCK2. Functional analyses of EAC-derived mutations in ELMO1 reveal increased cellular invasion. Therefore, we suggest a new hypothesis about the potential activation of the RAC1 pathway to be a contributor to EAC tumorigenesis.
A more detailed understanding of the somatic genetic events that drive gastrointestinal adenocarcinomas is necessary to improve diagnosis and therapy. Using data from high-density genomic profiling arrays, we conducted an analysis of somatic copy-number aberrations (SCNAs) in 486 gastrointestinal adenocarcinomas including 296 esophageal and gastric cancers. Focal amplifications were substantially more prevalent in gastric/esophageal adenocarcinomas than colorectal tumors. We identified 64 regions of significant recurrent amplification and deletion, some shared and others unique to the adenocarcinoma types examined. Amplified genes were noted in 37% of gastric/esophageal tumors, including in therapeutically targetable kinases such as ERBB2, FGFR1, FGFR2, EGFR, and MET, suggesting the potential utility of genomic amplifications as biomarkers to guide therapy of gastric and esophageal cancers where targeted therapeutics have been less developed compared to colorectal cancers. Amplified loci implicated genes with known involvement in carcinogenesis but also pointed to regions harboring potentially novel cancer genes, including a recurrent deletion found in 15% of esophageal tumors where the Runt transcription factor subunit RUNX1 was implicated, including by functional experiments in tissue culture. Together, our results defined genomic features that were common and distinct to various gut-derived adenocarcinomas, potentially informing novel opportunities for targeted therapeutic interventions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.