No abstract
SummaryThe role of germline genetics in shaping the tumor immune landscape is largely unknown. Using genotypes from >9,000 individuals in The Cancer Genome Atlas, we investigated the association of common and rare variants with 139 well-defined immune traits. Our analysis of common variants identified 10 immune traits with significant heritability estimates, and an additional 23 with suggestive heritability, including estimates of T-cell subset abundance and interferon signaling. We performed genome-wide association on the 33 heritable traits and identified 23 genome-wide significant loci associated with at least one immune trait, including SNPs in the IFIH1 locus previously associated with several autoimmune diseases. We also found significant associations between immune traits and pathogenic or likely-pathogenic rare variants in BRCA1 and in genes functionally linked to telomere stabilization, and Wnt/Beta-catenin signaling. We conclude that germline genetic variants significantly impact the composition and functional orientation of the tumor immune microenvironment.
SummaryAlternative splicing changes are frequently observed in cancer and are starting to be recognized as important signatures for tumor progression and therapy. However, their functional impact and relevance to tumorigenesis remains mostly unknown. We carried out a systematic analysis to characterize the potential functional consequences of alternative splicing changes in thousands of tumor samples. This analysis revealed that a subset of alternative splicing changes affect protein domain families that are frequently mutated in tumors and potentially disrupt proteinprotein interactions in cancer-related pathways. Moreover, there was a negative correlation between the number of these alternative splicing changes in a sample and the number of somatic mutations in drivers. We propose that a subset of the alternative splicing changes observed in tumors may represent independent oncogenic processes that could be relevant to explain the functional transformations in cancer and some of them could potentially be considered alternative splicing drivers (AS-drivers).
The protein structure field is experiencing a revolution. From the increased throughput of techniques to determine experimental structures, to developments such as cryo-EM that allow us to find the structures of large protein complexes or, more recently, the development of artificial intelligence tools, such as AlphaFold, that can predict with high accuracy the folding of proteins for which the availability of homology templates is limited. Here we quantify the effect of the recently released AlphaFold database of protein structural models in our knowledge on human proteins. Our results indicate that our current baseline for structural coverage of 47%, considering experimentally-derived or template-based homology models, elevates up to 75% when including AlphaFold predictions, reducing the fraction of dark proteome from 22% to just 7% and the number of proteins without structural information from 4.832 to just 29. Furthermore, although the coverage of disease-associated genes and mutations was near complete before AlphaFold release (70% of ClinVar pathogenic mutations and 74% of oncogenic mutations), AlphaFold models still provide an additional coverage of 2% to 14% of these critically important sets of biomedical genes and mutations. We also provide several examples of disease-associated proteins where AlphaFold provides critical new insights. Overall, our results show that the sequence-structure gap of human proteins has almost disappeared, an outstanding success of direct consequences for the knowledge on the human genome and the derived medical applications.
Despite their critical importance in maintaining the integrity of all cellular pathways, the specific role of mutations on protein-protein interaction (PPI) interfaces as cancer drivers, though known for some specific examples, has not been systematically studied. We analyzed missense somatic mutations in a pan-cancer cohort of 5,989 tumors from 23 projects of The Cancer Genome Atlas (TCGA) for enrichment on PPI interfaces using e-Driver, an algorithm to analyze the mutation pattern of specific protein regions such as PPI interfaces. We identified 128 PPI interfaces enriched in somatic cancer mutations. Our results support the notion that many mutations in well-established cancer driver genes, particularly those in critical network positions, act by altering PPI interfaces. Finally, focusing on individual interfaces we are also able to show how tumors driven by the same gene can have different behaviors, including patient outcomes, depending on whether specific interfaces are mutated or not.
The analysis of tumor genomes is a key step to understand many aspects of cancer biology, ranging from its aetiology to its treatment or the oncogenic processes driving cell transformation. The first crucial step in the analysis of any tumor genome is the identification of somatic genetic variants that cancer cells have acquired during their evolution. For that purpose, a wide range of somatic variant callers have been created in recent years. However, it is still unclear which variant caller, or combination of them, is best suited to analyze tumor sequencing data.Here we present a study to elucidate if different variant callers (MuSE, MuTect2, SomaticSniper, VarScan2) and strategies to combine them (Consensus and Union) lead to different downstream results in these three important aspects of cancer genomics: driver genes, mutational signatures and clinically actionable targets identification. To this end, we assess their performance in five different projects from The Cancer Genome Atlas (TCGA). Our results show that variant calling decisions have a significant impact on these downstream analyses, rendering important differences in driver genes prediction and mutational status among variant call sets, as well as in the identification of clinically actionable targets. More importantly, it seems that there is not a one-size-fits-all variant calling strategy, as the optimal decision seems to depend on both, the cancer type and the goal of the analysis.Contact: eduard.porta@bsc.es; alfonso.valencia@bsc.es Fig. 6. Patients with at least one clinically actionable target identified by Cancer Genome Interpreter (CGI) classified per clinical evidence level.
Motivation The analysis of cancer genomes provides fundamental information about its aetiology, the processes driving cell transformation or potential treatments. While researchers and clinicians are often only interested in the identification of oncogenic mutations, actionable variants or mutational signatures, the first crucial step in the analysis of any tumor genome is the identification of somatic variants in cancer cells (i.e., those that have been acquired during their evolution). For that purpose, a wide range of computational tools have been developed in recent years to detect somatic mutations in sequencing data from tumor samples. While there have been some efforts to benchmark somatic variant calling tools and strategies, the extent to which variant calling decisions impact the results of downstream analyses of tumor genomes remains unknown. Results Here we quantify the impact of variant calling decisions by comparing the results obtained in three important analyses of cancer genomics data (identification of cancer driver genes, quantification of mutational signatures and detection of clinically actionable variants) when changing the somatic variant caller (MuSE, MuTect2, SomaticSniper, VarScan2) or the strategy to combine them (Consensus of two, Consensus of three and Union) across all 33 cancer types from The Cancer Genome Atlas. Our results show that variant calling decisions have a significant impact on these analyses, creating important differences that could even impact treatment decisions for some patients. Moreover, the Consensus of three calling strategy to combine the output of multiple variant calling tools, a very widely used strategy by the research community, can lead to the loss of some cancer driver genes and actionable mutations. Overall, our results highlight the limitations of widespread practices within the cancer genomics community and point to important differences in critical analyses of tumor sequencing data depending on variant calling, affecting even the identification of clinically actionable variants. Availability Code is available at https://github.com/carlosgarciaprieto/VariantCallingClinicalBenchmark Supplementary information Supplementary data are available at Bioinformatics online.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.