Summary Structural variants (SVs) are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight SV classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype-blocks in 26 human populations. Analyzing this set, we identify numerous gene-intersecting SVs exhibiting population stratification and describe naturally occurring homozygous gene knockouts suggesting the dispensability of a variety of human genes. We demonstrate that SVs are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of SV complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex SVs with multiple breakpoints likely formed through individual mutational events. Our catalog will enhance future studies into SV demography, functional impact and disease association.
Motivation: The discovery of genomic structural variants (SVs) at high sensitivity and specificity is an essential requirement for characterizing naturally occurring variation and for understanding pathological somatic rearrangements in personal genome sequencing data. Of particular interest are integrated methods that accurately identify simple and complex rearrangements in heterogeneous sequencing datasets at single-nucleotide resolution, as an optimal basis for investigating the formation mechanisms and functional consequences of SVs.Results: We have developed an SV discovery method, called DELLY, that integrates short insert paired-ends, long-range mate-pairs and split-read alignments to accurately delineate genomic rearrangements at single-nucleotide resolution. DELLY is suitable for detecting copy-number variable deletion and tandem duplication events as well as balanced rearrangements such as inversions or reciprocal translocations. DELLY, thus, enables to ascertain the full spectrum of genomic rearrangements, including complex events. On simulated data, DELLY compares favorably to other SV prediction methods across a wide range of sequencing parameters. On real data, DELLY reliably uncovers SVs from the 1000 Genomes Project and cancer genomes, and validation experiments of randomly selected deletion loci show a high specificity.Availability: DELLY is available at www.korbel.embl.de/software.htmlContact: tobias.rausch@embl.de
Summary Current therapies for medulloblastoma (MB), a highly malignant childhood brain tumor, impose debilitating effects on the developing child, warranting deployment of molecularly targeted treatments with reduced toxicities. Prior studies failed to disclose the full spectrum of driver genes and molecular processes operative in MB subgroups. Herein, we detail the somatic landscape across 491 sequenced MBs and molecular heterogeneity amongst 1,256 epigenetically analyzed cases, identifying subgroup-specific driver alterations including previously unappreciated actionable targets. Driver mutations explained the majority of Group 3 and Group 4 patients, remarkably enhancing previous knowledge. Novel molecular subtypes were differentially enriched for specific driver events, including hotspot in-frame insertions targeting KBTBD4 and ‘enhancer hijacking’ driving PRDM6 activation. Thus, application of integrative genomics to an unprecedented cohort of clinical samples derived from a single childhood cancer entity disclosed a series of new cancer genes and biologically relevant subtype diversity that represent attractive therapeutic targets for treating MB patients.
The Drosophila melanogaster Genetic Reference Panel (DGRP) is a community resource of 205 sequenced inbred lines, derived to improve our understanding of the effects of naturally occurring genetic variation on molecular and organismal phenotypes. We used an integrated genotyping strategy to identify 4,853,802 single nucleotide polymorphisms (SNPs) and 1,296,080 non-SNP variants. Our molecular population genomic analyses show higher deletion than insertion mutation rates and stronger purifying selection on deletions. Weaker selection on insertions than deletions is consistent with our observed distribution of genome size determined by flow cytometry, which is skewed toward larger genomes. Insertion/ deletion and single nucleotide polymorphisms are positively correlated with each other and with local recombination, suggesting that their nonrandom distributions are due to hitchhiking and background selection. Our cytogenetic analysis identified 16 polymorphic inversions in the DGRP. Common inverted and standard karyotypes are genetically divergent and account for most of the variation in relatedness among the DGRP lines. Intriguingly, variation in genome size and many quantitative traits are significantly associated with inversions. Approximately 50% of the DGRP lines are infected with Wolbachia, and four lines have germline insertions of Wolbachia sequences, but effects of Wolbachia infection on quantitative traits are rarely significant. The DGRP complements ongoing efforts to functionally annotate the Drosophila genome. Indeed, 15% of all D. melanogaster genes segregate for potentially damaged proteins in the DGRP, and genome-wide analyses of quantitative traits identify novel candidate genes. The DGRP lines, sequence data, genotypes, quality scores, phenotypes, and analysis and visualization tools are publicly available.[Supplemental material is available for this article.]Studies in Drosophila melanogaster have revealed basic principles and mechanisms underlying fundamental genetic concepts of linkage and recombination and were instrumental in identifying canonical and evolutionarily conserved cell signaling pathways.Most D. melanogaster genes are evolutionarily conserved, leading to fly models for understanding common human diseases and behavioral disorders, dipteran disease vectors, and insects impacting agriculture, medicine, and forensics. Despite nearly a century of research on D. melanogaster, however, a large fraction of its coding and noncoding sequence has no known function (McQuilton et al. 2012). Recent efforts to induce mutations in every protein coding gene utilize transposable elements (Bellen et al. 2004(Bellen et al. , 2011, which have a different spectrum of allelic effects than SNPs and small insertions and deletions (indels). Comprehensive efforts to identify regulatory DNA elements in Drosophila (The Ó 2014 Huang et al.
Summary Medulloblastoma, the most common malignant pediatric brain tumour, is currently treated with non-specific cytotoxic therapies including surgery, whole brain radiation, and aggressive chemotherapy. As medulloblastoma exhibits marked intertumoural heterogeneity, with at least four distinct molecular variants, prior attempts to identify targets for therapy have been underpowered due to small samples sizes. Here we report somatic copy number aberrations (SCNAs) in 1087 unique medulloblastomas. SCNAs are common in medulloblastoma, and are predominantly subgroup enriched. The most common region of focal copy number gain is a tandem duplication of the Parkinson’s disease gene SNCAIP, which is exquisitely restricted to Group 4α. Recurrent translocations of PVT1, including PVT1-MYC and PVT1-NDRG1 that arise through chromothripsis are restricted to Group 3. Numerous targetable SCNAs, including recurrent events targeting TGFβ signaling in Group 3, and NF-κB signaling in Group 4 suggest future avenues for rational, targeted therapy.
SUMMARY Genomic rearrangements are thought to occur progressively during tumor development. Recent findings, however, suggest an alternative mechanism, involving massive chromosome rearrangements in a one-step catastrophic event termed chromothripsis. We report the whole-genome sequencing-based analysis of a Sonic-Hedgehog medulloblastoma (SHH-MB) brain tumor from a patient with a germline TP53 mutation (Li-Fraumeni syndrome), uncovering massive, complex chromosome rearrangements. Integrating TP53 status with microarray and deep sequencing-based DNA rearrangement data in additional patients reveals a striking association between TP53 mutation and chromothripsis in SHH-MBs. Analysis of additional tumor entities substantiates a link between TP53 mutation and chromothripsis, and indicates a context-specific role for p53 in catastrophic DNA rearrangements. Among these, we observed a strong association between somatic TP53 mutations and chromothripsis in acute myeloid leukemia. These findings connect p53 status and chromothripsis in specific tumor types, providing a genetic basis for understanding particularly aggressive subtypes of cancer.
Summary Medulloblastoma is an aggressively-growing tumour, arising in the cerebellum or medulla/brain stem. It is the most common malignant brain tumour in children, and displays tremendous biological and clinical heterogeneity1. Despite recent treatment advances, approximately 40% of children experience tumour recurrence, and 30% will die from their disease. Those who survive often have a significantly reduced quality of life. Four tumour subgroups with distinct clinical, biological and genetic profiles are currently discriminated2,3. WNT tumours, displaying activated wingless pathway signalling, carry a favourable prognosis under current treatment regimens4. SHH tumours show hedgehog pathway activation, and have an intermediate prognosis2. Group 3 & 4 tumours are molecularly less well-characterised, and also present the greatest clinical challenges2,3,5. The full repertoire of genetic events driving this distinction, however, remains unclear. Here we describe an integrative deep-sequencing analysis of 125 tumour-normal pairs. Tetraploidy was identified as a frequent early event in Group 3 & 4 tumours, and a positive correlation between patient age and mutation rate was observed. Several recurrent mutations were identified, both in known medulloblastoma-related genes (CTNNB1, PTCH1, MLL2, SMARCA4) and in genes not previously linked to this tumour (DDX3X, CTDNEP1, KDM6A, TBR1), often in subgroup-specific patterns. RNA-sequencing confirmed these alterations, and revealed the expression of the first medulloblastoma fusion genes. Chromatin modifiers were frequently altered across all subgroups. These findings enhance our understanding of the genomic complexity and heterogeneity underlying medulloblastoma, and provide several potential targets for new therapeutics, especially for Group 3 & 4 patients.
Pilocytic astrocytoma, the most common childhood brain tumor1, is typically associated with mitogen-activated protein kinase (MAPK) pathway alterations2. Surgically inaccessible midline tumors are therapeutically challenging, showing sustained tendency for progression3 and often becoming a chronic disease with substantial morbidities4. Here we describe whole-genome sequencing of 96 pilocytic astrocytomas, with matched RNA sequencing (n=73), conducted by the International Cancer Genome Consortium (ICGC) PedBrain Tumor Project. We identified recurrent activating mutations in FGFR1 and PTPN11 and novel NTRK2 fusion genes in non-cerebellar tumors. New BRAF activating changes were also observed. MAPK pathway alterations affected 100% of tumors analyzed, with no other significant mutations, indicating pilocytic astrocytoma as predominantly a single-pathway disease. Notably, we identified the same FGFR1 mutations in a subset of H3F3A-mutated pediatric glioblastoma with additional alterations in NF15. Our findings thus identify new potential therapeutic targets in distinct subsets of pilocytic astrocytoma and childhood glioblastoma.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.