Somatic genome rearrangements are thought to play important roles in cancer development. We optimized a long-span paired-end-tag (PET) sequencing approach using 10-Kb genomic DNA inserts to study human genome structural variations (SVs). The use of a 10-Kb insert size allows the identification of breakpoints within repetitive or homology-containing regions of a few kilobases in size and results in a higher physical coverage compared with small insert libraries with the same sequencing effort. We have applied this approach to comprehensively characterize the SVs of 15 cancer and two noncancer genomes and used a filtering approach to strongly enrich for somatic SVs in the cancer genomes. Our analyses revealed that most inversions, deletions, and insertions are germ-line SVs, whereas tandem duplications, unpaired inversions, interchromosomal translocations, and complex rearrangements are over-represented among somatic rearrangements in cancer genomes. We demonstrate that the quantitative and connective nature of DNA-PET data is precise in delineating the genealogy of complex rearrangement events, we observe signatures that are compatible with breakage-fusion-bridge cycles, and we discover that large duplications are among the initial rearrangements that trigger genome instability for extensive amplification in epithelial cancers.
Genome rearrangements, a hallmark of cancer, can result in gene fusions with oncogenic properties. Using DNA paired-end-tag (DNA-PET) whole-genome sequencing, we analyzed 15 gastric cancers (GCs) from Southeast Asians. Rearrangements were enriched in open chromatin and shaped by chromatin structure. We identified seven rearrangement hot spots and 136 gene fusions. In three out of 100 GC cases, we found recurrent fusions between CLDN18, a tight junction gene, and ARHGAP26, a gene encoding a RHOA inhibitor. Epithelial cell lines expressing CLDN18-ARHGAP26 displayed a dramatic loss of epithelial phenotype and long protrusions indicative of epithelial-mesenchymal transition (EMT). Fusion-positive cell lines showed impaired barrier properties, reduced cell-cell and cell-extracellular matrix adhesion, retarded wound healing, and inhibition of RHOA. Gain of invasion was seen in cancer cell lines expressing the fusion. Thus, CLDN18-ARHGAP26 mediates epithelial disintegration, possibly leading to stomach H(+) leakage, and the fusion might contribute to invasiveness once a cell is transformed.
Background: Gastric cancer is the second highest cause of global cancer mortality. To explore the complete repertoire of somatic alterations in gastric cancer, we combined massively parallel short read and DNA paired-end tag sequencing to present the first whole-genome analysis of two gastric adenocarcinomas, one with chromosomal instability and the other with microsatellite instability. Results: Integrative analysis and de novo assemblies revealed the architecture of a wild-type KRAS amplification, a common driver event in gastric cancer. We discovered three distinct mutational signatures in gastric cancer: against a genome-wide backdrop of oxidative and microsatellite instability-related mutational signatures, we identified the first exome-specific mutational signature. Further characterization of the impact of these signatures by combining sequencing data from 40 complete gastric cancer exomes and targeted screening of an additional 94 independent gastric tumors uncovered ACVR2A, RPL22 and LMAN1 as recurrently mutated genes in microsatellite instability-positive gastric cancer and PAPPA as a recurrently mutated gene in TP53 wild-type gastric cancer. Conclusions: These results highlight how whole-genome cancer sequencing can uncover information relevant to tissue-specific carcinogenesis that would otherwise be missed from exome-sequencing data.
To resolve relationships between members of the family Thermomonosporaceae, phylogenetic analyses using three sets of nucleotide sequences from 16S rDNA, 23S rDNA and the 16S-23S internal transcribed spacer (ITS) were carried out. Nearly all species of the family were included in this study. On the basis of congruous phylogenetic results and chemotaxonomic data, the following proposals are made. First, Actinomadura libanotica, Actinomadura aurantiaca, Actinomadura glomerata and Actinomadura longicatena are transferred to the genus Actinocorallia as Actinocorallia libanotica comb. nov., Actinocorallia aurantiaca comb. nov., Actinocorallia glomerata comb. nov. and Actinocorallia longicatena comb. nov., respectively. All the species of this genus are phylogenetically coherent and of phospholipid type PII (presence of phosphatidylethanolamine), distinguishing them from other Actinomadura species that are of phospholipid type PI (absence of diagnostic phospholipids). Second, Excellospora viridilutea is transferred to the genus Actinomadura as Actinomadura viridilutea comb. nov. As a result of the proposed transfers, the family Thermomonosporaceae now contains four genera Thermomonospora, Actinomadura, Actinocorallia and Spirillospora. The genus Actinocorallia and family Thermomonosporaceae are redescribed.
Delineating candidate genes at the chromosomal breakpoint regions in apparently balanced chromosome rearrangements (ABCR) has been shown to be more effective with the emergence of next-generation sequencing (NGS) technologies. We employed a large-insert (7–11 kb) paired-end tag sequencing technology (DNA-PET) to systematically analyze the genomes of four patients harbouring cytogenetically defined ABCR with neurodevelopmental symptoms, including developmental delay (DD) and speech disorders. We characterized structural variants (SVs) specific to each individual, including those matching the chromosomal breakpoints. Refinement of these regions by Sanger sequencing resulted in the identification of five disrupted genes in three individuals: guanine nucleotide binding protein, q polypeptide (GNAQ), RNA-binding protein, fox-1 homolog (RBFOX3), unc-5 homolog D (C. elegans) (UNC5D), transmembrane protein 47 (TMEM47), and X-linked inhibitor of apoptosis (XIAP). Among them, XIAP is the causative gene for the immunodeficiency phenotype seen in the patient. The remaining genes displayed specific expression in the fetal brain and have known biologically relevant functions in brain development, suggesting that they are putative candidate genes for neurodevelopmental phenotypes. This study demonstrates the application of NGS technologies in mapping individual gene disruptions in ABCR as a resource for deciphering candidate genes in human neurodevelopmental disorders (NDDs).
Structural variations (SVs) contribute significantly to the variability of the human genome and extensive genomic rearrangements are a hallmark of cancer. While genomic DNA paired-end-tag (DNA-PET) sequencing is an attractive approach to identify genomic SVs, the current application of PET sequencing with short insert size DNA can be insufficient for the comprehensive mapping of SVs in low complexity and repeat-rich genomic regions. We employed a recently developed procedure to generate PET sequencing data using large DNA inserts of 10–20 kb and compared their characteristics with short insert (1 kb) libraries for their ability to identify SVs. Our results suggest that although short insert libraries bear an advantage in identifying small deletions, they do not provide significantly better breakpoint resolution. In contrast, large inserts are superior to short inserts in providing higher physical genome coverage for the same sequencing cost and achieve greater sensitivity, in practice, for the identification of several classes of SVs, such as copy number neutral and complex events. Furthermore, our results confirm that large insert libraries allow for the identification of SVs within repetitive sequences, which cannot be spanned by short inserts. This provides a key advantage in studying rearrangements in cancer, and we show how it can be used in a fusion-point-guided-concatenation algorithm to study focally amplified regions in cancer.
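The claim that large inserts give higher physical coverage for the same sequencing effort can be illustrated with simple arithmetic. The sketch below is not taken from the cited studies: it assumes the common definition of physical coverage as (number of read pairs × insert size) / genome size, and the genome size, read-pair count, and function name are hypothetical values chosen only for illustration.

```python
def physical_coverage(n_pairs: int, insert_size_bp: int, genome_size_bp: int) -> float:
    """Average number of inserts spanning a genomic position.

    Assumes every pair maps uniquely and that inserts are uniformly
    distributed, which real libraries only approximate.
    """
    return n_pairs * insert_size_bp / genome_size_bp


# Hypothetical inputs (assumptions, not values from the abstracts above).
GENOME_SIZE_BP = 3_000_000_000   # rough size of a haploid human genome
N_PAIRS = 30_000_000             # same number of read pairs for both libraries

short_insert = physical_coverage(N_PAIRS, 1_000, GENOME_SIZE_BP)    # 1 kb library
large_insert = physical_coverage(N_PAIRS, 10_000, GENOME_SIZE_BP)   # 10 kb library

print(f"1 kb inserts:  {short_insert:.1f}x physical coverage")
print(f"10 kb inserts: {large_insert:.1f}x physical coverage")
```

Under these assumptions, physical coverage scales linearly with insert size, so a tenfold larger insert yields tenfold more spanning pairs per position for the same number of sequenced pairs. Sequence (base-level) coverage is unchanged, because it depends on read length rather than insert size.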