Cell lines were not tested for mycoplasma contamination. Commonly misidentified lines (See ICLAC register) No commonly misidentified cell lines were used.
Structural variants (SVs) can contribute to oncogenesis through a variety of mechanisms. Despite their importance, the identification of SVs in cancer genomes remains challenging. Here, we present a framework that integrates optical mapping, high-throughput chromosome conformation capture (Hi-C), and whole-genome sequencing to systematically detect SVs in a variety of normal or cancer samples and cell lines. We identify the unique strengths of each method and demonstrate that only integrative approaches can comprehensively identify SVs in the genome. By combining Hi-C and optical mapping, we resolve complex SVs and phase multiple SV events to a single haplotype. Furthermore, we observe widespread structural variation events affecting the functions of noncoding sequences, including the deletion of distal regulatory sequences, alteration of DNA replication timing, and the creation of novel three-dimensional chromatin structural domains. Our results indicate that noncoding SVs may be underappreciated mutational drivers in cancer genomes.
Duplication of the genome in mammalian cells occurs in a defined temporal order referred to as its replication-timing (RT) program. RT changes dynamically during development, regulated in units of 400-800 kb referred to as replication domains (RDs). Changes in RT are generally coordinated with transcriptional competence and changes in subnuclear position. We generated genome-wide RT profiles for 26 distinct human cell types, including embryonic stem cell (hESC)-derived, primary cells and established cell lines representing intermediate stages of endoderm, mesoderm, ectoderm, and neural crest (NC) development. We identified clusters of RDs that replicate at unique times in each stage (RT signatures) and confirmed global consolidation of the genome into larger synchronously replicating segments during differentiation. Surprisingly, transcriptome data revealed that the well-accepted correlation between early replication and transcriptional activity was restricted to RT-constitutive genes, whereas two-thirds of the genes that switched RT during differentiation were strongly expressed when late replicating in one or more cell types. Closer inspection revealed that transcription of this class of genes was frequently restricted to the lineage in which the RT switch occurred, but was induced prior to a late-to-early RT switch and/or down-regulated after an early-to-late RT switch. Analysis of transcriptional regulatory networks showed that this class of genes contains strong regulators of genes that were only expressed when early replicating. These results provide intriguing new insight into the complex relationship between transcription and RT regulation during human development.
Graphical Abstract Highlights d Early replicating control elements (ERCEs) regulate replication timing d ERCEs regulate A/B compartmentalization and TAD architecture d ERCEs form CTCF-independent loops and have features of enhancer/promoters d ERCEs enable genetic dissection of large-scale chromosome structure and function SUMMARYThe temporal order of DNA replication (replication timing [RT]) is highly coupled with genome architecture, but cis-elements regulating either remain elusive. We created a series of CRISPR-mediated deletions and inversions of a pluripotency-associated topologically associating domain (TAD) in mouse ESCs. CTCF-associated domain boundaries were dispensable for RT. CTCF protein depletion weakened most TAD boundaries but had no effect on RT or A/B compartmentalization genome-wide. By contrast, deletion of three intra-TAD CTCF-independent 3D contact sites caused a domain-wide earlyto-late RT shift, an A-to-B compartment switch, weakening of TAD architecture, and loss of transcription. The dispensability of TAD boundaries and the necessity of these ''early replication control elements'' (ERCEs) was validated by deletions and inversions at additional domains. Our results demonstrate that discrete cis-regulatory elements orchestrate domain-wide RT, A/B compartmentalization, TAD architecture, and transcription, revealing fundamental principles linking genome structure and function.
Highlights d Deep multi-omics characterization of replicative and oncogene-induced senescence d Senescence-associated heterochromatin domains (SAHDs) form SAHFs via 3D changes d DNMT1 is required for SAHF formation via regulation of HMGA2 expression d SAHF formation leads to expression of SAHF-adjacent genes via 3D chromatin contacts
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.