Background Type II DNA topoisomerases (TOP2) regulate DNA topology by generating transient double stranded breaks during replication and transcription. Topoisomerase II beta (TOP2B) facilitates rapid gene expression and functions at the later stages of development and differentiation. To gain new insight into the genome biology of TOP2B, we used proteomics (BioID), chromatin immunoprecipitation, and high-throughput chromosome conformation capture (Hi-C) to identify novel proximal TOP2B protein interactions and characterize the genomic landscape of TOP2B binding at base pair resolution. Results Our human TOP2B proximal protein interaction network included members of the cohesin complex and nucleolar proteins associated with rDNA biology. TOP2B associates with DNase I hypersensitivity sites, allele-specific transcription factor (TF) binding, and evolutionarily conserved TF binding sites on the mouse genome. Approximately half of all CTCF/cohesion-bound regions coincided with TOP2B binding. Base pair resolution ChIP-exo mapping of TOP2B, CTCF, and cohesin sites revealed a striking structural ordering of these proteins along the genome relative to the CTCF motif. These ordered TOP2B-CTCF-cohesin sites flank the boundaries of topologically associating domains (TADs) with TOP2B positioned externally and cohesin internally to the domain loop. Conclusions TOP2B is positioned to solve topological problems at diverse cis-regulatory elements and its occupancy is a highly ordered and prevalent feature of CTCF/cohesin binding sites that flank TADs. Electronic supplementary material The online version of this article (doi:10.1186/s13059-016-1043-8) contains supplementary material, which is available to authorized users.
CTG repeat expansions in DMPK cause myotonic dystrophy (DM1) with a continuum of severity and ages of onset. Congenital DM1 (CDM1), the most severe form, presents distinct clinical features, large expansions, and almost exclusive maternal transmission. The correlation between CDM1 and expansion size is not absolute, suggesting contributions of other factors. We determined CpG methylation flanking the CTG repeat in 79 blood samples from 20 CDM1-affected individuals; 21, 27, and 11 individuals with DM1 but not CDM1 (henceforth non-CDM1) with maternal, paternal, and unknown inheritance; and collections of maternally and paternally derived chorionic villus samples (7 CVSs) and human embryonic stem cells (4 hESCs). All but two CDM1-affected individuals showed high levels of methylation upstream and downstream of the repeat, greater than non-CDM1 individuals (p = 7.04958 × 10). Most non-CDM1 individuals were devoid of methylation, where one in six showed downstream methylation. Only two non-CDM1 individuals showed upstream methylation, and these were maternally derived childhood onset, suggesting a continuum of methylation with age of onset. Only maternally derived hESCs and CVSs showed upstream methylation. In contrast, paternally derived samples (27 blood samples, 3 CVSs, and 2 hESCs) never showed upstream methylation. CTG tract length did not strictly correlate with CDM1 or methylation. Thus, methylation patterns flanking the CTG repeat are stronger indicators of CDM1 than repeat size. Spermatogonia with upstream methylation may not survive due to methylation-induced reduced expression of the adjacent SIX5, thereby protecting DM1-affected fathers from having CDM1-affected children. Thus, DMPK methylation may account for the maternal bias for CDM1 transmission, larger maternal CTG expansions, age of onset, and clinical continuum, and may serve as a diagnostic indicator.
Gene expression is a combinatorial function of genetic/epigenetic factors such as copy number variation (CNV), DNA methylation (DM), transcription factors (TF) occupancy, and microRNA (miRNA) post-transcriptional regulation. At the maturity of microarray/sequencing technologies, large amounts of data measuring the genome-wide signals of those factors became available from Encyclopedia of DNA Elements (ENCODE) and The Cancer Genome Atlas (TCGA). However, there is a lack of an integrative model to take full advantage of these rich yet heterogeneous data. To this end, we developed RACER (Regression Analysis of Combined Expression Regulation), which fits the mRNA expression as response using as explanatory variables, the TF data from ENCODE, and CNV, DM, miRNA expression signals from TCGA. Briefly, RACER first infers the sample-specific regulatory activities by TFs and miRNAs, which are then used as inputs to infer specific TF/miRNA-gene interactions. Such a two-stage regression framework circumvents a common difficulty in integrating ENCODE data measured in generic cell-line with the sample-specific TCGA measurements. As a case study, we integrated Acute Myeloid Leukemia (AML) data from TCGA and the related TF binding data measured in K562 from ENCODE. As a proof-of-concept, we first verified our model formalism by 10-fold cross-validation on predicting gene expression. We next evaluated RACER on recovering known regulatory interactions, and demonstrated its superior statistical power over existing methods in detecting known miRNA/TF targets. Additionally, we developed a feature selection procedure, which identified 18 regulators, whose activities clustered consistently with cytogenetic risk groups. One of the selected regulators is miR-548p, whose inferred targets were significantly enriched for leukemia-related pathway, implicating its novel role in AML pathogenesis. Moreover, survival analysis using the inferred activities identified C-Fos as a potential AML prognostic marker. Together, we provided a novel framework that successfully integrated the TCGA and ENCODE data in revealing AML-specific regulatory program at global level.
A comprehensive catalogue of the mutations that drive tumorigenesis and progression is essential to understanding tumor biology and developing therapies. Protein-coding driver mutations have been well-characterized by large exome-sequencing studies, however many tumors have no mutations in protein-coding driver genes. Non-coding mutations are thought to explain many of these cases, however few non-coding drivers besides TERT promoter are known. To fill this gap, we analyzed 150,000 cis-regulatory regions in 1,844 whole cancer genomes from the ICGC-TCGA PCAWG project. Using our new method, ActiveDriverWGS, we found 41 frequently mutated regulatory elements (FMREs) enriched in non-coding SNVs and indels (FDR<0.05) characterized by aging-associated mutation signatures and frequent structural variants. Most FMREs are distal from genes, reported here for the first time and also recovered by additional driver discovery methods. FMREs were enriched in super-enhancers, H3K27ac enhancer marks of primary tumors and long-range chromatin interactions, suggesting that the mutations drive cancer by distally controlling gene expression through threedimensional genome organization. In support of this hypothesis, the chromatin interaction network of FMREs and target genes revealed associations of mutations and differential gene expression of known and novel cancer genes (e.g., CNNB1IP1, RCC1), activation of immune response pathways and altered enhancer marks. Thus distal genomic regions may include additional, infrequently mutated drivers that act on target genes via chromatin loops. Our study is an important step towards finding such regulatory regions and deciphering the somatic mutation landscape of the non-coding genome..
The precise genetic cause remains elusive in nearly 50% of patients with presumed neurogenetic disease, representing a significant barrier for clinical care. This is despite significant advances in clinical genetic diagnostics, including the application of whole‐exome sequencing and next‐generation sequencing‐based gene panels. In this study, we identify a deep intronic mutation in the DMD gene in a patient with muscular dystrophy using both conventional and RNAseq‐based transcriptome analyses. The implications of our data are that noncoding mutations likely comprise an important source of unresolved genetic disease and that RNAseq is a powerful platform for detecting such mutations.
The regulatory elements controlling gene expression during acute inflammation are not fully elucidated. Here we report the identification of a set of NF-κB-bound elements and common chromatin landscapes underlying the acute inflammatory response across cell-types and mammalian species. Using primary vascular endothelial cells (human/mouse/bovine) treated with the pro−inflammatory cytokine, Tumor Necrosis Factor-α, we identify extensive (~30%) conserved orthologous binding of NF-κB to accessible, as well as nucleosome-occluded chromatin. Regions with the highest NF-κB occupancy pre-stimulation show dramatic increases in NF-κB binding and chromatin accessibility post-stimulation. These ‘pre-bound’ regions are typically conserved (~56%), contain multiple NF-κB motifs, are utilized by diverse cell types, and overlap rare non-coding mutations and common genetic variation associated with both inflammatory and cardiovascular phenotypes. Genetic ablation of conserved, ‘pre-bound’ NF-κB regions within the super-enhancer associated with the chemokine-encoding CCL2 gene and elsewhere supports the functional relevance of these elements.
Human embryonic stem cell-derived beta cells offer a promising cell-based therapy for diabetes. However, efficient stem cell to beta cell differentiation has proven difficult, possibly due to the lack of cross-talk with the appropriate mesenchymal niche. To define organ-specific niche signals, we isolated pancreatic and gastrointestinal stromal cells, and analyzed their gene expression during development. Our genetic studies reveal the importance of tightly regulated Hedgehog signaling in the pancreatic mesenchyme: inactivation of mesenchymal signaling leads to annular pancreas, whereas stroma-specific activation of signaling via loss of Hedgehog regulators, Sufu and Spop, impairs pancreatic growth and beta cell genesis. Genetic rescue and transcriptome analyses show that these Sufu and Spop knockout defects occur through Gli2-mediated activation of gastrointestinal stromal signals such as Wnt ligands. Importantly, inhibition of Wnt signaling in organoid and human stem cell cultures significantly promotes insulin-producing cell generation, altogether revealing the requirement for organ-specific regulation of stromal niche signals.
Quebec Platelet disorder (QPD) is a unique bleeding disorder that markedly increases urokinase plasminogen activator (uPA) in megakaryocytes and platelets but not in plasma or urine. The cause is tandem duplication of a 78 kb region of chromosome 10 containing PLAU (the uPA gene) and C10orf55, a gene of unknown function. QPD increases uPA in platelets and megakaryocytes >100 fold, far more than expected for a gene duplication. To investigate the tissue-specific effect that PLAU duplication has on gene expression and transcript structure in QPD, we tested if QPD leads to: 1) overexpression of normal or unique PLAU transcripts; 2) increased uPA in leukocytes; 3) altered levels of C10orf55 mRNA and/or protein in megakaryocytes and leukocytes; and 4) global changes in megakaryocyte gene expression. Primary cells and cultured megakaryocytes from donors were prepared for quantitative reverse polymerase chain reaction analyses, RNA-seq and protein expression analyses. Rapidly isolated blood leukocytes from QPD subjects showed only a 3.9 fold increase in PLAU transcript levels, in keeping with the normal to minimally increased uPA in affinity purified, QPD leukocytes. All subjects had more uPA in granulocytes than monocytes and minimal uPA in lymphocytes. QPD leukocytes expressed PLAU alleles in proportions consistent with an extra copy of PLAU on the disease chromosome, unlike QPD megakaryocytes. QPD PLAU transcripts were consistent with reference gene models, with a much higher proportion of reads originating from the disease chromosome in megakaryocytes than granulocytes. QPD and control megakaryocytes contained minimal reads for C10orf55, and C10orf55 protein was not increased in QPD megakaryocytes or platelets. Finally, our QPD megakaryocyte transcriptome analysis revealed a global down regulation of the interferon type 1 pathway. We suggest that the low endogenous levels of uPA in blood are actively regulated, and that the regulatory mechanisms are disrupted in QPD in a megakaryocyte-specific manner.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.