Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly ‘housekeeping’, whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.
Gene expression profiling has uncovered the transcription factor Sox4 with upregulated activity during TGF-β-induced epithelial-mesenchymal transition (EMT) in normal and cancerous breast epithelial cells. Sox4 is indispensable for EMT and cell survival in vitro and for primary tumor growth and metastasis in vivo. Among several EMT-relevant genes, Sox4 directly regulates the expression of Ezh2, encoding the Polycomb group histone methyltransferase that trimethylates histone 3 lysine 27 (H3K27me3) for gene repression. Ablation of Ezh2 expression prevents EMT, whereas forced expression of Ezh2 restores EMT in Sox4-deficient cells. Ezh2-mediated H3K27me3 marks associate with key EMT genes, representing an epigenetic EMT signature that predicts patient survival. Our results identify Sox4 as a master regulator of EMT by governing the expression of the epigenetic modifier Ezh2.
Accurate reconstruction of the regulatory networks that control gene expression is one of the key current challenges in molecular biology. Although gene expression and chromatin state dynamics are ultimately encoded by constellations of binding sites recognized by regulators such as transcriptions factors (TFs) and microRNAs (miRNAs), our understanding of this regulatory code and its context-dependent read-out remains very limited. Given that there are thousands of potential regulators in mammals, it is not practical to use direct experimentation to identify which of these play a key role for a particular system of interest. We developed a methodology that models gene expression or chromatin modifications in terms of genome-wide predictions of regulatory sites and completely automated it into a web-based tool called ISMARA (Integrated System for Motif Activity Response Analysis). Given only gene expression or chromatin state data across a set of samples as input, ISMARA identifies the key TFs and miRNAs driving expression/chromatin changes and makes detailed predictions regarding their regulatory roles. These include predicted activities of the regulators across the samples, their genome-wide targets, enriched gene categories among the targets, and direct interactions between the regulators. Applying ISMARA to data sets from well-studied systems, we show that it consistently identifies known key regulators ab initio. We also present a number of novel predictions including regulatory interactions in innate immunity, a master regulator of mucociliary differentiation, TFs consistently disregulated in cancer, and TFs that mediate specific chromatin modifications.
Vertebrates have greatly elaborated the basic chordate body plan and evolved highly distinctive genomes that have been sculpted by two whole-genome duplications. Here we sequence the genome of the Mediterranean amphioxus ( Branchiostoma lanceolatum ) and characterize DNA methylation, chromatin accessibility, histone modifications and transcriptomes across multiple developmental stages and adult tissues to investigate the evolution of the regulation of the chordate genome. Comparisons with vertebrates identify an intermediate stage in the evolution of differentially methylated enhancers, and a high conservation of gene expression and its cis -regulatory logic between amphioxus and vertebrates that occurs maximally at an earlier mid-embryonic phylotypic period. We analyse regulatory evolution after whole-genome duplications, and find that—in vertebrates—over 80% of broadly expressed gene families with multiple paralogues derived from whole-genome duplications have members that restricted their ancestral expression, and underwent specialization rather than subfunctionalization. Counter-intuitively, paralogues that restricted their expression increased the complexity of their regulatory landscapes. These data pave the way for a better understanding of the regulatory principles that underlie key vertebrate innovations.
Using deep sequencing (deepCAGE), the FANTOM4 study measured the genome-wide dynamics of transcription-start-site usage in the human monocytic cell line THP-1 throughout a time course of growth arrest and differentiation. Modeling the expression dynamics in terms of predicted cis-regulatory sites, we identified the key transcription regulators, their time-dependent activities and target genes. Systematic siRNA knockdown of 52 transcription factors confirmed the roles of individual factors in the regulatory network. Our results indicate that cellular states are constrained by complex networks involving both positive and negative regulatory interactions among substantial numbers of transcription factors and that no single transcription factor is both necessary and sufficient to drive the differentiation process.
In obesity, white adipose tissue (WAT) inflammation is linked to insulin resistance. Increased adipocyte chemokine (C-C motif) ligand 2 (CCL2) secretion may initiate adipose inflammation by attracting the migration of inflammatory cells into the tissue. Using an unbiased approach, we identified adipose microRNAs (miRNAs) that are dysregulated in human obesity and assessed their possible role in controlling CCL2 production. In subcutaneous WAT obtained from 56 subjects, 11 miRNAs were present in all subjects and downregulated in obesity. Of these, 10 affected adipocyte CCL2 secretion in vitro and for 2 miRNAs (miR-126 and miR-193b), regulatory circuits were defined. While miR-126 bound directly to the 3′-untranslated region of CCL2 mRNA, miR-193b regulated CCL2 production indirectly through a network of transcription factors, many of which have been identified in other inflammatory conditions. In addition, overexpression of miR-193b and miR-126 in a human monocyte/macrophage cell line attenuated CCL2 production. The levels of the two miRNAs in subcutaneous WAT were significantly associated with CCL2 secretion (miR-193b) and expression of integrin, α-X, an inflammatory macrophage marker (miR-193b and miR-126). Taken together, our data suggest that miRNAs may be important regulators of adipose inflammation through their effects on CCL2 release from human adipocytes and macrophages.
The loss of HBII-52 and related C/D box small nucleolar RNA (snoRNA) expression units have been implicated as a cause for the Prader-Willi syndrome (PWS). We recently found that the C/D box snoRNA HBII-52 changes the alternative splicing of the serotonin receptor 2C pre-mRNA, which is different from the traditional C/D box snoRNA function in non-mRNA methylation. Using bioinformatic predictions and experimental verification, we identified five pre-mRNAs (DPM2, TAF1, RALGPS1, PBRM1 and CRHR1) containing alternative exons that are regulated by MBII-52, the mouse homolog of HBII-52. Analysis of a single member of the MBII-52 cluster of snoRNAs by RNase protection and northern blot analysis shows that the MBII-52 expressing unit generates shorter RNAs that originate from the full-length MBII-52 snoRNA through additional processing steps. These novel RNAs associate with hnRNPs and not with proteins associated with canonical C/D box snoRNAs. Our data indicate that not a traditional C/D box snoRNA MBII-52, but a processed version lacking the snoRNA stem is the predominant MBII-52 RNA missing in PWS. This processed snoRNA functions in alternative splice-site selection. Its substitution could be a therapeutic principle for PWS.
New cancer therapies are likely to arise from an in-depth understanding of the signaling networks influencing tumor initiation, progression and metastasis. We show a fundamental role for Src-homology 2 domain-containing phosphatase 2 (SHP2) in these processes in human epidermal growth factor receptor 2 (HER2)-positive and triple-negative breast cancers. Knockdown of SHP2 eradicated breast tumor-initiating cells in xenograft models, and SHP2 depletion also prevented invasion in three-dimensional cultures and in a transductal invasion assay in vivo. Notably, SHP2 knockdown in established breast tumors blocked their growth and reduced metastasis. Mechanistically, SHP2 activated stemness-associated transcription factors, including v-myc myelocytomatosis viral oncogene homolog (c-Myc) and zinc finger E-box binding homeobox 1 (ZEB1), which resulted in the repression of let-7 microRNA and the expression of a set of 'SHP2 signature' genes. We found these genes to be simultaneously activated in a large subset of human primary breast tumors that are associated with invasive behavior and poor prognosis. These results provide new insights into the signaling cascades influencing tumor-initiating cells as well as a rationale for targeting SHP2 in breast cancer.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.