The genome is extensively transcribed into long intergenic noncoding RNAs (lincRNAs), many of which are implicated in gene silencing1,2. Potential roles of lincRNAs in gene activation are much less understood3,4,5. Development and homeostasis require coordinate regulation of neighboring genes through a process termed locus control6. Some locus control elements and enhancers transcribe lincRNAs7,8,9,10, hinting at possible roles in long range control. In vertebrates, 39 Hox genes, encoding homeodomain transcription factors critical for positional identity, are clustered in four chromosomal loci; the Hox genes are expressed in nested anterior-posterior and proximal-distal patterns co-linear with their genomic position from 3′ to 5′of the cluster11. Here we identify HOTTIP, a lincRNA transcribed from the 5′ tip of the HOXA locus that coordinates the activation of multiple 5′ HOXA genes in vivo. Chromosomal looping brings HOTTIP into close proximity to its target genes. HOTTIP directly binds the adaptor protein WDR5 and targets WDR5/MLL complexes across HOXA, driving histone H3 lysine 4 trimethylation and gene transcription. Induced proximity is necessary and sufficient for HOTTIP activation of its target genes. Thus, by serving as key intermediates that transmit information from higher order chromosomal looping into chromatin modifications, lincRNAs may organize chromatin domains to coordinate long-range gene activation.
Recruitment of the RNA Polymerase II (Pol II) transcription initiation apparatus to promoters by specific DNA binding transcription factors is well recognized as a key regulatory step in gene expression. We report here that promoter-proximal pausing is a general feature of transcription by Pol II in mammalian cells, and thus an additional step where regulation of gene expression occurs. This suggests that some transcription factors recruit the transcription apparatus to promoters, while others effect promoter-proximal pause release. Indeed, we find that the transcription factor c-Myc, a key regulator of cellular proliferation, plays a major role in Pol II pause release rather than Pol II recruitment at its target genes. We discuss the implications of these results for the role of c-Myc amplification in human cancer.
SUMMARY N6-methyl-adenosine (m6A) is the most abundant modification on messenger RNAs and is linked to human diseases, but its functions in mammalian development are poorly understood. Here we reveal the evolutionary conservation and function of m6A by mapping the m6A methylome in mouse and human embryonic stem cells. Thousands of messenger and long noncoding RNAs show conserved m6A modification, including transcripts encoding core pluripotency transcription factors. m6A is enriched over 3′ untranslated regions at defined sequence motifs, and marks unstable transcripts, including transcripts turned over upon differentiation. Genetic inactivation or depletion of mouse and human Mettl3, one of the m6A methylases, led to m6A erasure on select target genes, prolonged Nanog expression upon differentiation, and impaired ESC’s exit from self-renewal towards differentiation into several lineages in vitro and in vivo. Thus, m6A is a mark of transcriptome flexibility required for stem cells to differentiate to specific lineages.
Summary Noncoding RNAs (ncRNAs) function with associated proteins to effect complex structural and regulatory outcomes. To reveal the composition and dynamics of specific noncoding RNA- protein complexes (RNPs) in vivo, we developed comprehensive identification of RNA-binding proteins by mass spectrometry (ChIRP-MS). ChIRP-MS analysis of four ncRNAs captures key protein interactors, including a U1-specific link to the 3′ RNA processing machinery. Xist, an essential lncRNA for X-chromosome inactivation (XCI), interacts with 81 proteins from chromatin modification, nuclear matrix, and RNA remodeling pathways. The Xist RNA-protein particle assembles in two steps coupled with the transition from pluripotency to differentiation. Specific interactors include HnrnpK that participates in Xist-mediated gene silencing and histone modifications, but not Xist localization and Drosophila Split ends homolog Spen that interacts via the A-repeat domain of Xist and is required for gene silencing. Thus, Xist lncRNA engages with proteins in a modular and developmentally controlled manner to coordinate chromatin spreading and silencing.
Visualizing the physical basis for molecular behavior inside living cells is a grand challenge in biology. RNAs are central to biological regulation, and RNA’s ability to adopt specific structures intimately controls every step of the gene expression program1. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles view only two of four nucleotides that make up RNA2,3. Here we present a novel biochemical approach, In Vivo Click SHAPE (icSHAPE), that enables the first global view of RNA secondary structures of all four bases in living cells. icSHAPE of mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguishes different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA binding proteins or RNA modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N6-methyladenosine (m6A) modification genome-wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression.
Genome conformation is central to gene control but challenging to interrogate. Here we present HiChIP, a protein-centric chromatin conformation method. HiChIP improves the yield of conformation-informative reads by over 10-fold and lowers input requirement over 100-fold relative to ChIA-PET. HiChIP of cohesin reveals multi-scale genome architecture with greater signal to background than in situ Hi-C. Thus, HiChIP adds to the toolbox of 3D genome structure and regulation for diverse biomedical applications.
Transcription initiation by RNA polymerase II (RNAPII) is thought to occur unidirectionally from most genes. Here, we present evidence of widespread divergent transcription at protein-encoding gene promoters. Transcription start site-associated RNAs (TSSa-RNAs) nonrandomly flank active promoters, with peaks of antisense and sense short RNAs at 250 nucleotides upstream and 50 nucleotides downstream of TSSs, respectively. Northern analysis shows that TSSa-RNAs are subsets of an RNA population 20 to 90 nucleotides in length. Promoter-associated RNAPII and H3K4-trimethylated histones, transcription initiation hallmarks, colocalize at sense and antisense TSSa-RNA positions; however, H3K79-dimethylated histones, characteristic of elongating RNAPII, are only present downstream of TSSs. These results suggest that divergent transcription over short distances is common for active promoters and may help promoter regions maintain a state poised for subsequent regulation.
Several of the thousands of human long non-coding RNAs (lncRNAs) have been functionally characterized1–4; however, potential roles for lncRNAs in somatic tissue differentiation remain poorly understood. Here we show that a 3.7-kilobase lncRNA, terminal differentiation-induced ncRNA (TINCR), controls human epidermal differentiation by a post-transcriptional mechanism. TINCR is required for high messenger RNA abundance of key differentiation genes, many of which are mutated in human skin diseases, including FLG, LOR, ALOXE3, ALOX12B, ABCA12, CASP14 and ELOVL3. TINCR-deficient epidermis lacked terminal differentiation ultrastructure, including keratohyalin granules and intact lamellar bodies. Genome-scale RNA interactome analysis revealed that TINCR interacts with a range of differentiation mRNAs. TINCR–mRNA interaction occurs through a 25-nucleotide ‘TINCR box’ motif that is strongly enriched in interacting mRNAs and required for TINCR binding. A high-throughput screen to analyse TINCR binding capacity to approximately 9,400 human recombinant proteins revealed direct binding of TINCR RNA to the staufen1 (STAU1) protein. STAU1-deficient tissue recapitulated the impaired differentiation seen with TINCR depletion. Loss of UPF1 and UPF2, both of which are required for STAU1-mediated RNA decay, however, did not have differentiation effects. Instead, the TINCR–STAU1 complex seems to mediate stabilization of differentiation mRNAs, such as KRT80. These data identify TINCR as a key lncRNA required for somatic tissue differentiation, which occurs through lncRNA binding to differentiation mRNAs to ensure their expression.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.