Pervasive and hidden transcription is widespread in eukaryotes, but its global level, the mechanisms from which it originates and its functional significance are unclear. Cryptic unstable transcripts (CUTs) were recently described as a principal class of RNA polymerase II transcripts in Saccharomyces cerevisiae. These transcripts are targeted for degradation immediately after synthesis by the action of the Nrd1-exosome-TRAMP complexes. Although CUT degradation mechanisms have been analysed in detail, the genome-wide distribution at the nucleotide resolution and the prevalence of CUTs are unknown. Here we report the first high-resolution genomic map of CUTs in yeast, revealing a class of potentially functional CUTs and the intrinsic bidirectional nature of eukaryotic promoters. An RNA fraction highly enriched in CUTs was analysed by a 3' Long-SAGE (serial analysis of gene expression) approach adapted to deep sequencing. The resulting detailed genomic map of CUTs revealed that they derive from extremely widespread and very well defined transcription units and do not result from unspecific transcriptional noise. Moreover, the transcription of CUTs predominantly arises within nucleosome-free regions, most of which correspond to promoter regions of bona fide genes. Some of the CUTs start upstream from messenger RNAs and overlap their 5' end. Our study of glycolysis genes, as well as recent results from the literature, indicate that such concurrent transcription is potentially associated with regulatory mechanisms. Our data reveal numerous new CUTs with such a potential regulatory role. However, most of the identified CUTs corresponded to transcripts divergent from the promoter regions of genes, indicating that they represent by-products of divergent transcription occurring at many and possibly most promoters. Eukaryotic promoter regions are thus intrinsically bidirectional, a fundamental property that escaped previous analyses because in most cases divergent transcription generates short-lived unstable transcripts present at very low steady-state levels.
Despite intense investigation, human replication origins and termini remain elusive. Existing data have shown strong discrepancies. Here we sequenced highly purified Okazaki fragments from two cell types and, for the first time, quantitated replication fork directionality and delineated initiation and termination zones genome-wide. Replication initiates stochastically, primarily within non-transcribed, broad (up to 150 kb) zones that often abut transcribed genes, and terminates dispersively between them. Replication fork progression is significantly co-oriented with the transcription. Initiation and termination zones are frequently contiguous, sometimes separated by regions of unidirectional replication. Initiation zones are enriched in open chromatin and enhancer marks, even when not flanked by genes, and often border ‘topologically associating domains' (TADs). Initiation zones are enriched in origin recognition complex (ORC)-binding sites and better align to origins previously mapped using bubble-trap than λ-exonuclease. This novel panorama of replication reveals how chromatin and transcription modulate the initiation process to create cell-type-specific replication programs.
Non-coding (nc)RNAs are key players in numerous biological processes such as gene regulation, chromatin domain formation and genome stability. Large ncRNAs interact with histone modifiers and are involved in cancer development, X-chromosome inactivation and autosomal gene imprinting. However, despite recent evidence showing that pervasive transcription is more widespread than previously thought, only a few examples mediating gene regulation in eukaryotes have been described. In Saccharomyces cerevisiae, the bona-fide regulatory ncRNAs are destabilized by the Xrn1 5'-3' RNA exonuclease (also known as Kem1), but the genome-wide characterization of the entire regulatory ncRNA family remains elusive. Here, using strand-specific RNA sequencing (RNA-seq), we identify a novel class of 1,658 Xrn1-sensitive unstable transcripts (XUTs) in which 66% are antisense to open reading frames. These transcripts are polyadenylated and RNA polymerase II (RNAPII)-dependent. The majority of XUTs strongly accumulate in lithium-containing media, indicating that they might have a role in adaptive responses to changes in growth conditions. Notably, RNAPII chromatin immunoprecipitation followed by DNA sequencing (ChIP-seq) analysis of Xrn1-deficient strains revealed a significant decrease of RNAPII occupancy over 273 genes with antisense XUTs. These genes show an unusual bias for H3K4me3 marks and require the Set1 histone H3 lysine 4 methyl-transferase for silencing. Furthermore, abolishing H3K4me3 triggers the silencing of other genes with antisense XUTs, supporting a model in which H3K4me3 antagonizes antisense ncRNA repressive activity. Our results demonstrate that antisense ncRNA-mediated regulation is a general regulatory pathway for gene expression in S. cerevisiae.
Long non-protein coding RNAs (npcRNA) represent an emerging class of riboregulators, which either act directly in this long form or are processed to shorter miRNA and siRNA. Genome-wide bioinformatic analysis of full-length cDNA databases identified 76 Arabidopsis npcRNAs. Fourteen npcRNAs were antisense to protein-coding mRNAs, suggesting cis-regulatory roles. Numerous 24-nt siRNA matched to five different npcRNAs, suggesting that these npcRNAs are precursors of this type of siRNA. Expression analyses of the 76 npcRNAs identified a novel npcRNA that accumulates in a dcl1 mutant but does not appear to produce trans-acting siRNA or miRNA. Additionally, another npcRNA was the precursor of miR869 and shown to be up-regulated in dcl4 but not in dcl1 mutants, indicative of a young miRNA gene. Abiotic stress altered the accumulation of 22 npcRNAs among the 76, a fraction significantly higher than that observed for the RNA binding protein-coding fraction of the transcriptome. Overexpression analyses in Arabidopsis identified two npcRNAs as regulators of root growth during salt stress and leaf morphology, respectively. Hence, together with small RNAs, long npcRNAs encompass a sensitive component of the transcriptome that have diverse roles during growth and differentiation.[Supplemental material is available online at www.genome.org.]Non-protein coding RNAs (npcRNAs) are a class of RNAs that do not encode proteins, but instead their function lies on the RNA molecule. They are a heterogeneous group and have been divided into different classes according to their length and function. With respect to length, npcRNAs can range from 20 to 27 nucleotides (nt) for the families of microRNAs (miRNAs) and small interfering RNAs (siRNAs), 20-300 nt for small RNAs commonly found as transcriptional and translational regulators, or up to and beyond 10,000 nt for medium and large RNAs involved in other processes, including splicing, gene inactivation, and translation (Costa 2007). We use the term non-protein-coding RNAs instead of noncoding RNAs as every sequence has the potential to be coding, and certain large npcRNAs might encode small oligopeptides, which could be translated under specific conditions as shown for a pentapeptide located inside rRNA, a canonical RNA in Escherichia coli (Tenson et al. 1996). In recent years, numerous novel npcRNA candidates have been identified in a variety of organisms from E. coli to Homo sapiens (Argaman et al. 2001;Storz et al. 2004;Washietl et al. 2005).Several strategies have been employed to detect and discover novel npcRNAs, including both experimental and computational screenings (Huttenhofer et al. 2002). Genomic approaches, such as tiling arrays and systematic sequencing of full-length cDNA libraries, in model organisms have recently revealed that much larger portions of eukaryote transcriptomes represent nonprotein-coding transcripts than previously believed (Okazaki et al. 2002;Numata et al. 2003;Rinn et al. 2003;Ota et al. 2004;Chekanova et al. 2007). Diverse npcRNAs, including a surpris...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.