Elucidating the human transcriptional regulatory network is a challenge of the post-genomic era. Technical progress so far is impressive, including detailed understanding of regulatory mechanisms for at least a few genes in multicellular organisms, rapid and precise localization of regulatory regions within extensive regions of DNA by means of cross-species comparison, and de novo determination of transcription-factor binding specificities from large-scale yeast expression data. Here we address two problems involved in extending these results to the human genome: first, it has been unclear how many model organism genomes will be needed to delineate most regulatory regions; and second, the discovery of transcription-factor binding sites (response elements) from expression data has not yet been generalized from single-celled organisms to multicellular organisms. We found that 98% (74/75) of experimentally defined sequence-specific binding sites of skeletal-muscle-specific transcription factors are confined to the 19% of human sequences that are most conserved in the orthologous rodent sequences. Also we found that in using this restriction, the binding specificities of all three major muscle-specific transcription factors (MYF, SRF and MEF2) can be computationally identified.
The vast majority of annotated transcripts in bacteria are mRNAs. Here we identify ~1,000 antisense transcripts in the model bacterium Escherichia coli. We propose that these transcripts are generated by promiscuous transcription initiation within genes and that many of them regulate expression of the overlapping gene.
Summary
Genome-wide studies have identified abundant small, non-coding RNAs including snRNAs, snoRNAs, cryptic unstable transcripts (CUTs), and upstream regulatory RNAs (uRNAs) that are transcribed by RNA polymerase II (pol II) and terminated by a Nrd1-dependent pathway. Here, we show that the prolyl isomerase, Ess1, is required for Nrd1-dependent termination of ncRNAs. Ess1 binds the carboxy terminal domain (CTD) of pol II and is thought to regulate transcription by conformational isomerization of Ser-Pro bonds within the CTD. In ess1 mutants, expression of ∼10% of the genome was altered, due primarily to defects in termination of snoRNAs, CUTs, SUTs and uRNAs. Ess1 promoted dephosphorylation of Ser5 (but not Ser2) within the CTD, most likely by the Ssu72 phosphatase, and we provide evidence for a competition between Nrd1 and Pcf11 for CTD-binding that is regulated by Ess1-dependent isomerization. This is the first example of a prolyl isomerase required for interpreting the “CTD code.”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.