Gene regulation requires selective targeting of DNA regulatory enhancers over megabase distances. Here we show that Evf2, a cloud-forming Dlx5/6 ultraconserved enhancer (UCE) lncRNA, simultaneously localizes to activated (Umad1, 1.6 Mb distant) and repressed (Akr1b8, 27 Mb distant) chr6 target genes, precisely regulating UCE-gene distances and cohesin binding in mouse embryonic forebrain GABAergic interneurons (INs). Transgene expression of Evf2 activates Lsm8 (12 Mb distant) but fails to repress Akr1b8, supporting trans activation and long-range cis repression. Through both short-range (Dlx6 antisense) and long-range (Akr1b8) repression, the Evf2-5'UCE links homeodomain and mevalonate pathway-regulated enhancers to IN diversity. The Evf2-3' end is required for long-range activation but dispensable for RNA cloud localization, functionally dividing the RNA into 3'-activator and 5'UCE repressor and targeting regions. Together, these results support that Evf2 selectively regulates UCE interactions with multi-megabase distant genes through complex effects on chromosome topology, linking lncRNA-dependent topological and transcriptional control with interneuron diversity and seizure susceptibility.
Characterization of intrinsically disordered proteins (IDPs) has grown tremendously over the past two decades. NMR-based structural characterization has been widely embraced by the IDP community, largely because this technique is amenable to highly flexible biomolecules. Particularly, carbon-detect nuclear magnetic resonance (NMR) experiments provide a straight forward and expedient method for completing backbone assignments, thus providing the framework to study the structural and dynamic properties of IDPs. However, these experiments remain unfamiliar to most NMR spectroscopists, thus limiting the breadth of their application. In an effort to remove barriers that may prevent the application of carbon-detected bio-NMR where it has the potential to benefit investigators, here we describe the experimental requirements to collect a robust set of carbon-detected NMR data for complete backbone assignment of IDPs. Specifically, we advocate the use of threedimensional experiments that exploit magnetization transfer pathways initiated on the aliphatic protons, which produces increased sensitivity and provides a suitable method for IDPs that are only soluble in basic pH conditions (>7.5). The applicability of this strategy to systems featuring a high degree of proline content will also be discussed.
There is an extraordinary need to describe the structures of intrinsically disordered proteins (IDPs) due to their role in various biological processes involved in signaling and transcription. However, general study of IDPs by NMR spectroscopy is limited by the poor 1H-amide chemical shift dispersion typically observed in their spectra. Recently, 13C direct-detected NMR spectroscopy has been recognized as enabling broad structural study of IDPs. Most notably, multi-dimensional experiments based on the 15N,13C-CON spectrum make complete chemical shift assignment feasible. Here we document a collection of NMR based tools that efficiently lead to chemical shift assignment of IDPs, motivated by a case study of the C-terminal disordered region from the human pancreatic transcription factor Pdx1. Our strategy builds on the combination of two 3D experiments, (HN-flip)N(CA)CON and 3D (HN-flip)N(CA)NCO, that enable daisy-chain connections to be built along the IDP backbone, facilitated by acquisition of amino-acid specific 15N,13C-CON detected experiments. Assignments are completed through carbon-detected, TOCSY based side chain chemical shift measurement. Conducting our study required producing valuable modifications to many previously published pulse sequences, motivating us to announce the creation of a database of our pulse programs, which we make freely available through the web.
The Evf2 long non-coding RNA directs Dlx5/6 ultraconserved enhancer(UCE)-intrachromosomal interactions, regulating genes across a 27Mb region on chr6 in mouse developing forebrain. Here, we show that Evf2 long-range gene repression occurs through multi-step mechanisms involving the transcription factor Sox2. Evf2 directly interacts with Sox2, antagonizing Sox2 activation of Dlx5/6UCE, and recruits Sox2 to the Dlx5/6eii shadow enhancer and key Dlx5/6UCE interaction sites. Sox2 directly interacts with Dlx1 and Smarca4, as part of the Evf2 ribonucleoprotein complex, forming spherical subnuclear domains (protein pools, PPs). Evf2 targets Sox2 PPs to one long-range repressed target gene (Rbm28), at the expense of another (Akr1b8).Evf2 and Sox2 shift Dlx5/6UCE interactions towards Rbm28, linking Evf2/Sox2 co-regulated topological control and gene repression. We propose a model that distinguishes Evf2 gene repression mechanisms at Rbm28 (Dlx5/6UCE position) and Akr1b8 (limited Sox2 availability). Genome-wide control of RNPs (Sox2, Dlx and Smarca4) shows that co-recruitment influences Sox2 DNA binding. Together, these data suggest that Evf2 organizes a Sox2 PP subnuclear domain, and through Sox2-RNP sequestration and recruitment, regulates chr6 long-range UCE targeting and activity with genome-wide consequences.
In adult mammals, the production of insulin and other peptide hormones, such as the islet amyloid polypeptide (IAPP), is limited to β-cells due to tissue-specific expression of a set of transcription factors, the best known of which is Pdx1. Like many homeodomain transcription factors, Pdx1 binds to a core DNA recognition sequence containing the tetranucleotide 5´-TAAT-3´; its consensus recognition element is 5´-CTCTAAT(T/G)AG-3´. Currently, a complete thermodynamic profile of Pdx1 binding to near-consensus and native promoter sequences has not been established, obscuring the mechanism of target site selection by this critical transcription factor. Strikingly, while Pdx1 responsive elements in the human insulin promoter conform to the pentanucleotide 5´-CTAAT-3´ sequence, the Pdx1 responsive elements in the human iapp promoter all contain a substitution to 5-TTAAT-3´. The crystal structure of Pdx1 bound to the consensus nucleotide sequence does not explain how Pdx1 identifies this natural variation, if it does at all. Here we report a combination of isothermal calorimetric titrations, NMR spectroscopy, and extensive multi-microsecond molecular dynamics calculations of Pdx1 that define its interactions with a panel of natural promoter elements and consensus-derived sequences. Our results show a small preference of Pdx1 for a Cyt base 5´ relative to the core TAAT promoter element. Molecular mechanics calculations, corroborated by experimental NMR data, lead to a rational explanation for sequence discrimination at this position. Taken together, our results suggest a molecular mechanism for differential Pdx1 affinity to elements from the insulin and iapp promoter sequences.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.