Long non-coding RNAs (lncRNAs) are a diverse class of RNAs that engage in numerous biological processes across every branch of life. Although initially discovered as mRNA-like transcripts that do not encode proteins, recent studies have revealed features of lncRNAs that further distinguish them from mRNAs. In this Review, we describe special events in the lifetimes of lncRNAs - before, during and after transcription - and discuss how these events ultimately shape the unique characteristics and functional roles of lncRNAs.
A transcribed, multi-channel, and continuously evolving molecular recorder 77 To achieve our goal of a tunable, high information content molecular recorder, we 78 utilized Cas9 to generate insertions or deletions (indels) upon repair of double-stranded breaks, 79 which are inherited in the next generation of cells 11-16. We record within a 205 base pair, synthetic DNA "target site" containing three "cut sites" and a static 8 base pair "integration barcode" (intBC), which are delivered in multiple copies via piggyBac transposition (Fig. 1a, b). We embedded this sequence into the 3'UTR of a constitutively transcribed fluorescent protein to enable profiling from the transcriptome. A second cassette encodes three independently transcribed and complementary guide RNAs to permit recording of multiple, distinct signals (Fig. 1a, b) 18. Our system is capable of high information storage due to the diversity of heritable repair outcomes, and the large number of targeted sites, which can be distinguished by the intBC (Fig. 1c). DNA repair generates hundreds of unique indels, and the distribution for each cut site is different and nonuniform: some produce highly biased outcomes while others create a diverse series (Fig. 1c, Extended Data Fig. 1) 19-21. To identify sequences that can tune the mutation rate of our recorder for timescales that are not pre-defined, and may extend from days to months, we screened several guide RNA series containing mismatches to their targets 22 by monitoring their activity on a GFP reporter over a 20-day timecourse and selected those that demonstrated a broad dynamic range (Fig. 1d). Slower cutting rates may improve viability in vivo, as frequent Cas9mediated double-strand breaks can cause cellular toxicity 23,24. To demonstrate information recovery from single cell transcriptomes, we stably transduced K562 cells with our technology and generated a primary, cell-barcoded cDNA pool via the 10x Genomics platform, allowing us * * *
Long noncoding RNAs are key regulators of chromatin states for important biological processes such as dosage compensation, imprinting, and developmental gene expression 1,2,3,4,5,6,7 . The recent discovery of thousands of lncRNAs in association with specific chromatin modification complexes, such as Polycomb Repressive Complex 2 (PRC2) that mediates histone H3 lysine 27 trimethylation (H3K27me3), suggests broad roles for numerous lncRNAs in managing chromatin states in a gene-specific fashion 8,9 . While some lncRNAs are thought to work in cis on neighboring genes, other lncRNAs work in trans to regulate distantly located genes. For instance, Drosophila lncRNAs roX1 and roX2 bind numerous regions on the X chromosome of male cells, and are critical for dosage compensation 10,11 . However, the exact locations of their binding sites are not known at high resolution. Similarly, human lncRNA HOTAIR can affect PRC2 occupancy on hundreds of genes genomewide 3,12,13 , but how specificity is achieved is unclear. LncRNAs can also serve as modular scaffolds to recruit the assembly of multiple protein complexes. The classic trans-acting RNA scaffold is the TERC RNA that serves as the template and scaffold for the telomerase complex 14 ; HOTAIR can also serve as a scaffold for PRC2 and a H3K4 demethylase complex 13 .Prior studies mapping RNA occupancy at chromatin have revealed substantial insights 15,16 , but only at a single gene locus at a time. The occupancy sites of most lncRNAs are not known, and the roles of lncRNAs in chromatin regulation have been mostly inferred from the indirect effects of lncRNA perturbation. Just as chromatin immunoprecipitation followed by microarray or deep sequencing (ChIP-chip or ChIP-seq, respectively) has greatly improved our understanding of protein-DNA interactions on a genomic scale, here we illustrate a recently published strategy to map long RNA occupancy genome-wide at high resolution 17 . This method, Chromatin Isolation by RNA Purification (ChIRP) (Figure 1), is based on affinity capture of target lncRNA:chromatin complex by tiling antisense-oligos, which then generates a map of genomic binding sites at a resolution of several hundred bases with high sensitivity and low background. ChIRP is applicable to many lncRNAs because the design of affinity-probes is straightforward given the RNA sequence and requires no knowledge of the RNA's structure or functional domains. Video LinkThe video component of this article can be found at https://www.jove.com/video/3912/ Protocol Probe DesignDesign anti-sense DNA tiling probes for selective retrieval of RNA target by ChIRP.1. Design anti-sense oligo probes using the online probe designer at singlemoleculefish.com 18 . 2. Use these parameters: number of probes = 1 probe /100 bp of RNA length; 2) Target GC% = 45; 3) Oligonucleotide length = 20; 4) Spacing length = 60-80. Break RNA into segments if too long for the designer. Omit regions of repeats or extensive homology. 3. Order anti-sense DNA probes with BiotinTEG at 3-prime end. 4....
Summary Dosage compensation in Drosophila is an epigenetic phenomenon utilizing proteins and long noncoding RNAs (lncRNAs) for transcriptional upregulation of the male X chromosome. Here, by using UV crosslinking followed by deep sequencing, we show that two enzymes in the Male-Specific Lethal complex, MLE RNA helicase and MSL2 ubiquitin ligase, bind evolutionarily conserved domains containing tandem stem loops in roX1 and roX2 RNAs in vivo. These domains constitute the minimal RNA unit present in multiple copies in diverse arrangements for nucleation of the MSL complex. MLE binds to these domains with distinct ATP-independent and ATP-dependent behavior. Importantly, we show that different roX RNA domains have overlapping function, since only combinatorial mutations in the tandem stem loops result in severe loss of dosage compensation and consequently male-specific lethality. We propose that repetitive structural motifs in lncRNAs could provide plasticity during multiprotein complex assemblies to ensure efficient targeting in cis or in trans along chromosomes.
Many long noncoding RNAs (lncRNAs) can regulate chromatin states, but the evolutionary origin and dynamics driving lncRNA-genome interactions are unclear. We adapted an integrative strategy that identifies lncRNA orthologs in different species despite limited sequence similarity, which is applicable to mammalian and insect lncRNAs. Analysis of the roX lncRNAs, which are essential for dosage compensation of the single X chromosome in Drosophila males, revealed 47 new roX orthologs in diverse Drosophilid species across ∼40 million years of evolution. Genetic rescue by roX orthologs and engineered synthetic lncRNAs showed that altering the number of focal, repetitive RNA structures determines roX ortholog function. Genomic occupancy maps of roX RNAs in four species revealed conserved targeting of X chromosome neighborhoods but rapid turnover of individual binding sites. Many new roX-binding sites evolved from DNA encoding a pre-existing RNA splicing signal, effectively linking dosage compensation to transcribed genes. Thus, dynamic change in lncRNAs and their genomic targets underlies conserved and essential lncRNA-genome interactions.
Little is known about the functional domain architecture of long RNA molecules, mainly because of a relative paucity of suitable methods to analyze RNA function at a domain level. Here we describe domain-specific chromatin isolation by RNA purification (dChIRP), a scalable technique to dissect pairwise RNA-RNA, RNA-protein, and RNA-chromatin interactions in living cells. dChIRP of roX1, a lncRNA essential for Drosophila X-chromosome dosage compensation, reveals a “three-fingered hand” ribonucleoprotein topology. Each RNA finger binds chromatin and the Male-Specific Lethal (MSL) protein complex, and can individually rescue male lethality in roX-null flies, thus defining a minimal RNA domain for chromosome-wide dosage compensation. dChIRP improves RNA genomic localization signal by >20-fold relative to previous techniques, and these binding sites are correlated with chromosome conformation data, indicating that most roX-bound loci cluster in a nuclear territory. These results suggest dChIRP can reveal lncRNA architecture and function with new precision and sensitivity.
The pairing of CRISPR/Cas9-based gene editing with massively parallel single-cell readouts now enables large-scale lineage tracing. However, the rapid growth in complexity of data from these assays has outpaced our ability to accurately infer phylogenetic relationships. First, we introduce Cassiopeia-a suite of scalable maximum parsimony approaches for tree reconstruction. Second, we provide a simulation framework for evaluating algorithms and exploring lineage tracer design principles. Finally, we generate the most complex experimental lineage tracing dataset to date, 34,557 human cells continuously traced over 15 generations, and use it for benchmarking phylogenetic inference approaches. We show that Cassiopeia outperforms traditional methods by several metrics and under a wide variety of parameter regimes, and provide insight into the principles for the design of improved Cas9-enabled recorders. Together, these should broadly enable large-scale mammalian lineage tracing efforts. Cassiopeia and its benchmarking resources are publicly available at www.github.com/YosefLab/Cassiopeia.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.