In mammals and other eukaryotes most of the genome is transcribed in a developmentally regulated manner to produce large numbers of long non-coding RNAs (ncRNAs). Here we review the rapidly advancing field of long ncRNAs, describing their conservation, their organization in the genome and their roles in gene regulation. We also consider the medical implications, and the emerging recognition that any transcript, regardless of coding potential, can have an intrinsic function as an RNA.
The competitive endogenous RNA (ceRNA) hypothesis proposes that transcripts with shared microRNA (miRNA) binding sites compete for post-transcriptional control. This hypothesis has gained substantial attention as a unifying function for long non-coding RNAs, pseudogene transcripts and circular RNAs, as well as an alternative function for messenger RNAs. Empirical evidence supporting the hypothesis is accumulating but not without attracting scepticism. Recent studies that model transcriptome-wide binding-site abundance suggest that physiological changes in expression of most individual transcripts will not compromise miRNA activity. In this Review, we critically evaluate the evidence for and against the ceRNA hypothesis to assess the impact of endogenous miRNA-sponge interactions.
A major proportion of the mammalian transcriptome comprises long RNAs that have little or no protein-coding capacity (ncRNAs). Only a handful of such transcripts have been examined in detail, and it is unknown whether this class of transcript is generally functional or merely artifact. Using in situ hybridization data from the Allen Brain Atlas, we identified 849 ncRNAs (of 1,328 examined) that are expressed in the adult mouse brain and found that the majority were associated with specific neuroanatomical regions, cell types, or subcellular compartments. Examination of their genomic context revealed that the ncRNAs were expressed from diverse places including intergenic, intronic, and imprinted loci and that many overlap with, or are transcribed antisense to, proteincoding genes of neurological importance. Comparisons between the expression profiles of ncRNAs and their associated proteincoding genes revealed complex relationships that, in combination with the specific expression profiles exhibited at both regional and subcellular levels, are inconsistent with the notion that they are transcriptional noise or artifacts of chromatin remodeling. Our results show that the majority of ncRNAs are expressed in the brain and provide strong evidence that the majority of processed transcripts with no protein-coding capacity function intrinsically as RNAs.genomics ͉ neuroscience ͉ transcriptomics ͉ imprinting ͉ subcellular A lthough only 1.2% of the mammalian genome encodes proteins, it is now evident that most of the genome is transcribed to yield complex patterns of interlaced and overlapping transcripts that include tens of thousands of long (Ͼ200 nt) noncoding RNAs (ncRNAs) (1, 2). Although a small number of long ncRNAs have been functionally characterized (3), it remains a matter of debate whether the majority are biologically meaningful or merely transcriptional ''noise'' (4-7). The few long ncRNAs that have been characterized to date exhibit a diverse range of functions (3,8) and expression in specific cell types and/or localization to specific subcellular compartments (9-12). The determination of whether many more long ncRNAs are functional may considerably impact our understanding of various fundamental biological processes and significantly influence the approaches used to investigate them.If this class of long ncRNAs is indeed functional, one would expect that they would, in the main, show developmentally regulated and cell-specific expression patterns. The Allen Brain Atlas (ABA) is a large-scale study of the adult mouse brain that comprehensively catalogues and maps the patterns of gene expression that underlie brain development and function on a genome-wide scale (13). The ABA used high-throughput RNA in situ hybridization (ISH) to visualize the expression of over 20,000 mainly protein-coding transcripts from the mouse transcriptome at cellular resolution. We discovered that the ABA (13) also contained ISH data for many long ncRNAs. Our analysis of these data provides a landscape perspective of ncRNA expressi...
Summary The human mitochondrial genome comprises a distinct genetic system transcribed as precursor polycistronic transcripts that are subsequently cleaved to generate individual mRNAs, tRNAs and rRNAs. Here we provide a comprehensive analysis of the human mitochondrial transcriptome across multiple cell lines and tissues. Using directional deep sequencing and parallel analysis of RNA ends, we demonstrate wide variation in mitochondrial transcript abundance and precisely resolve transcript processing and maturation events. We identify previously undescribed transcripts, including small RNAs, and observe the enrichment of several nuclear RNAs in mitochondria. Using high-throughput in vivo DNaseI footprinting, we establish the global profile of DNA-binding protein occupancy across the mitochondrial genome at single nucleotide resolution, revealing regulatory features at mitochondrial transcription initiation sites and functional insights into disease-associated variants. This integrated analysis of the mitochondrial transcriptome reveals unexpected complexity in the regulation, expression, and processing of mitochondrial RNA, and provides a resource for future studies of mitochondrial function (accessed at mitochondria.matticklab.com).
For 50 years the term 'gene' has been synonymous with regions of the genome encoding mRNAs that are translated into protein. However, recent genome-wide studies have shown that the human genome is pervasively transcribed and produces many thousands of regulatory non-protein-coding RNAs (ncRNAs), including microRNAs, small interfering RNAs, PIWI-interacting RNAs and various classes of long ncRNAs. It is now clear that these RNAs fulfil critical roles as transcriptional and post-transcriptional regulators and as guides of chromatin-modifying complexes. Here we review the biology of ncRNAs, focusing on the fundamental mechanisms by which ncRNAs facilitate normal development and physiology and, when dysfunctional, underpin disease. We also discuss evidence that intergenic regions associated with complex diseases express ncRNAs, as well as the potential use of ncRNAs as diagnostic markers and therapeutic targets. Taken together, these observations emphasize the need to move beyond the confines of protein-coding genes and highlight the fact that continued investigation of ncRNA biogenesis and function will be necessary for a comprehensive understanding of human disease.
Studies of the transcriptional output of the human and mouse genomes have revealed that there are many more transcripts produced than can be accounted for by predicted protein-coding genes. Using a custom microarray, we have identified 184 non-coding RNAs that exhibit more than twofold up-or down-regulation upon differentiation of C2C12 myoblasts into myotubes. Here, we focus on the Men e/b locus, which is up-regulated 3.3-fold during differentiation. Two non-coding RNA isoforms are produced from a single RNA polymerase II promoter, differing in the location of their 39 ends. Men e is a 3.2-kb polyadenylated RNA, whereas Men b is an ;20-kb transcript containing a genomically encoded poly(A)-rich tract at its 39-end. The 39-end of Men b is generated by RNase P cleavage. The Men e/b transcripts are localized to nuclear paraspeckles and directly interact with NONO. Knockdown of MEN e/b expression results in the disruption of nuclear paraspeckles. Furthermore, the formation of paraspeckles, after release from transcriptional inhibition by DRB treatment, was suppressed in MEN e/b-depleted cells. Our findings indicate that the MEN e/b non-coding RNAs are essential structural/organizational components of paraspeckles.
The transcriptional networks that regulate embryonic stem (ES) cell pluripotency and lineage specification are the subject of considerable attention. To date such studies have focused almost exclusively on protein-coding transcripts. However, recent transcriptome analyses show that the mammalian genome contains thousands of long noncoding RNAs (ncRNAs), many of which appear to be expressed in a developmentally regulated manner. The functions of these remain untested. To identify ncRNAs involved in ES cell biology, we used a custom-designed microarray to examine the expression profiles of mouse ES cells differentiating as embryoid bodies (EBs) over a 16-d time course. We identified 945 ncRNAs expressed during EB differentiation, of which 174 were differentially expressed, many correlating with pluripotency or specific differentiation events. Candidate ncRNAs were identified for further characterization by an integrated examination of expression profiles, genomic context, chromatin state, and promoter analysis. Many ncRNAs showed coordinated expression with genomically associated developmental genes, such as Dlx1, Dlx4, Gata6, and Ecsit. We examined two novel developmentally regulated ncRNAs, Evx1as and Hoxb5/6as, which are derived from homeotic loci and share similar expression patterns and localization in mouse embryos with their associated protein-coding genes. Using chromatin immunoprecipitation, we provide evidence that both ncRNAs are associated with trimethylated H3K4 histones and histone methyltransferase MLL1, suggesting a role in epigenetic regulation of homeotic loci during ES cell differentiation. Taken together, our data indicate that long ncRNAs are likely to be important in processes directing pluripotency and alternative differentiation programs, in some cases through engagement of the epigenetic machinery.
Cancer is a disease of aberrant gene expression. While the genetic causes of cancer have been intensively studied, it is becoming evident that a large proportion of cancer susceptibility cannot be attributed to variation in protein-coding sequences. This is highlighted by genome-wide association studies in cancer that reveal that more than 80% of cancer-associated SNPs occur in noncoding regions of the genome. In this review, we posit that a significant fraction of the genetic aetiology of cancer is exacted by noncoding regulatory sequences, particularly by long noncoding RNAs (lncRNAs). Recent studies indicate that several cancer risk loci are transcribed into lncRNAs and these transcripts play key roles in tumorigenesis. We discuss the epigenetic and other mechanisms through which lncRNAs function and how they contribute to each stage of cancer progression, understanding of which will be crucial for realising new opportunities in cancer diagnosis and treatment. Long noncoding RNAs play important roles in almost every aspect of cell biology from nuclear organisation and epigenetic regulation to post-transcriptional regulation and splicing, and we link these processes to the hallmarks and genetics of cancer. Finally, we highlight recent progress and future potential in the application of lncRNAs as therapeutic targets and diagnostic markers.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.