We have identified three C͞D-box small nucleolar RNAs (snoRNAs) and one H͞ACA-box snoRNA in mouse and human. In mice, all four snoRNAs (MBII-13, MBII-52, MBII-85, and MBI-36) are exclusively expressed in the brain, unlike all other known snoRNAs. Two of the human RNA orthologues (HBII-52 and HBI-36) share this expression pattern, and the remainder, HBII-13 and HBII-85, are prevalently expressed in that tissue. In mice and humans, the brain-specific H͞ACA box snoRNA (MBI-36 and HBI-36, respectively) is intronencoded in the brain-specific serotonin 2C receptor gene. The three human C͞D box snoRNAs map to chromosome 15q11-q13, within a region implicated in the Prader-Willi syndrome (PWS), which is a neurogenetic disease resulting from a deficiency of paternal gene expression. Unlike other C͞D box snoRNAs, two snoRNAs, HBII-52 and HBII-85, are encoded in a tandemly repeated array of 47 or 24 units, respectively. In mouse the homologue of HBII-52 is processed from intronic portions of the tandem repeats. Interestingly, these snoRNAs were absent from the cortex of a patient with PWS and from a PWS mouse model, demonstrating their paternal imprinting status and pointing to their potential role in the etiology of PWS. Despite displaying hallmarks of the two families of ubiquitous snoRNAs that guide 2-O-ribose methylation and pseudouridylation of rRNA, respectively, they lack any telltale rRNA complementarity. Instead, brain-specific C͞D box snoRNA HBII-52 has an 18-nt phylogenetically conserved complementarity to a critical segment of serotonin 2C receptor mRNA, pointing to a potential role in the processing of this mRNA. T he biogenesis of eukaryotic ribosomes involves a complex rRNA processing pathway mostly taking place in a specialized subnuclear compartment, the nucleolus. Pre-rRNA maturation includes, in addition to a series of endonucleolytic and exonucleolytic cleavages, the covalent modification of a definite subset of rRNA nucleotides, essentially by 2Ј-O-ribose methylation and pseudouridylation. Each of these modifications is found at about 100 sites per vertebrate ribosome (1). Although these modifications are phylogenetically conserved and restricted to the most highly conserved and functionally important regions of rRNA, their function remains largely unknown. Spliceosomal small nuclear RNAs (snRNAs) also contain a number of conserved 2Ј-Omethylated nucleotides and pseudouridines, confined to snRNA sequences critical for splicing, i.e., involved in contacts with premRNA or other snRNAs (2). Remarkably, nucleotide modifications in the 5Ј terminal region of U2 snRNA are required for assembly of a functional U2 sn-ribonucleoprotein particle (3).The nucleolus contains a large number of small, metabolically stable RNAs, termed small nucleolar RNAs (snoRNAs) that fall into two major classes, the C͞D box and H͞ACA box snoRNAs, designated after common sequence motifs involved in the assembly of sno-ribonucleoprotein particles. Although each class includes a small number of snoRNAs required for definite pre-rRNA cl...
Recent bioinformatics-aided searches have identified many new small RNAs (sRNAs) in the intergenic regions of the bacterium Escherichia coli. Here, a shot-gun cloning approach (RNomics) was used to generate cDNA libraries of small sized RNAs. Besides many of the known sRNAs, we found new species that were not predicted previously. The present work brings the number of sRNAs in E.coli to 62. Experimental transcription start site mapping showed that some sRNAs were encoded from independent genes, while others were processed from mRNA leaders or trailers, indicative of a parallel transcriptional output generating sRNAs co-expressed with mRNAs. Two of these RNAs (SroA and SroG) consist of known (THI and RFN) riboswitch elements. We also show that two recently identified sRNAs (RyeB and SraC/RyeA) interact, resulting in RNase III-dependent cleavage. To the best of our knowledge, this represents the first case of two non-coding RNAs interacting by a putative antisense mechanism. In addition, intracellular metabolic stabilities of sRNAs were determined, including ones from previous screens. The wide range of half-lives (<2 to >32 min) indicates that sRNAs cannot generally be assumed to be metabolically stable. The experimental characterization of sRNAs analyzed here suggests that the definition of an sRNA is more complex than previously assumed.
In contrast to the fairly reliable and complete annotation of the protein coding genes in the human genome, comparable information is lacking for non-coding RNAs. We present a comparative screen of vertebrate genomes for structural non-coding RNAs, which evaluates sequence conservation, secondary structure conservation, and thermodynamic stability of putative RNA structures. We predict more than 30 000 structured RNA elements in the human genome, almost 1000 of which are conserved across all vertebrates. Roughly a third is found in introns of known genes, a sixth are potential regulatory elements in untranslated regions, about half are located far away of any known gene. Only a small fraction of these sequences has been described previously. EST data demonstrate, however, that the majority of them is at least transcribed. The widespread conservation of secondary structure points to a large number of functional ncRNAs in the human genome, which we estimate to be comparable to the number of protein-coding genes. The recent finishing of the human genome sequence emphasizes the "need for reliable experimental and computational methods for comprehensive identification of non-coding RNAs" 1 . A variety of experimental techniques have been used to uncover the human and mouse transcriptomes, in particular tiling arrays 2-4 , cDNA sequencing 5,6 , and unbiased mapping of transcription factor binding sites 7 . All these studies agree that a substantial fraction of the genome is transcribed and that a large fraction of the transcriptome consists of non-coding RNAs. It is unclear, however, which fraction are functional non-coding RNAs (ncRNAs), and which constitutes "transcriptional noise" 8 .Genome-wide computational surveys of ncRNAs, on the other hand, have been impossible until recently, because ncRNAs do not share common signals that could be detected at the sequence level. A large class of ncRNAs, however, has characteristic structures that are functional and hence are well conserved over evolutionary timescales: most of the "classical" ncRNAs, including rRNAs, tRNAs, snRNAs, snoRNAs, as well as the RNA components of RNAse P and the signal recognition particle, are of this type. The stabilizing selection acting on the secondary structure causes characteristic substitution patterns in the underlying sequences: Consistent and compensatory mutations replace one type of base-pair by another one in the paired regions (helices) of the molecule. In addition, loop regions are more variable than helices. These patterns can be ex-1 ploited in comparative computational approaches 9-12 to discriminate functional RNAs from other types of conserved sequence. Recently, high levels of sequence conservation of non-coding DNA regions have been reported 13, 14 . Here we screen the complete collection of conserved non-coding DNA sequences from mammalian genomes and provide a first annotation of the complement of structurally conserved RNAs in the human genome. ResultsSelection of conserved sequences and screening for structural RNAs We s...
In a specialized cDNA library from the archaeon Archaeoglobus fulgidus we have identified a total of 86 different expressed RNA sequences potentially encoding previously uncharacterized small non-messenger RNA (snmRNA) species. Ten of these RNAs resemble eukaryotic small nucleolar RNAs, which guide rRNA 2'-O-methylations (C/D-box type) and pseudouridylations (H/ACA-box type). Thereby, we identified four candidates for H/ACA small RNAs in an archaeal species that are predicted to guide a total of six rRNA pseudouridylations. Furthermore, we have verified the presence of the six predicted pseudouridines experimentally. We demonstrate that 22 snmRNAs are transcribed from a family of short tandem repeats conserved in most archaeal genomes and shown previously to be potentially involved in replicon partitioning. In addition, four snmRNAs derived from the rRNA operon of A. fulgidus were identified and shown to be generated by a splicing/processing pathway of pre-rRNAs. The remaining 50 RNAs could not be assigned to a known class of snmRNAs because of the lack of known structure and/or sequence motifs. Regarding their location on the genome, only nine were located in intergenic regions, whereas 33 were complementary to an ORF, five were overlapping an ORF, and three were derived from the sense orientation within an ORF. Our study further supports the importance of snmRNAs in all three domains of life.
In mouse brain cDNA libraries generated from small RNA molecules we have identified a total of 201 different expressed RNA sequences potentially encoding novel small non‐messenger RNA species (snmRNAs). Based on sequence and structural motifs, 113 of these RNAs can be assigned to the C/D box or H/ACA box subclass of small nucleolar RNAs (snoRNAs), known as guide RNAs for rRNA. While 30 RNAs represent mouse homologues of previously identified human C/D or H/ACA snoRNAs, 83 correspond to entirely novel snoRNAs. Among these, for the first time, we identified four C/D box snoRNAs and four H/ACA box snoRNAs predicted to direct modifications within U2, U4 or U6 small nuclear RNAs (snRNAs). Furthermore, 25 snoRNAs from either class lacked antisense elements for rRNAs or snRNAs. Therefore, additional snoRNA targets have to be considered. Surprisingly, six C/D box snoRNAs and one H/ACA box snoRNA were expressed exclusively in brain. Of the 88 RNAs not belonging to either snoRNA subclass, at least 26 are probably derived from truncated heterogeneous nuclear RNAs (hnRNAs) or mRNAs. Short interspersed repetitive elements (SINEs) are located on five RNA sequences and may represent rare examples of transcribed SINEs. The remaining RNA species could not as yet be assigned either to any snmRNA class or to a part of a larger hnRNA/mRNA. It is likely that at least some of the latter will represent novel, unclassified snmRNAs.
Small nucleolar RNAs (designated as snoRNAs in Eukarya or sRNAs in Archaea) can be grouped into H/ACA or C/D box snoRNA (sRNA) subclasses. In Eukarya, H/ACA snoRNAs assemble into a ribonucleoprotein (RNP) complex comprising four proteins: Cbf5p, Gar1p, Nop10p and Nhp2p. A homolog for the Nhp2p protein has not been identified within archaeal H/ACA RNPs thus far, while potential orthologs have been identified for the other three proteins. Nhp2p is related, particularly in the middle portion of the protein sequence, to the archaeal ribosomal protein and C/D box protein L7Ae. This finding suggests that L7Ae may be able to substitute for the Nhp2p protein in archaeal H/ACA sRNAs. By band shift assays, we have analyzed in vitro the interaction between H/ACA box sRNAs and protein L7Ae from the archaeon Archaeoglobus fulgidus. We present evidence that L7Ae forms specific complexes with three different H/ACA sRNAs, designated as Afu-4, Afu-46 and Afu-190 with an apparent K(d) ranging from 28 to 100 nM. By chemical and enzymatic probing we show that distinct bases located within bulges or loops of H/ACA sRNAs interact with the L7Ae protein. These findings are corroborated by mutational analysis of the L7Ae binding site. Thereby, the RNA motif required for L7Ae binding exhibits a structure, designated as the K-turn, which is present in all C/D box sRNAs. We also identified four H/ACA RNAs from the archaeal species Pyrococcus which exhibit the K-turn motif at a similar position in their structure. These findings suggest a triple role for L7Ae protein in Archaea, e.g. in ribosomes as well as H/ACA and C/D box sRNP biogenesis and function by binding to the K-turn motif.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.