The RNA recognition motif (RRM), also known as the RNA-binding domain (RBD) or ribonucleoprotein domain (RNP), was first identified in the late 1980s when it was demonstrated that mRNA precursors (pre-mRNA) and heterogeneous nuclear RNAs (hnRNAs) are always found in complex with proteins (reviewed in [1]). Biochemical characterizations of the mRNA polyadenylate binding protein (PABP) and the hnRNP protein C shed light on a consensus RNA-binding domain of approximately 90 amino acids containing a central sequence of eight conserved residues that are mainly aromatic and positively charged [2,3]. This sequence, termed the RNP consensus sequence, was thought to be involved in RNA interaction and was defined as Lys ⁄ ArgGly-Phe ⁄ Tyr-Gly ⁄ Ala-Phe ⁄ Tyr-Val ⁄ Ile ⁄ Leu-X-Phe ⁄ Tyr, where X can be any amino acid. Later, a second consensus sequence less conserved than the previously characterized one [1] The RNA recognition motif (RRM), also known as RNA-binding domain (RBD) or ribonucleoprotein domain (RNP) is one of the most abundant protein domains in eukaryotes. Based on the comparison of more than 40 structures including 15 complexes (RRM-RNA or RRM-protein), we reviewed the structure-function relationships of this domain. We identified and classified the different structural elements of the RRM that are important for binding a multitude of RNA sequences and proteins. Common structural aspects were extracted that allowed us to define a structural leitmotif of the RRM-nucleic acid interface with its variations. Outside of the two conserved RNP motifs that lie in the center of the RRM b-sheet, the two external b-strands, the loops, the C-and N-termini, or even a second RRM domain allow high RNA-binding affinity and specific recognition. Protein-RRM interactions that have been found in several structures reinforce the notion of an extreme structural versatility of this domain supporting the numerous biological functions of the RRM-containing proteins.Abbreviations ACF, APOBEC-1 complementary factor; CBP, cap binding protein; CstF, cleavage stimulation factor; hnRNP, heterogeneous nuclear ribonucleoprotein; HuD, Hu protein D; LRR, leucine rich repeat; MIF4G, middle domain of the translation initiation factor 4 G; PABP, polyadenylate binding protein; PIE, polyadenylation inhibition element; PTB, polypyrimidine tract binding protein; RBD, RNA-binding domain; RNP, ribonucleoprotein; RRM, RNA recognition motif; SR, serine/arginine rich proteins; TLS, translocated in liposarcoma; U1A, U2A¢, U2B¢: U1 snRNP proteins A, A¢, B¢; U2AF, U2 snRNP auxiliary factor; UHM, U2AF homology motif; UPF, up-frameshift protein.
SUMMARY Sequence-dependent recognition of dsDNA-binding proteins are well understood, yet sequence-specific recognition of dsRNA by proteins remains largely unknown, despite their importance in RNA maturation pathways. Adenosine deaminases that act on RNA (ADARs) recode genomic information by the site-selective deamination of adenosine. Here, we report the solution structure of the ADAR2 double-stranded RNA-binding motifs (dsRBMs) bound to a stem-loop pre-mRNA encoding the R/G editing site of GluR-2. The structure provides a molecular basis for how dsRBMs recognize the shape, and also more surprisingly, the sequence of the dsRNA. The unexpected direct readout of the RNA primary sequence by dsRBMs is achieved via the minor groove of the dsRNA and this recognition is critical for both editing and binding affinity at the R/G site of GluR-2. More generally, our findings suggest a solution to the sequence-specific paradox faced by many dsRBM-containing proteins that are involved in post-transcriptional regulation of gene expression.
Regulatory RNA elements, like riboswitches, respond to intracellular signals by three-dimensional (3D) conformational changes. RNA thermometers employ a similar strategy to sense temperature changes in the cell and regulate the translational machinery. We present here the first 3D NMR structure of the functional domain of a highly conserved bacterial RNA thermometer containing the ribosome binding site that remains occluded at normal temperatures (301C). We identified a region adjacent to the Shine-Dalgarno sequence that has a network of weak hydrogen bonds within the RNA helix. With the onset of heat shock at 421C, destabilisation of the RNA structure initiates at this region and favours the release of the ribosome binding site and of the start codon. Deletion of a highly conserved G residue leads to the formation of a stable regular RNA helix that loses thermosensing ability. Our results indicate that RNA thermometers are able to sense temperature changes without the aid of accessory factors.
Splicing of the c-src N1 exon is repressed by the polypyrimidine tract binding protein (PTB or PTBP1). During exon repression, the U1 snRNP binds properly to the N1 exon 5′ splice site but is made inactive by the presence of PTB. Examining the patterns of nuclease protection at this 5′ splice site, we find that the interaction of U1 is altered by the adjacent PTB. Interestingly, UV-crosslinking identifies a direct contact between the pre-mRNA-bound PTB and the U1 snRNA. EMSA, ITC and NMR studies show that PTB RRMs 1 and 2 bind the pyrimidine-rich internal loop of U1 snRNA stem loop 4. The PTB/U1 interaction prevents further assembly of the U1 snRNP with spliceosomal components downstream. This precise interaction between a splicing regulator and an snRNA component of the spliceosome points to a range of different mechanisms for splicing regulation.
Structural information on RNA, emerging more and more as a major regulator in gene expression, dramatically lags behind compared with information on proteins. Although NMR spectroscopy has proven to be an excellent tool to solve RNA structures, it is hampered by the severe spectral resonances overlap found in RNA, limiting its use for large RNA molecules. Segmental isotope labeling of RNA or ligation of a chemically synthesized RNA containing modified nucleotides with an unmodified RNA fragment have proven to have high potential in overcoming current limitations in obtaining structural information on RNA. However, low yields, cumbersome preparations and sequence requirements have limited its broader application in structural biology. Here we present a fast and efficient approach to generate multiple segmentally labeled RNAs with virtually no sequence requirements with very high yields (up to 10-fold higher than previously reported). We expect this approach to open new avenues in structural biology of RNA.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.