RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5′ untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5′ UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms.
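As a rough illustration of the compensatory mutagenesis logic mentioned above, the sketch below takes a sequence and one proposed base pair, then builds a single mutant that should break the pair and a double mutant that should restore it. The function names, the 0-based indexing, and the choice of substituting the complementary base are assumptions made for this example, not a protocol from the Review.

```python
# Illustrative sketch only: names and mutation choices are hypothetical.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def can_pair(x: str, y: str) -> bool:
    """True for Watson-Crick pairs and the G-U wobble pair."""
    return COMPLEMENT[x] == y or {x, y} == {"G", "U"}


def compensatory_mutants(seq: str, i: int, j: int) -> tuple[str, str]:
    """Return (disrupted, rescued) variants for a proposed pair at 0-based i, j.

    disrupted: base i is replaced by its complement, which breaks the pair.
    rescued:   base j is additionally replaced to pair with the new base i.
    """
    rna = seq.upper().replace("T", "U")
    if not can_pair(rna[i], rna[j]):
        raise ValueError("positions i and j do not form a base pair")

    new_i = COMPLEMENT[rna[i]]
    disrupted = rna[:i] + new_i + rna[i + 1:]

    new_j = COMPLEMENT[new_i]
    rescued = disrupted[:j] + new_j + disrupted[j + 1:]
    return disrupted, rescued


disrupted, rescued = compensatory_mutants("GGGAAAACCC", 0, 9)
print(disrupted)  # single mutant, pair broken
print(rescued)    # double mutant, pair restored with a different sequence
```

If the structure rather than the sequence is what matters, the single mutant is expected to lose function and the double mutant to regain it; the code only generates the variants and makes no prediction about either outcome.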
Tumor necrosis factor-α (TNF-α) is the most potent proinflammatory cytokine in mammals. The degradation of TNF-α mRNA is critical for restricting TNF-α synthesis and involves a constitutive decay element (CDE) in the 3' UTR of the mRNA. Here, we demonstrate that the CDE folds into an RNA stem-loop motif that is specifically recognized by Roquin and Roquin2. Binding of Roquin initiates degradation of TNF-α mRNA and limits TNF-α production in macrophages. Roquin proteins promote mRNA degradation by recruiting the Ccr4-Caf1-Not deadenylase complex. CDE sequences are highly conserved and are found in more than 50 vertebrate mRNAs, many of which encode regulators of development and inflammation. In macrophages, CDE-containing mRNAs were identified as the primary targets of Roquin on a transcriptome-wide scale. Thus, Roquin proteins act broadly as mediators of mRNA deadenylation by recognizing a conserved class of stem-loop RNA degradation motifs.
During eukaryotic evolution, ribosomes have considerably increased in size, forming a surface-exposed ribosomal RNA (rRNA) shell of unknown function, which may create an interface for as yet uncharacterized interacting proteins. To investigate such protein interactions, we establish a ribosome affinity purification method that unexpectedly identified hundreds of ribosome-associated proteins (RAPs) from categories including metabolism, cell cycle, as well as RNA- and protein-modifying enzymes that functionally diversify mammalian ribosomes. By further characterizing RAPs, we discover the presence of ufmylation, a metazoan-specific posttranslational modification, on ribosomes and define its direct substrates. Moreover, we show that the metabolic enzyme pyruvate kinase muscle (PKM) interacts with sub-pools of endoplasmic reticulum (ER)-associated ribosomes, exerting a non-canonical function as an RNA-binding protein in the translation of ER-destined mRNAs. Therefore, RAPs interconnect one of life’s most ancient molecular machines with diverse cellular processes, providing an additional layer of regulatory potential to protein expression.
Determining the composition of messenger ribonucleoprotein (mRNP) particles is essential for a comprehensive understanding of the complex mechanisms underlying mRNA regulation, but is technically challenging. Here we present an RNA-based method to identify RNP components using a modified streptavidin (SA)-binding RNA aptamer termed S1m. By optimizing the RNA aptamer S1 in structure and repeat conformation, we improved its affinity for SA and found a 4-fold repeat of S1m (4×S1m) to be more efficient than the established MS2 and PP7 systems from bacteriophages. We then attached the AU-rich element (ARE) of tumor necrosis factor alpha (TNFα), a well-known RNA motif that induces mRNA degradation, via 4×S1m to a SA matrix, and used the resulting RNA affinity column to purify ARE-binding proteins (BPs) from cellular extracts. By quantitative mass spectrometry using differential dimethyl labeling, we identified the majority of established ARE-BPs and detected several RNA-BPs that had previously not been associated with AREs. For two of these proteins, Rbms1 and Roxan, we confirmed specific binding to the TNFα ARE. The optimized 4×S1m aptamer, therefore, provides a powerful tool for the discovery of mRNP components in a single affinity purification step.
Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop an RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that highly structured “superfolder” mRNAs can be designed to improve both stability and expression with further enhancement through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.
In mammalian cells, AU-rich elements (AREs) are well-known regulatory sequences located in the 3′ untranslated region (UTR) of many short-lived mRNAs. AREs cause mRNAs to be degraded rapidly and thereby suppress gene expression at the posttranscriptional level. Based on the number of AUUUA pentamers, their proximity, and surrounding AU-rich regions, we generated an algorithm termed AREScore that identifies AREs and provides a numerical assessment of their strength. By analyzing the AREScore distribution in the transcriptomes of 14 metazoan species, we provide evidence that AREs were selected for in several vertebrates and Drosophila melanogaster. We then measured mRNA expression levels genome-wide to address the importance of AREs in SL2 cells derived from D. melanogaster hemocytes. Tis11, a zinc finger RNA-binding protein homologous to mammalian tristetraprolin, was found to target ARE-containing reporter mRNAs for rapid degradation in SL2 cells. Drosophila mRNAs whose expression is elevated upon knockdown of Tis11 were found to have higher AREScores. Moreover, high AREScores correlate with reduced mRNA expression levels on a genome-wide scale. The precise measurement of degradation rates for 26 Drosophila mRNAs revealed that the AREScore is a very good predictor of short-lived mRNAs. Taken together, this study introduces AREScore as a simple tool to identify ARE-containing mRNAs and provides compelling evidence that AREs are widespread regulatory elements in Drosophila.
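To make the idea of scoring by pentamer count and proximity concrete, here is a minimal toy sketch. It is not the published AREScore algorithm: the function names, the weights (`base`, `proximity_bonus`) and the `max_gap` threshold are hypothetical values chosen for illustration, and the contribution of surrounding AU-rich regions is left out entirely.

```python
import re


def find_pentamers(seq: str) -> list[int]:
    """Return 0-based start positions of every AUUUA pentamer, overlaps included."""
    rna = seq.upper().replace("T", "U")
    # A lookahead lets overlapping pentamers (e.g. AUUUAUUUA) each be reported.
    return [m.start() for m in re.finditer(r"(?=AUUUA)", rna)]


def toy_are_score(seq: str, base: float = 1.0,
                  proximity_bonus: float = 0.5, max_gap: int = 4) -> float:
    """Toy score: a fixed amount per pentamer plus a bonus for close neighbours.

    All weights are hypothetical, not those of the real AREScore.
    """
    positions = find_pentamers(seq)
    score = base * len(positions)
    for a, b in zip(positions, positions[1:]):
        # Nucleotides between the end of one pentamer and the start of the
        # next; negative when the two pentamers overlap.
        gap = b - (a + 5)
        if gap <= max_gap:
            score += proximity_bonus
    return score


print(find_pentamers("AUUUAUUUAUUUA"))
print(toy_are_score("AUUUAUUUAUUUA"))
print(toy_are_score("AUUUAGGGGGGGGGGAUUUA"))
```

Clustered, overlapping pentamers score higher than the same number of isolated ones, which is the qualitative behaviour the abstract describes; the actual weighting scheme should be taken from the original publication.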
Therapeutic mRNAs and vaccines are being developed for a broad range of human diseases, including COVID-19. However, their optimization is hindered by mRNA instability and inefficient protein expression. Here, we describe design principles that overcome these barriers. We develop a new RNA sequencing-based platform called PERSIST-seq to systematically delineate in-cell mRNA stability, ribosome load, as well as in-solution stability of a library of diverse mRNAs. We find that, surprisingly, in-cell stability is a greater driver of protein output than high ribosome load. We further introduce a method called In-line-seq, applied to thousands of diverse RNAs, that reveals sequence and structure-based rules for mitigating hydrolytic degradation. Our findings show that superfolder mRNAs can be designed to improve both stability and expression that are further enhanced through pseudouridine nucleoside modification. Together, our study demonstrates simultaneous improvement of mRNA stability and protein expression and provides a computational-experimental platform for the enhancement of mRNA medicines.