Many naturally occurring RNA structures contain single mismatches. However, the algorithms currently used to predict RNA structure from sequence rely on a minimal set of data for single mismatches, most of which occur rather infrequently in nature. As a result, several approximations and assumptions are used to predict the stability of RNA duplexes containing the most common single mismatches. Therefore, the relative frequency of single mismatches was determined by compiling and searching a database of 955 RNA secondary structures. Thermodynamic parameters for duplex formation, derived from optical melting experiments, are reported for 28 oligoribonucleotides containing frequently occurring single mismatches. These data were then combined with previous data to construct a dataset of 64 single mismatches, including the 30 most common in the database. Because of this increase in experimental thermodynamic parameters for single mismatches that occur frequently in nature, more accurate free energy calculations have resulted. To improve the prediction of the thermodynamic parameters for duplexes containing single mismatches that have not been experimentally measured, single mismatch-specific nearest neighbor parameters were derived. The free energy of an RNA duplex containing a single mismatch that has not been thermodynamically characterized can be calculated by: DeltaG degrees 37,single mismatch = DeltaG degrees 37,mismatch nt + DeltaG degrees 37,mismatch-NN interaction + DeltaG degrees 37,AU/GU. Here, DeltaG degrees 37,mismatch is -0.4, -2.1, and -0.3 kcal/mol for A.G, G.G, and U.U mismatches, respectively; DeltaG degrees 37,mismatch-NN interaction is 0.7, -0.5, 0.4, -0.4, and -1.0 kcal/mol for 5'YRR3'/3'RRY5', 5'RYY3'/3'YYR5', 5'YYR3'/3'RYY5', 5'YRY3'/3'RYR5', and 5'RRY3'/3'YYR5' mismatch-nearest neighbor combinations, respectively, when A and G are categorized as purines (R) and C and U are categorized as pyrimidines (Y); and DeltaG degrees 37,AU/GU is a penalty of 1.2 kcal/mol for replacing a G-C base pair with either an A-U or G-U base pair. Similar predictive models were also derived for DeltaH degrees single mismatch and DeltaS degrees single mismatch. These new predictive models, in conjunction with the reported thermodynamics for frequently occurring single mismatches, should allow for more accurate calculations of the free energy of RNA duplexes containing single mismatches and, furthermore, allow for improved prediction of secondary structure from sequence.
Internal loops in RNA are important for folding and function. Many folding motifs are internal loops containing GA base pairs, which are usually thermodynamically stabilizing, i.e., contribute favorable free energy to folding. Understanding the sequence dependence of folding stability and structure in terms of molecular interactions, such as hydrogen bonding and base stacking, will provide a foundation for predicting stability and structure. Here, we report the NMR structure of the oligonucleotide duplex, 5'GGUGGAGGCU3'/3'PCCGAAGCCG5' (P = purine), containing an unusually stable and relatively abundant internal loop, 5'GGA3'/3'AAG5'. This loop contains three consecutive sheared GA pairs (trans Hoogsteen/Sugar edge AG) with separate stacks of three G's and three A's in a row. The thermodynamic consequences of various nucleotide substitutions are also reported. Significant destabilization of approximately 2 kcal/mol at 37 degrees C is found for substitution of the middle GA with AA to form 5'GAA3'/3'AAG5'. This destabilization correlates with a unique base stacking and hydrogen-bonding network within the 5'GGA3'/3'AAG5' loop. Interestingly, the motifs, 5'UG3'/3'GA5' and 5'UG3'/3'AA5', have stability similar to 5'CG3'/3'GA5' even though UG and UA pairs are usually less stable than CG pairs. Consecutive sheared GA pairs in the 5'GGA3'/3'AAG5' loop are preorganized for potential tertiary interactions and ligand binding.
Although tetraloops are one of the most frequently occurring secondary structure motifs in RNA, less than one-third of the 30 most frequently occurring RNA tetraloops have been thermodynamically characterized. Therefore, 24 stem-loop sequences containing common tetraloops were optically melted, and the thermodynamic parameters DH°, DS°, DG°3 7, and T M for each stem-loop were determined. These new experimental values, on average, are 0.7 kcal/mol different from the values predicted for these tetraloops using the model proposed by Vecenie CJ, Morrow CV, Zyra A, Serra MJ. 2006. Biochemistry 45: 1400-1407. The data for the 24 tetraloops reported here were then combined with the data for 28 tetraloops that were published previously. A new model, independent of terminal mismatch data, was derived to predict the free energy contribution of previously unmeasured tetraloops. The average absolute difference between the measured values and the values predicted using this proposed model is 0.4 kcal/mol. This new experimental data and updated predictive model allow for more accurate calculations of the free energy of RNA stem-loops containing tetraloops and, furthermore, should allow for improved prediction of secondary structure from sequence. It was also shown that tetraloops within the sequence 59-GCCNNNNGGC-39 are, on average, 0.6 kcal/mol more stable than the same tetraloop within the sequence 59-GGCNNNNGCC-39. More systemic studies are required to determine the full extent of non-nearest-neighbor effects on tetraloop stability.
Pseudouridine (Ψ) is the most common noncanonical nucleotide present in naturally occurring RNA and serves a variety of roles in the cell, typically appearing where structural stability is crucial to function. Ψ residues are isomerized from native uridine residues by a class of highly conserved enzymes known as pseudouridine synthases. In order to quantify the thermodynamic impact of pseudouridylation on U-A base pairs, 24 oligoribonucleotides, 16 internal and eight terminal Ψ-A oligoribonucleotides, were thermodynamically characterized via optical melting experiments. The thermodynamic parameters derived from two-state fits were used to generate linearly independent parameters for use in secondary structure prediction algorithms using the nearestneighbor model. On average, internally pseudouridylated duplexes were 1.7 kcal/mol more stable than their U-A counterparts, and terminally pseudouridylated duplexes were 1.0 kcal/mol more stable than their U-A equivalents. Due to the fact that Ψ-A pairs maintain the same Watson-Crick hydrogen bonding capabilities as the parent U-A pair in A-form RNA, the difference in stability due to pseudouridylation was attributed to two possible sources: the novel hydrogen bonding capabilities of the newly relocated imino group as well as the novel stacking interactions afforded by the electronic configuration of the Ψ residue. The newly derived nearest-neighbor parameters for Ψ-A base pairs may be used in conjunction with other nearest-neighbor parameters for accurately predicting the most likely secondary structure of A-form RNA containing Ψ-A base pairs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.