We present a theory of the dependence on sequence of the three-dimensional size of large single-stranded (ss) RNA molecules. The work is motivated by the fact that the genomes of many viruses are large ssRNA molecules-often several thousand nucleotides long-and that these RNAs are spontaneously packaged into small rigid protein shells. We argue that there has been evolutionary pressure for the genome to have overall spatial properties-including an appropriate radius of gyration, R g-that facilitate this assembly process. For an arbitrary RNA sequence, we introduce the (thermal) average maximum ladder distance (͗MLD͘) and use it as a measure of the ''extendedness'' of the RNA secondary structure. The ͗MLD͘ values of viral ssRNAs that package into capsids of fixed size are shown to be consistently smaller than those for randomly permuted sequences of the same length and base composition, and also smaller than those of natural ssRNAs that are not under evolutionary pressure to have a compact native form. By mapping these secondary structures onto a linear polymer model and by using ͗MLD͘ as a measure of effective contour length, we predict the R g values of viral ssRNAs are smaller than those of nonviral sequences. More generally, we predict the average ͗MLD͘ values of large nonviral ssRNAs scale as N 0.67؎0.01 , where N is the number of nucleotides, and that their R g values vary as ͗MLD͘ 0.5 in an ideal solvent, and hence as N 0.34 . An alternative analysis, which explicitly includes all branches, is introduced and shown to yield consistent results.branched polymer ͉ ladder distance ͉ radius of gyration ͉ secondary structure ͉ viral RNA
Recent work has shown that pressures inside dsDNA phage capsids can be as high as many tens of atmospheres; it is this pressure that is responsible for initiation of the delivery of phage genomes to host cells. The forces driving ejection of the genome have been shown to decrease monotonically as ejection proceeds, and hence to be strongly dependent on the genome length. Here we investigate the effects of ambient salts on the pressures inside phage-lambda, for the cases of mono-, di-, and tetravalent cations, and measure how the extent of ejection against a fixed osmotic pressure (mimicking the bacterial cytoplasm) varies with cation concentration. We find, for example, that the ejection fraction is halved in 30 mM Mg(2+) and is decreased by a factor of 10 upon addition of 1 mM spermine. These effects are calculated from a simple model of genome packaging, using DNA-DNA repulsion energies as determined independently from x-ray diffraction measurements on bulk DNA solutions. By comparing the measured ejection fractions with values implied from the bulk DNA solution data, we predict that the bending energy makes the d-spacings inside the capsid larger than those for bulk DNA at the same osmotic pressure.
We show on general theoretical grounds that the two ends of single-stranded (ss) RNA molecules (consisting of roughly equal proportions of A, C, G and U) are necessarily close together, largely independent of their length and sequence. This is demonstrated to be a direct consequence of two generic properties of the equilibrium secondary structures, namely that the average proportion of bases in pairs is ∼60% and that the average duplex length is ∼4. Based on mfold and Vienna computations on large numbers of ssRNAs of various lengths (1000–10 000 nt) and sequences (both random and biological), we find that the 5′–3′ distance—defined as the sum of H-bond and covalent (ss) links separating the ends of the RNA chain—is small, averaging 15–20 for each set of viral sequences tested. For random sequences this distance is ∼12, consistent with the theory. We discuss the relevance of these results to evolved sequence complementarity and specific protein binding effects that are known to be important for keeping the two ends of viral and messenger RNAs in close proximity. Finally we speculate on how our conclusions imply indistinguishability in size and shape of equilibrated forms of linear and covalently circularized ssRNA molecules.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.