A number of protein factors regulate protein synthesis by bridging mRNA ends or untranslated regions (UTRs). Using experimental and computational approaches, we show that mRNAs from various organisms, including humans, have an intrinsic propensity to fold into structures in which the 5' end and 3' end are ≤ 7 nm apart irrespective of mRNA length. Computational estimates performed for ~22,000 human transcripts indicate that the inherent proximity of the ends is a universal property of most, if not all, mRNA sequences. Only specific RNA sequences, which have low sequence complexity and are devoid of guanosines, are unstructured and exhibit end-to-end distances expected for the random coil conformation of RNA. Our results suggest that the intrinsic proximity of mRNA ends may facilitate binding of translation factors that bridge mRNA 5' and 3' UTRs. Furthermore, our studies provide the basis for measuring, computing and manipulating end-to-end distances and secondary structure in mRNAs in research and biotechnology.
INRODUCTIONRegulation of mRNA translation in eukaryotes involves protein-mediated interactions between mRNA ends. Translation initiation requires the recruitment of the small ribosomal subunit to the 5' end of the mRNA 1 . The formation of the initiation complex is stimulated by the interaction between the 5' mRNA cap-binding protein eIF4E and the 3' end poly(A) tail binding protein PABP, which is mediated through their binding to different parts of the translational factor eIF4G 2,3 . The eIF4E•eIF4G•PABP complex is thought to enhance translation initiation by circularizing the mRNA and forming the "closed-loop" structure 4-6 . The mechanism by which the mRNA closed loop enhances proteins synthesis is not well understood.Remarkably, translation initiation of many eukaryotic mRNAs is also regulated by sequences in their 3' UTRs and controlled by the formation of protein bridges between the 5' and 3' UTRs. For example, the 3' UTR regulatory sequences recruit protein complexes (e.g. CPEB•Maskin, Bruno•Cup, or GAIT complex), which inhibit translation by interacting with either eIF4E or eIF4E•eIF4G bound to the 5' end of mRNA 7 . The pervasiveness of protein bridges between mRNA UTRs in the evolution of translation regulation is puzzling because of the significant entropic cost expected for protein-mediated mRNA circularization 8 .The entropic penalty for the formation of protein bridges between mRNA ends may be partially mitigated by mRNA compaction through intramolecular basepairing interactions.Recent theoretical analyses suggested that the 5' and 3' ends of long (1,000-10,000 nucleotidelong) RNAs are always brought in the proximity of few nanometers of each other regardless of RNA length and sequence because of the intrinsic propensity of RNA to form widespread intramolecular basepairing interactions 8-10 . One study predicted that the 5' to 3' end distance in RNAs is 3 nm, on average 8 . These theoretical predictions were tested by single-molecule Förster resonance energy transfer (smFRET) measurements o...