An analysis is presented of experimental versus calculated chemical shifts of the non-exchangeable protons for 28 RNA structures deposited in the Protein Data Bank, covering a wide range of structural building blocks. We have used existing models for ring-current and magnetic-anisotropy contributions to calculate the proton chemical shifts from the structures. Two different parameter sets were tried: (i) parameters derived by Ribas-Prado and Giessner-Prettre (GP set) [(1981) J. Mol. Struct., 76, 81-92.]; (ii) parameters derived by Case [(1995) J. Biomol. NMR, 6, 341-346]. Both sets lead to similar results. The detailed analysis was carried using the GP set. The root-mean-square-deviation between the predicted and observed chemical shifts of the complete database is 0.16 ppm with a Pearson correlation coefficient of 0.79. For protons in the usually well-defined A-helix environment these numbers are, 0.08 ppm and 0.96, respectively. As a result of this good correspondence, a reliable analysis could be made of the structural dependencies of the 1H chemical shifts revealing their physical origin. For example, a down-field shift of either H2' or H3' or both indicates a high-syn/syn chi-angle. In an A-helix it is essentially the 5'-neighbor that affects the chemical shifts of H5, H6 and H8 protons. The H5, H6 and H8 resonances can therefore be assigned in an A-helix on the basis of their observed chemical shifts. In general, the chemical shifts were found to be quite sensitive to structural changes. We therefore propose that a comparison between calculated and observed 1H chemical shifts is a good tool for validation and refinement of structures derived from NOEs and J-couplings.
Reverse transcription of hepatitis B virus (HBV) pregenomic RNA is essential for virus replication. In the first step of this process, HBV reverse transcriptase binds to the highly conserved encapsidation signal, epsilon (epsilon), situated near the 5' end of the pregenome. epsilon has been predicted to form a bulged stem-loop with the apical stem capped by a hexa- loop. After the initial binding to this apical stem- loop, the reverse transcriptase synthesizes a 4 nt primer using the bulge as a template. Here we present mutational and structural data from NMR on the apical stem-loop of epsilon. Application of new isotope-labeling techniques (13C/15N/2H-U-labeling) allowed resolution of many resonance overlaps and an extensive structural data set could be derived. The NMR data show that, instead of the predicted hexa-loop, the apical stem is capped by a stable UGU tri-loop closed by a C-G base pair, followed by a bulged out C. The apical stem contains therefore two unpaired pyrimidines (C1882 and U1889), rather than one as was predicted, spaced by 6 nt. C1882, the 3' neighbour to the G of the loop-closing C-G base pair, is completely bulged out, while U1889 is at least partially intercalated into the stem. Analysis of 205 of our own HBV sequences and 1026 strains from the literature, covering all genotypes, reveals a high degree of conservation of epsilon. In particular, the residues essential for this fold are either totally conserved or show rare non-disruptive mutations. These data strongly indicate that this fold is essential for recognition by the reverse transcriptase.
Hepatitis B virus (HBV) HBV is DNA virus with a unique replication strategy, which involves reverse transcription of its pregenomic RNA. Essential for this reverse transcription are the 5'- and 3'-ends of its pregenomic RNA (5'-RT-RNA and 3'-RT-RNA, respectively) which form conserved bulged stem-loop structures. The 5'-RT-RNA consists of a 67 nucleotide bulged stem-loop structure, epsilon, which constitutes the signal for encapsidation of the pregenomic RNA and subsequent reverse transcription. The reverse transcriptase (RT) initially binds to the completely conserved apical loop of epsilon and a 4-nucleotide primer is synthesized from the adjacent 6-nucleotide bulge. Structural studies of epsilon can provide important parameters required for the design of RNA targeted anti- viral drugs directed against Hepatitis B virus. NMR studies of large RNA systems (> ca. 50 nucleotides) require novel approaches, e.g., different labeling schemes and reduction of the system into separate structural building blocks. Recently, a new method of synthesizing (13)C/(15)N/(2)H labeled nucleotides has been developed based on converting specifically labeled glucose and bases into nucleotides by using enzymes from the pentose phosphate pathway and nucleotide and salvage pathways. These NTPs give a large freedom in designing different labeling patterns in in vitro synthesized RNAs under study for NMR. This opens up the way for NMR studies of RNAs that are considerably above the present size limit (up to 150 nucleotides). Here this new technique is applied for structural studies on 27, 36 and 61 nucleotides long RNA fragments, mimicking different regions of epsilon.
An effective in vitro enzymatic synthesis is described for the production of nucleoside triphosphates (NTPs) which are stereo-specifically deuterated on the H5" position with high selectivity (>98%), and which can have a variety of different labels (13C, 15N, 2H) in other positions. The NTPs can subsequently be employed in the enzymatic synthesis of RNAs using T7 polymerase from a DNA template. The stereo-specific deuteration of the H5" immediately provides the stereo-specific assignment of H5' resonances in NMR spectra, giving access to important structural parameters. Stereo-chemical H-exchange was used to convert commercially available 1,2,3,4,5,6,6-2H-1,2,3,4,5,6-13C-D-glucose (d7-13C6-D-glucose) into [1,2,3,4,5,6(R)-2H-1,2,3,4,5,6-13C]-D-glucose (d6-13C6-D-glucose). [1',3',4',5"-2H-1',2',3',4',5'-13C]GTP (d4-13C5-GTP) was then produced from d6-13C6-D-glucose and guanine base via in vitro enzymatic synthesis employing enzymes from the pentose-phosphate, nucleotide biosynthesis and salvage pathways. The overall yield was approximately 60 mg NTP per 1 g glucose, comparable with the yield of NTPs isolated from Escherichia coli grown on enriched media. The d4-13C5-GTP, together with in vitro synthesised d5-UTP, d5-CTP and non-labelled ATP, were used in the synthesis of a 31 nt RNA derived from the primer binding site of hepatitis B virus genomic RNA. (13C,1H) hetero-nuclear multiple-quantum spectra of the specifically deuterated sample and of a non-deuterated uniformly 13C/15N-labelled sample demonstrates the reduced spectral crowding and line width narrowing compared with 13C-labelled non-deuterated RNA.
We propose a strategy for NMR structure determination of RNA based on deuteration and use of specific labeling patterns. This strategy involves the use of NTPs that are deuterated in the ribose ring except for specific positions, e.g. H2', and that are either unlabeled or uniformly labeled in (13)C and (15)N in either the ribose or the base or both. Incorporation of these NTPs into an RNA sequence reduces both resonance line-width and spectral overlap. A limited number of combinations of these differently labeled NTPs in an RNA sequence suffices to obtain all relevant proton resonance assignments and structure parameters necessary for structure determination of larger systems (≫ 50 nucleotides). We describe the in vitro synthesis of the deuterated and/or (13)C/(15)N-labeled NTPs from glucose via separate enzymatic reactions. First, enzymes from the pentose-phosphate pathway efficiently convert glucose into ribose and enzymes from nucleotide biosynthesis and salvage pathways subsequently convert the ribose into nucleosides triphosphates (NTPs). The enzymes from the pentosephosphate pathway are all commercially available; the remaining enzymes have been purified from over-expressing strains. Separate enzymatic reactions were used to convert (2)H(7)- (13)C(6)-glucose into [1',3',4',5',5″-(2)H(5)-1',2',3',4',5',2,4,5,6-(13)C(9)-1,3-(15)N(2)]UTP and (2)H(7)-glucose into [1',3',4',5',5″-(2)H(5)]ATP, [1',3',4',5',5″-(2)H(5)]GTP, and [1',3',4',5',5″-(2)H(5)] CTP. The synthesis yields up to 1 gram of NTPs from 1 gram of glucose, which is about 5 to 10 times as efficient extraction for E. Coli grown on glucose. The synthesis presented here, is a modification of the method described by Tolbert & Williamson (1,2). (1)D and (2)D NMR spectra were acquired to demonstrate the utility of the new labeling patterns. The enzymatically synthesized NTPs were used in the synthesis of a 31-nucleotide RNA derived from the primer binding site of Hepatitis B virus genomic RNA to asses their efficiency in transcription.
We have developed a fast computational method to study the conformation and energetics of several hundred base pairs long DNA loops. The DNA is modeled as a self-equilibrated electrically charged elastic rod. The ensemble of DNA loop conformations, attainable for a given geometry of the DNA ends, is generated as a set of numerical solutions to the equations of the Kirchhoff-Love theory of elasticity. The equations are augmented by electrostatic and steric repulsion force terms. The method provides the basis for multi-resolution modeling of protein-DNA complexes, e.g., through using the calculated forces in subsequent full atom molecular dynamics simulations.We demonstrate the application of the method in several test case studies of protein-DNA systems. One is the promoter DNA loop of E.coli, clamped by the lac repressor and possibly also bound by the catabolite gene activator protein (CAP). Two topologically alternative structures of the loop are found, which exchange the role of the global energy minimum for different DNA lengths. The changes of the loops structure and energy, introduced by CAP, provide insights into the mechanism of the cooperative DNA binding by CAP and the lac repressor, observed in the experiment.Another test system for the developed method is the nucleosom.We solve Kirchhoff equations with the introduced force terms in order to reproduce the experimentally known structure of nucleosomal DNA wound around the histone core. Structural and Functional Studies of Proteins withStructural genomics presents an enormous challenge with up to 100,OOO protein targets in the human genome alone. To speed the current rate of structure determination, judicious selection of targets and a system which allows efficient experimental testing, is neccessary.Pyrobaculum aerophifium (PA), a hyperthermophilic, archeaebacteria has recently been sequenced and annotated. The thermostable proteins from PA are easily purified and appear to crystallize more readily than their mesophilic homologs, providing a good experimental system for high throughput structure determination. A method to assign a probability that a sequence assumes a novel fold was developed and used in conjunction with traditional sequence analysis methods to target proteins from the genome (Mallick unpublished data). Targeted proteins have a probability greater than 0.88 of having a new fold and have disease related human homologs.Targeted proteins were cloned into E. coli expression vectors, over-expressed and purified.Crystal trials for X-ray crystallographic structure determination are underway. In addition functional studies of the proteins are in progress.Our approach to protein targeting will yield valuable protein topological information useful for fold recognition and drugdesign for human diseases. In addition PA will provide insights into thermostable enzymedproteins with potential industrial applications.Structural and functional importance of mutations in 1816 Hepatitis B virus pregenomic RNA studied by NMR 181 8 Prediction of protein foldi...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.