It is well recognized that base sequence exerts a significant influence on the properties of DNA and plays a significant role in protein–DNA interactions vital for cellular processes. Understanding and predicting base sequence effects requires an extensive structural and dynamic dataset which is currently unavailable from experiment. A consortium of laboratories was consequently formed to obtain this information using molecular simulations. This article describes results providing information not only on all 10 unique base pair steps, but also on all possible nearest-neighbor effects on these steps. These results are derived from simulations of 50–100 ns on 39 different DNA oligomers in explicit solvent and using a physiological salt concentration. We demonstrate that the simulations are converged in terms of helical and backbone parameters. The results show that nearest-neighbor effects on base pair steps are very significant, implying that dinucleotide models are insufficient for predicting sequence-dependent behavior. Flanking base sequences can notably lead to base pair step parameters in dynamic equilibrium between two conformational sub-states. Although this study only provides limited data on next-nearest-neighbor effects, we suggest that such effects should be analyzed before attempting to predict the sequence-dependent behavior of DNA.
We present the results of microsecond molecular dynamics simulations carried out by the ABC group of laboratories on a set of B-DNA oligomers containing the 136 distinct tetranucleotide base sequences. We demonstrate that the resulting trajectories have extensively sampled the conformational space accessible to B-DNA at room temperature. We confirm that base sequence effects depend strongly not only on the specific base pair step, but also on the specific base pairs that flank each step. Beyond sequence effects on average helical parameters and conformational fluctuations, we also identify tetranucleotide sequences that oscillate between several distinct conformational substates. By analyzing the conformation of the phosphodiester backbones, it is possible to understand for which sequences these substates will arise, and what impact they will have on specific helical parameters.
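The counts quoted in these two abstracts (10 unique base pair steps, 136 distinct tetranucleotides) follow from treating a duplex sequence and its reverse complement as the same object. The sketch below is illustrative only and is not taken from the cited work; the function names are hypothetical, and it assumes the four standard bases with Watson-Crick pairing.

```python
from itertools import product

# Watson-Crick complement table for the four standard DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the sequence of the opposite strand, read 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def unique_duplex_sequences(k):
    """List k-mers that are distinct once a sequence and its
    reverse complement are counted as the same duplex fragment."""
    seen = set()
    for bases in product("ACGT", repeat=k):
        seq = "".join(bases)
        # Keep one canonical representative per strand pair.
        seen.add(min(seq, reverse_complement(seq)))
    return sorted(seen)


steps = unique_duplex_sequences(2)             # dinucleotide (base pair) steps
tetranucleotides = unique_duplex_sequences(4)  # a step plus its two flanking base pairs
print(len(steps), len(tetranucleotides))
```

Each tetranucleotide corresponds to one central base pair step in one nearest-neighbor context, which is why covering all 136 is needed to sample every flanking-sequence effect.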
We have carried out a set of explicit solvent molecular dynamics (MD) simulations on two DNA quadruplex (G-DNA) molecules, namely the antiparallel d(G4T4G4)2 dimeric quadruplex with diagonal loops and the parallel-stranded human telomeric monomolecular quadruplex d[AGGG(TTAGGG)3] with three propeller loops. The main purpose of the paper was to test the capability of the MD simulation technique to describe single-stranded topologies of G-DNA loops, which represent a very challenging task for computational methods. The total length of the conventional and locally enhanced sampling (LES) simulations analyzed in this study exceeds 1.5 μs. We tested several versions of the AMBER force field (parm99, parmbsc0, and a version with modified glycosidic χ torsion profile) and the CHARMM27 force field. Further, we compared minimal salt and excess salt simulations. Postprocessing MM-PBSA (Molecular Mechanics, Poisson-Boltzmann, Surface Area) free energy calculations are also reported. None of the presently available force fields is accurate enough in describing the G-DNA loops. The imbalance is best seen for the propeller loops, as their experimental structure is lost within a few ns of standard simulations with all force fields. Among them, parmbsc0 provides results that are clearly closest to the experimental target values but still not in full agreement. This confirms that the improvement of the γ torsional profile penalizing the γ trans substates in the parmbsc0 parametrization was a step in the right direction, albeit not sufficient to treat all imbalances. The modified χ parametrization appears to rigidify the studied systems but does not change the ultimate outcome of the present simulations. The structures obtained in simulations with the modified χ profile are predetermined by its combination with either parm99 or parmbsc0.
Experimental geometries of diagonal loops of d(G4T4G4)2 are stable in standard simulations on the ∼10 ns time scale but are progressively lost in longer and LES simulations. In addition to the three genuine cation binding sites in the channel of its stem, the d(G4T4G4)2 quadruplex also contains an ion binding site at each stem-loop junction. This arrangement of five cations in the quadruplex core region is entirely unstable in all 24 simulations that we attempted. Overall, our results confirm that G-DNA loops represent one of the most difficult targets for molecular modeling approaches and should be considered as reference structures in any future studies aiming to develop or tune nucleic acids force fields.
Explicit solvent and counterion molecular dynamics simulations have been carried out for a total of >80 ns on the bacterial and spinach chloroplast 5S rRNA Loop E motifs. The Loop E sequences form unique duplex architectures composed of seven consecutive non-Watson-Crick basepairs. The starting structure of spinach chloroplast Loop E was modeled using isostericity principles, and the simulations refined the geometries of the three non-Watson-Crick basepairs that differ from the consensus bacterial sequence. The deep groove of Loop E motifs provides unique sites for cation binding. Binding of Mg²⁺ rigidifies Loop E and stabilizes its major groove at an intermediate width. In the absence of Mg²⁺, the Loop E motifs show an unprecedented degree of inner-shell binding of monovalent cations that, in contrast to Mg²⁺, penetrate into the most negative regions inside the deep groove. The spinach chloroplast Loop E shows a marked tendency to compress its deep groove compared with the bacterial consensus. Structures with a narrow deep groove essentially collapse around a string of Na⁺ cations with long coordination times. The Loop E non-Watson-Crick basepairing is complemented by highly specific hydration sites ranging from water bridges to hydration pockets hosting 2 to 3 long-residing waters. The ordered hydration is intimately connected with RNA local conformational variations.
RNA molecules are now known to be involved in the processing of genetic information at all levels, taking on a wide variety of central roles in the cell. Understanding how RNA molecules carry out their biological functions will require an understanding of structure and dynamics at the atomistic level, which can be significantly improved by combining computational simulation with experiment. This review provides a critical survey of the state of molecular dynamics (MD) simulations of RNA, including a discussion of important current limitations of the technique and examples of its successful application. Several types of simulations are discussed in detail, including those of structured RNA molecules and their interactions with the surrounding solvent and ions, catalytic RNAs, and RNA-small molecule and RNA-protein complexes. Increased cooperation between theorists and experimentalists will allow expanded judicious use of MD simulations to complement conceptually related single molecule experiments. Such cooperation will open the door to a fundamental understanding of the structure-function relationships in diverse and complex RNA molecules.
This review provides a critical assessment of the advantages and limitations of modeling methods available for guanine quadruplex (G-DNA) molecules. We characterize the relations of simulations to the experimental techniques and explain the actual meaning and significance of the results. The following aspects are discussed: the pair-additive approximation of the empirical force fields, sampling limitations stemming from the simulation time, and the accuracy with which force fields describe base stacking, H-bonding, the sugar-phosphate backbone and ions. Several methodological approaches complementing the classical explicit solvent molecular dynamics simulations are commented on, including enhanced sampling methods, continuum solvent methods, free energy calculations and gas phase simulations. The successes and pitfalls of recent simulation studies of G-DNA are demonstrated on selected results, including studies of cation interactions and dynamics of G-DNA stems, studies of base substitutions (inosine, thioguanine and mixed tetrads), analysis of possible kinetic intermediates in the folding pathway of a G-DNA stem and analysis of loop regions of G-DNA molecules.
An extended set of nanosecond-scale molecular dynamics simulations, in explicit solvent, of DNA duplexes interacting with the minor groove binding drug 4',6-diamidino-2-phenylindole (DAPI) is investigated for four different sequence-specific binding modes. Force fields for DAPI have been parametrized to properly reflect its internal nonplanarity. Sequences investigated include the binding modes observed experimentally, that is, AATT in d(CGCGAATTCGCG)2 and ATTG in d(GGCCAATTGG)2, and the alternative shifted binding modes ATTC and AATT, respectively. In each case, stable MD simulations are obtained that reproduce well the specific hydration patterns seen in the experiments. In contrast to the 2.4 Å d(CGCGAATTCGCG)2 crystal structure, the DAPI is nonplanar, consistent with its gas-phase geometry and the higher resolution crystal structure. The simulations also suggest that the DAPI molecule is able to adopt different conformational substates accompanied by specific hydration patterns that include long-residing waters. The MM-PBSA method for estimating relative free energies was utilized. The most consistent free energy results were obtained with an approach that uses a single trajectory of the DNA-DAPI complex to estimate all free energy terms. It is demonstrated that explicit inclusion of a subset of bound water molecules shifts the calculated relative binding free energies in favor of both crystallographically observed binding modes, underlining the importance of structured hydration.
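For orientation, the single-trajectory estimate mentioned in this abstract can be written in the generic textbook form below. This is a sketch of the general MM-PBSA scheme, not the exact protocol of the cited study; the symbols are assumptions introduced here, and the solute entropy term is often omitted when only relative free energies are compared.

```latex
% Free energy of one species X evaluated on a single snapshot:
% molecular mechanics energy + Poisson-Boltzmann (polar) solvation
% + surface-area (nonpolar) solvation - solute entropy contribution.
G_X = E_{\mathrm{MM}}(X) + G_{\mathrm{PB}}(X) + G_{\mathrm{SA}}(X) - T S_{\mathrm{solute}}(X)

% Single-trajectory approach: complex, free DNA and free DAPI are all
% taken from the same snapshots i = 1..N of the DNA-DAPI complex trajectory.
\Delta G_{\mathrm{bind}} \approx \frac{1}{N} \sum_{i=1}^{N}
  \left[ G_{\mathrm{complex}}^{(i)} - G_{\mathrm{DNA}}^{(i)} - G_{\mathrm{DAPI}}^{(i)} \right]
```

Because the unbound species share the geometry they have in the complex, their internal (bonded) energy terms cancel in the difference, which is one common explanation for the lower noise of single-trajectory estimates.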