MP2 and CCSD(T) complete basis set (CBS) limit interaction energies and geometries for more than 100 DNA base pairs, amino acid pairs and model complexes are for the first time presented together. Extrapolation to the CBS limit is done by using two-point extrapolation methods and different basis sets (aug-cc-pVDZ - aug-cc-pVTZ, aug-cc-pVTZ - aug-cc-pVQZ, cc-pVTZ - cc-pVQZ) are utilized. The CCSD(T) correction term, determined as a difference between CCSD(T) and MP2 interaction energies, is evaluated with smaller basis sets (6-31G** and cc-pVDZ). Two sets of complex geometries were used, optimized or experimental ones. The JSCH-2005 benchmark set, which is now available to the chemical community, can be used for testing lower-level computational methods. For the first screening the smaller training set (S22) containing 22 model complexes can be recommended. In this case larger basis sets were used for extrapolation to the CBS limit and also CCSD(T) and counterpoise-corrected MP2 optimized geometries were sometimes adopted.
We present here the parmbsc0 force field, a refinement of the AMBER parm99 force field, where emphasis has been made on the correct representation of the alpha/gamma concerted rotation in nucleic acids (NAs). The modified force field corrects overpopulations of the alpha/gamma = (g+,t) backbone that were seen in long (more than 10 ns) simulations with previous AMBER parameter sets (parm94-99). The force field has been derived by fitting to high-level quantum mechanical data and verified by comparison with very high-level quantum mechanical calculations and by a very extensive comparison between simulations and experimental data. The set of validation simulations includes two of the longest trajectories published to date for the DNA duplex (200 ns each) and the largest variety of NA structures studied to date (15 different NA families and 97 individual structures). The total simulation time used to validate the force field includes near 1 mus of state-of-the-art molecular dynamics simulations in aqueous solution.
We report a reparameterization of the glycosidic torsion χ of the Cornell et al. AMBER force field for RNA, χOL. The parameters remove destabilization of the anti region found in the ff99 force field and thus prevent formation of spurious ladder-like structural distortions in RNA simulations. They also improve the description of the syn region and the syn–anti balance as well as enhance MD simulations of various RNA structures. Although χOL can be combined with both ff99 and ff99bsc0, we recommend the latter. We do not recommend using χOL for B-DNA because it does not improve upon ff99bsc0 for canonical structures. However, it might be useful in simulations of DNA molecules containing syn nucleotides. Our parametrization is based on high-level QM calculations and differs from conventional parametrization approaches in that it incorporates some previously neglected solvation-related effects (which appear to be essential for obtaining correct anti/high-anti balance). Our χOL force field is compared with several previous glycosidic torsion parametrizations.
ContentsI. Introduction 3247 II. Calculations 3249 II.1. Interaction Energy 3249 II.2. Correlation Energy 3250 II.3. Density Functional Theory 3251 II.4. Vibrational Frequencies 3251 II.5. Gas-Phase Thermodynamic Characteristics of Nucleic Acid (NA) Base Pairs 3252 III. Structure and Properties of Isolated NA Bases 3252 III.1. Tautomeric Equilibria of Bases 3252 III.2. Protonization Energies of Bases 3253 III.3. Electrostatic Potential around NA Bases 3254 III.4. Polarizability 3255 III.5. Gas-Phase Electron Affinities (EA) and Ionization Potentials (IP) 3255 III.6. Nonplanarity of DNA Bases 3255 IV. Structure, Energetics, and Properties of NA Base Pairs 3257 IV.1. H-Bonded NA Base Pairs 3257 IV.2. Stacked NA Base Pairs 3262 IV.3. Reliability of MP2 Calculations 3267 IV.4. Application of the DFT Method 3269 IV.5. Performance of Empirical Potentials 3269 IV.6. Thermodynamic Characteristics 3270 V. Conclusions 3272 VI. Acknowledgments 3273 VII. References 3273
With both catalytic and genetic functions, ribonucleic acid (RNA) is perhaps the most pluripotent chemical species in molecular biology, and its functions are intimately linked to its structure and dynamics. Computer simulations, and in particular atomistic molecular dynamics (MD), allow structural dynamics of biomolecular systems to be investigated with unprecedented temporal and spatial resolution. We here provide a comprehensive overview of the fast-developing field of MD simulations of RNA molecules. We begin with an in-depth, evaluatory coverage of the most fundamental methodological challenges that set the basis for the future development of the field, in particular, the current developments and inherent physical limitations of the atomistic force fields and the recent advances in a broad spectrum of enhanced sampling methods. We also survey the closely related field of coarse-grained modeling of RNA systems. After dealing with the methodological aspects, we provide an exhaustive overview of the available RNA simulation literature, ranging from studies of the smallest RNA oligonucleotides to investigations of the entire ribosome. Our review encompasses tetranucleotides, tetraloops, a number of small RNA motifs, A-helix RNA, kissing-loop complexes, the TAR RNA element, the decoding center and other important regions of the ribosome, as well as assorted others systems. Extended sections are devoted to RNA–ion interactions, ribozymes, riboswitches, and protein/RNA complexes. Our overview is written for as broad of an audience as possible, aiming to provide a much-needed interdisciplinary bridge between computation and experiment, together with a perspective on the future of the field.
Hydrogen-bonded nucleic acids base pairs substantially contribute to the structure and stability of nucleic acids. The study presents reference ab initio structures and interaction energies of selected base pairs with binding energies ranging from -5 to -47 kcal/mol. The molecular structures are obtained using the RI-MP2 (resolution of identity MP2) method with extended cc-pVTZ basis set of atomic orbitals. The RI-MP2 method provides results essentially identical with the standard MP2 method. The interaction energies are calculated using the Complete Basis Set (CBS) extrapolation at the RI-MP2 level. For some base pairs, Coupled-Cluster corrections with inclusion of noniterative triple contributions (CCSD(T)) are given. The calculations are compared with selected medium quality methods. The PW91 DFT functional with the 6-31G basis set matches well the RI-MP2/CBS absolute interaction energies and reproduces the relative values of base pairing energies with a maximum relative error of 2.6 kcal/mol when applied with Becke3LYP-optimized geometries. The Becke3LYP DFT functional underestimates the interaction energies by few kcal/mol with relative error of 2.2 kcal/mol. Very good performance of nonpolarizable Cornell et al. force field is confirmed and this indirectly supports the view that H-bonded base pairs are primarily stabilized by electrostatic interactions.
Z-DNA duplexes are a particularly complicated test case for current force fields. We performed a set of explicit solvent molecular dynamics (MD) simulations with various AMBER force field parametrizations including our recent refinements of the ε/ζ and glycosidic torsions. None of these force fields described the ZI/ZII and other backbone substates correctly, and all of them underpredicted the population of the important ZI substate. We show that this underprediction can be attributed to an inaccurate potential for the sugar-phosphate backbone torsion angle β. We suggest a refinement of this potential, β(OL1), which was derived using our recently introduced methodology that includes conformation-dependent solvation effects. The new potential significantly increases the stability of the dominant ZI backbone substate and improves the overall description of the Z-DNA backbone. It also has a positive (albeit small) impact on another important DNA form, the antiparallel guanine quadruplex (G-DNA), and improves the description of the canonical B-DNA backbone by increasing the population of BII backbone substates, providing a better agreement with experiment. We recommend using β(OL1) in combination with our previously introduced corrections, εζ(OL1) and χ(OL4), (the combination being named OL15) as a possible alternative to the current β torsion potential for more accurate modeling of nucleic acids.
Ab initio (MP2/6-31G*(0.25)) interaction energies were calculated for almost 240 geometries of 10 stacked nucleic acid−base pairs: A···A, C···C, G···G, U···U, A···C, G···A, A···U, G···C, C···U, and G···U; in some cases uracil was replaced by thymine. The most stable stacked pair is the G···G dimer (−11.3 kcal/mol), and the least stable is the uracil dimer (−6.5 kcal/mol). The interaction energies of H-bonded base pairs range from −25.8 kcal/mol (G···C) to −10.6 kcal/mol (T···T). The stability of stacked pairs originates in the electron correlation, while all the H-bonded pairs are dominated by the HF energy. The mutual orientation of the stacked bases is, however, primarily determined by the HF interaction energy. The ab initio base stacking energies are well reproduced by the empirical potential calculations, provided the atomic charges are derived by the same method as used in the ab initio calculations. Some contributions previously postulated to significantly influence base stacking (induction interactions, π−π interactions) were not found. Base stacking was also investigated in six B-DNA and two Z-DNA base pair steps; their geometries were taken from the oligonucleotide crystal data. The many-body correction was estimated at the HF/MINI-1 level. The sequence-dependent variations of the total base pair step stacking energies range from −9.9 to −14.7 kcal/mol. The range of the calculated many-body corrections to the stacking energy is 2 kcal/mol. The ab initio calculations exclude the consideration that the unusual conformational properties of the CpA(TpG) steps might be associated with attractive induction interactions of the exocyclic groups of DNA bases and the aromatic rings of bases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.