Atomistic insight into the kinetic pathways for Watson–Crick to Hoogsteen transitions in DNA

Vreede, Jocelyne; Ortíz, Alberto Pérez de Alba; Bolhuis, Peter G.; Swenson, David W.

doi:10.1093/nar/gkz837

Cited by 18 publications

(57 citation statements)

References 35 publications

(52 reference statements)


“…dWC and dHG are closely related (0.64 and −0.56) to atan_dWCdHG, which is to be expected given the function’s arguments. Moreover, in ref (56), atan_dWCdHG was successfully used as a reaction coordinate to extract a rate constants of the transition, which confirms its validity as an important CV. For a full overview of all the pairwise correlations, see SI Figure 2.…”

Section: Results (mentioning)

confidence: 62%
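
The statement above relates atan_dWCdHG to its two arguments by pairwise correlation. The following is a minimal sketch of that idea, not the cited authors' code: it assumes the CV is the two-argument arctangent of the two distances (the argument order is an assumption), and the distance values are synthetic and purely illustrative.

```python
import numpy as np

def atan_dwc_dhg(d_wc, d_hg):
    """Angle-like CV from two distances; the arctan2(d_wc, d_hg) form is an assumption."""
    return np.arctan2(d_wc, d_hg)

def pearson(x, y):
    """Pearson correlation coefficient between two 1D arrays."""
    return float(np.corrcoef(x, y)[0, 1])

# Synthetic, illustrative distances (nm) along a hypothetical WC-to-HG path:
# d_wc grows while d_hg shrinks. These are NOT simulation data.
t = np.linspace(0.0, 1.0, 50)
d_wc = 0.3 + 0.3 * t
d_hg = 0.6 - 0.3 * t

cv = atan_dwc_dhg(d_wc, d_hg)
r_wc = pearson(d_wc, cv)  # positive: the CV increases with d_wc
r_hg = pearson(d_hg, cv)  # negative: the CV decreases as d_hg grows
```

On this idealized path the correlations are close to +1 and −1; in real trajectories the two distances fluctuate partly independently, which is consistent with the weaker values (0.64 and −0.56) quoted above.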

“…We start from two 100 ns unbiased pre-equilibrated MD simulations, one sampling the WC state and the other sampling the HG state of the DNA segment. The simulation parameters are the same as in ref (56). From these trajectories, frames with an interval of 10 ps were taken and used as input for the framework.…”

Section: Results (mentioning)

confidence: 99%
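
As a worked check of the numbers in the statement above: 100 ns sampled every 10 ps gives about 10,000 frames per state, or about 20,000 in total. The sketch below makes that arithmetic explicit. The helper names and the 1 ps saving interval in the usage example are hypothetical, and whether the first frame is counted is left open.

```python
import numpy as np

PS_PER_NS = 1000

def n_frames(length_ns, interval_ps):
    """Frames obtained when keeping one frame every interval_ps (endpoint convention ignored)."""
    return (length_ns * PS_PER_NS) // interval_ps

def subsample(frames, saved_every_ps, interval_ps):
    """Keep every k-th saved frame so that kept frames are interval_ps apart."""
    stride = interval_ps // saved_every_ps
    if stride < 1:
        raise ValueError("interval_ps must be at least saved_every_ps")
    return frames[::stride]

per_state = n_frames(100, 10)  # one 100 ns trajectory, 10 ps interval
total = 2 * per_state          # WC trajectory plus HG trajectory

# Hypothetical usage: 100 frames saved every 1 ps, thinned to one per 10 ps.
kept = subsample(np.arange(100), saved_every_ps=1, interval_ps=10)
```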

“…The B-DNA sequence is modeled with the BSC1 force field. 60 Further details regarding the simulation can be obtained from ref (56).…”

Section: Methods (mentioning)

confidence: 99%

“… 59 To observe the effects in the competition, we introduce yet another somewhat redundant CV; tBFs is a simplified version of tBF, which does not use centers of mass and is likely less reliable in the face of local deformations. dWC, dHG, dHB, and atan_dWCdHG were used in ref (56) to perform and analyze WC-to-HG path sampling simulations. tGBs, dCC, dHG, aBPT, and tBP1P2T are based on ref (63), where they were used to identify HG base pairs in a survey of X-ray structures.…”

Section: Methods (mentioning)

confidence: 99%


Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks

Hooft, Ortíz, Ensing (2021), J. Chem. Theory Comput.


With the continual improvement of computing hardware and algorithms, simulations have become a powerful tool for understanding all sorts of (bio)molecular processes. To handle the large simulation data sets and to accelerate slow, activated transitions, a condensed set of descriptors, or collective variables (CVs), is needed to discern the relevant dynamics that describes the molecular process of interest. However, proposing an adequate set of CVs that can capture the intrinsic reaction coordinate of the molecular transition is often extremely difficult. Here, we present a framework to find an optimal set of CVs from a pool of candidates using a combination of artificial neural networks and genetic algorithms. The approach effectively replaces the encoder of an autoencoder network with genes to represent the latent space, i.e., the CVs. Given a selection of CVs as input, the network is trained to recover the atom coordinates underlying the CV values at points along the transition. The network performance is used as an estimator of the fitness of the input CVs. Two genetic algorithms optimize the CV selection and the neural network architecture. The successful retrieval of optimal CVs by this framework is illustrated at the hand of two case studies: the well-known conformational change in the alanine dipeptide molecule and the more intricate transition of a base pair in B-DNA from the classic Watson–Crick pairing to the alternative Hoogsteen pairing. Key advantages of our framework include the following: optimal interpretable CVs, avoiding costly calculation of committor or time-correlation functions, and automatic hyperparameter optimization. In addition, we show that applying a time-delay between the network input and output allows for enhanced selection of slow variables. Moreover, the network can also be used to generate molecular configurations of unexplored microstates, for example, for augmentation of the simulation data.


“…Related to enzymes are molecular configurational changes in proteins, [60,62,132–134] (including protein dissociation [68]), lipids, [135] DNA, [136–141] and even transport across ion channels. [142] Here, the challenge is that paths tend to be more diffusive, and thus longer (order 1-10 ns), and states are more difficult to define.…”

Section: Applications (mentioning)

confidence: 99%

Transition Path Sampling as Markov Chain Monte Carlo of Trajectories: Recent Algorithms, Software, Applications, and Future Outlook

Bolhuis, Swenson (2021), Advcd Theory and Sims


The development of enhanced sampling methods to investigate rare but important events has always been a focal point in the molecular simulation field. Such methods often rely on prior knowledge of the reaction coordinate. However, the search for this reaction coordinate is at the heart of the rare event problem. Transition path sampling (TPS) circumvents this problem by generating an ensemble of dynamical trajectories undergoing the activated event. The reaction coordinate is extracted from the resulting path ensemble using variants of machine learning, making it an output of the method instead of an input. Over the last 20 years, since its inception, many extensions of TPS have been developed. Perhaps surprisingly, large‐scale TPS simulations on complex molecular systems have become possible only recently. Other important developments include the transition interface sampling (TIS) methodology to compute rate constants, the application to multiple states, and adaptive path sampling. The development of OpenPathSampling and PyRETIS has enabled easy and flexible use and implementation of these and other novel path sampling algorithms. In this progress report, a brief overview of recent developments, novel algorithms, and software is given. In addition, several application areas are discussed, and a future outlook for the next decade is given.


Enhanced sampling strategies for molecular simulation of DNA

Mohr, van Heesch, Pérez de Alba Ortíz et al. (2024), WIREs Comput Mol Sci

Self Cite


Molecular dynamics (MD) simulations can provide detailed insights into complex molecular systems, such as DNA, at high resolution in space and time. Using current computer architectures, time scales of tens of microseconds are feasible with contemporary all‐atom force fields. However, these timescales are insufficient to accurately characterize large conformational transitions in DNA and compare calculations to experimental data. This review discusses the advantages and drawbacks of two simulation approaches to overcome the timescale challenge. The first approach is based on adding biasing potentials to the system to drive transitions. Umbrella sampling, steered MD, and metadynamics are examples of these methods. A key challenge of such methods is the necessity of selecting one or a few efficient coordinates, commonly referred to as collective variables (CVs), along which to apply the biasing potential. The path‐metadynamics methodology addresses this issue by finding the optimal route(s) between states in a multi‐dimensional CV space. The second strategy is path sampling, which focuses MD simulations on the transitions. The assumption is that even though transitions between states are rare, they are generally fast. Stopping the simulations as soon as they reach a stable state can significantly increase simulation efficiency. We introduce these methods on the two‐dimensional Müller–Brown potential. DNA applications are featured for two different processes: the Watson–Crick–Franklin to Hoogsteen transition in adenine–thymine base pairs and the binding of a DNA‐binding protein domain to DNA.

This article is categorized under: Molecular and Statistical Mechanics > Molecular Dynamics and Monte‐Carlo Methods; Molecular and Statistical Mechanics > Free Energy Methods; Software > Simulation Methods
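
The abstract above uses the two-dimensional Müller–Brown potential as its introductory test system. For orientation, the sketch below evaluates that potential using the standard parameter set as commonly tabulated; it is an illustration only and is not taken from the review itself. The function name is hypothetical.

```python
import numpy as np

# Standard Müller–Brown parameters (four Gaussian-like terms), as commonly tabulated.
_A = np.array([-200.0, -100.0, -170.0, 15.0])
_a = np.array([-1.0, -1.0, -6.5, 0.7])
_b = np.array([0.0, 0.0, 11.0, 0.6])
_c = np.array([-10.0, -10.0, -6.5, 0.7])
_x0 = np.array([1.0, 0.0, -0.5, -1.0])
_y0 = np.array([0.0, 0.5, 1.5, 1.0])

def muller_brown(x, y):
    """V(x, y) = sum_k A_k exp(a_k dx^2 + b_k dx dy + c_k dy^2), with dx = x - x0_k, dy = y - y0_k."""
    dx = x - _x0
    dy = y - _y0
    return float(np.sum(_A * np.exp(_a * dx**2 + _b * dx * dy + _c * dy**2)))

# Approximate locations of two of the minima (values rounded to three decimals).
v_deep = muller_brown(-0.558, 1.442)    # deepest minimum
v_shallow = muller_brown(0.623, 0.028)  # second minimum
```

The two minima play the role of the stable states; biasing methods push the system along a chosen coordinate between them, while path sampling collects the short trajectories that cross the barrier region.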
