The weighted ensemble (WE) path sampling approach orchestrates an ensemble of parallel calculations with intermittent communication to enhance the sampling of rare events, such as molecular associations or conformational changes in proteins or peptides. Trajectories are replicated and pruned in a way that focuses computational effort on under-explored regions of configuration space while maintaining rigorous kinetics. To enable the simulation of rare events at any scale (e.g. atomistic, cellular), we have developed an open-source, interoperable, and highly scalable software package for the execution and analysis of WE simulations: WESTPA (The Weighted Ensemble Simulation Toolkit with Parallelization and Analysis). WESTPA scales to thousands of CPU cores and includes a suite of analysis tools that have been implemented in a massively parallel fashion. The software has been designed to interface conveniently with any dynamics engine and has already been used with a variety of molecular dynamics (e.g. GROMACS, NAMD, OpenMM, AMBER) and cell-modeling packages (e.g. BioNetGen, MCell). WESTPA has been in production use for over a year, and its utility has been demonstrated for a broad set of problems, ranging from atomically detailed host-guest associations to non-spatial chemical kinetics of cellular signaling networks. The following describes the design and features of WESTPA, including the facilities it provides for running WE simulations, storing and analyzing WE simulation data, as well as examples of input and output.
Equilibrium formally can be represented as an ensemble of uncoupled systems undergoing unbiased dynamics in which detailed balance is maintained. Many nonequilibrium processes can be described by suitable subsets of the equilibrium ensemble. Here, we employ the “weighted ensemble” (WE) simulation protocol [Huber and Kim, Biophys. J.1996, 70, 97–110] to generate equilibrium trajectory ensembles and extract nonequilibrium subsets for computing kinetic quantities. States do not need to be chosen in advance. The procedure formally allows estimation of kinetic rates between arbitrary states chosen after the simulation, along with their equilibrium populations. We also describe a related history-dependent matrix procedure for estimating equilibrium and nonequilibrium observables when phase space has been divided into arbitrary non-Markovian regions, whether in WE or ordinary simulation. In this proof-of-principle study, these methods are successfully applied and validated on two molecular systems: explicitly solvated methane association and the implicitly solvated Ala4 peptide. We comment on challenges remaining in WE calculations.
Molecular dynamics simulations were used to examine the structural dynamics of two fluorescent probes attached to a typical protein, hen egg-white lysozyme (HEWL). The donor probe (D) was attached via a succinimide group, consistent with the commonly-used maleimide conjugation chemistry, and the acceptor probe (A) was bound into the protein as occurs naturally for HEWL and the dye Eosin Y. The
The characterization of protein binding processes — with all of the key conformational changes — has been a grand challenge in the field of biophysics. Here, we have used the weighted ensemble path sampling strategy to orchestrate molecular dynamics simulations, yielding atomistic views of protein–peptide binding pathways involving the MDM2 oncoprotein and an intrinsically disordered p53 peptide. A total of 182 independent, continuous binding pathways were generated, yielding a kon that is in good agreement with experiment. These pathways were generated in 15 days using 3500 cores of a supercomputer, substantially faster than would be possible with “brute force” simulations. Many of these pathways involve the anchoring of p53 residue F19 into the MDM2 binding cleft when forming the metastable encounter complex, indicating that F19 may be a kinetically important residue. Our study demonstrates that it is now practical to generate pathways and calculate rate constants for protein binding processes using atomistic simulation on typical computing resources.
Sodium coupled cotransporters of the five-helix inverted repeat (5HIR) superfamily use an alternating access mechanism to transport a myriad of small molecules across the cell membrane. One of the primary steps in this mechanism is the conformational transition from a state poised to bind extracellular substrates to a state that is competent to deliver substrate to the cytoplasm. Here, we construct a coarse-grained model of the 5HIR benzylhydantoin transporter Mhp1 that incorporates experimental structures of the outward- and inward-open states to investigate the mechanism of this conformational change. Using the weighted ensemble path-sampling method, we rigorously sample the outward- to inward-facing transition path ensemble. The transition path ensemble reveals a heterogeneous set of pathways connecting the two states and identifies two modes of transport: one consistent with a strict alternating access mechanism and another where decoupling of the inner and outer gates causes the transient formation of a continuous permeation pathway through the transporter. We also show that the conformational switch between the outward- and inward-open states results from rigid body motions of the hash motif relative to the substrate bundle, supporting the rocking bundle hypothesis. Finally, our methodology provides the groundwork for more chemically detailed investigations of the alternating mechanism.
Atomically detailed views of molecular recognition events are of great interest to a variety of research areas in biology and chemistry. Here, we apply the weighted ensemble path sampling approach to improve the efficiency of explicit solvent molecular dynamics (MD) simulations in sampling molecular association events between two methane molecules, Na(+) and Cl(-) ions, methane and benzene, and the K(+) ion and 18-crown-6 ether. Relative to brute force simulation, we obtain efficiency gains of at least 300 and 1100-fold for the most challenging system, K(+)/18-crown-6 ether, in terms of sampling the association rate constant k and distribution of times required to traverse transition paths, respectively. Our results indicate that weighted ensemble sampling is likely to allow for even greater efficiencies for more complex systems with higher barriers to molecular association.
We have explored the impact of a number of basic simulation parameters on the results of a recently developed hybrid molecular dynamics-quantum mechanics (MD-QM) method (Mercer et al., J Phys Chem B 1999, 103, 7720). The method utilizes MD simulations to explore the ground-state configuration space of the system and QM evaluation of those structures to yield the time-dependent electronic transition energy, which is transformed into the optical line-broadening function using the second-order cumulant expansion. Both linear and nonlinear optical spectra can then be generated for comparison to experiment. The dependence of the resulting spectra on the length of the MD trajectory, the QM sampling rate, and the QM model chemistry have all been examined. In particular, for the system of oxazine-4 in methanol studied here, at least 20 ps of MD trajectory are needed for qualitative convergence of linear spectral properties, and >100 ps is needed for quantitative convergence. Surprisingly, little difference is found between the 3-21G and 6-31G(d) basis sets, and the CIS and TD-B3LYP methods yield remarkably similar spectra. The semiempirical INDO/s method yields the most accurate results, reproducing the experimental Stokes shift to within 5% and the FWHM to within 20%. Nonlinear 3-pulse photon echo peak shift (3PEPS) decays have also been simulated. Decays are generally poorly reproduced, though the initial peak shift which depends on the overall coupling of motions to the solute transition energy is within 15% of experiment for all model chemistries other than those using the STO-3G basis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.