The dynamic behavior of complex systems with many degrees of freedom is often analyzed by projection onto one or a few reaction coordinates. The dynamics is then described in a simple and intuitive way as diffusion on the associated free energy profile. In order to use such a picture for a quantitative description of the dynamics one needs to select the coordinate in an optimal way so as to minimize non-Markovian effects due to the projection. For equilibrium dynamics between two boundary states (e.g., a reaction) the optimal coordinate is known as the committor or the pfold coordinate in protein folding studies. While the dynamics projected on the committor is not Markovian, many important quantities of the original multidimensional dynamics on an arbitrarily complex landscape can be computed exactly. Here we summarize the derivation of this result, discuss different approaches to determine and validate the committor coordinate and present three illustrative applications: protein folding, the game of chess, and patient recovery dynamics after kidney transplant.
State of the art realistic simulations of complex atomic processes commonly produce trajectories of large size, making the development of automated analysis tools very important. A popular approach aimed at extracting dynamical information consists of projecting these trajectories into optimally selected reaction coordinates or collective variables. For equilibrium dynamics between any two boundary states, the committor function also known as the folding probability in protein folding studies is often considered as the optimal coordinate. To determine it, one selects a functional form with many parameters and trains it on the trajectories using various criteria. A major problem with such an approach is that a poor initial choice of the functional form may lead to sub-optimal results. Here, we describe an approach which allows one to optimize the reaction coordinate without selecting its functional form and thus avoiding this source of error.
The rebinding of CO to myoglobin (Mb) from locations around the active site is studied using a combination of molecular dynamics and stochastic simulations for native and L29F mutant Mb. The interaction between the dissociated ligand and the protein environment is described by the recently developed fluctuating three-point charge model for the CO molecule. Umbrella sampling along trajectories, previously found to sample the binding site (B) and the Xe4 pocket, is used to construct free-energy profiles for the ligand escape. On the basis of the Smoluchowski equation, the relaxation of different initial population distributions is followed in space and time. For native Mb at room temperature, the calculated rebinding times are in good agreement with experimental values and give an inner barrier of 4.3 kcal/mol between the docking site B (Mb...CO) and the A state (bound MbCO), compared to an effective barrier, Heff, of 4.5 kcal/mol and barriers into the majority conformation A1 and the minority conformation A3 of 2.4 and 4.3 kcal/mol, respectively. In the case of the L29F mutant, the free-energy surface is flatter and the dynamics is much more rapid. As was found in experiment, escape to the Xe4 pocket is facile for L29F whereas, for native Mb, the barriers to this site are larger. At lower temperatures, the rebinding dynamics is delayed by orders of magnitude also due to increased barriers between the docking sites.
The free-energy landscape can provide a quantitative description of folding dynamics, if determined as a function of an optimally chosen reaction coordinate. Here, we construct the optimal coordinate and the associated free-energy profile for all-helical proteins HP35 and its norleucine (Nle/Nle) double mutant, based on realistic equilibrium folding simulations [Piana et al. Proc. Natl. Acad. Sci. U.S.A.2012, 109, 17845]. From the obtained profiles, we directly determine such basic properties of folding dynamics as the configurations of the minima and transition states (TS), the formation of secondary structure and hydrophobic core during the folding process, the value of the pre-exponential factor and its relation to the transition path times, the relation between the autocorrelation times in TS and minima. We also present an investigation of the accuracy of the pre-exponential factor estimation based on the transition-path times. Four different estimations of the pre-exponential factor for both proteins give k0–1 values of approximately a few tens of nanoseconds. Our analysis gives detailed information about folding of the proteins and can serve as a rigorous common language for extensive comparison between experiment and simulation.
The dynamics of processes relevant to chemistry and biophysics on rough free energy landscapes is investigated using a recently developed algorithm to solve the Smoluchowski equation. Two different processes are considered: ligand rebinding in MbCO and protein folding. For the rebinding dynamics of carbon monoxide (CO) to native myoglobin (Mb) from locations around the active site, the two-dimensional free energy surface (FES) is constructed using extensive molecular dynamics simulations. The surface describes the minima in the A state (bound MbCO), CO in the distal pocket and in the Xe4 pocket, and the transitions between these states and allows to study the diffusion of CO in detail. For the folding dynamics of protein G, a previously determined two-dimensional FES was available. To follow the diffusive dynamics on these rough free energy surfaces, the Smoluchowski equation is solved using the recently developed hierarchical discrete approximation method. From the relaxation of the initial nonequilibrium distribution, experimentally accessible quantities such as the rebinding time for CO or the folding time for protein G can be calculated. It is found that the free energy barrier for CO in the Xe4 pocket and in the distal pocket (B state) closer to the heme iron is approximately 6 kcal/mol which is considerably larger than the inner barrier which separates the bound state and the B state. For the folding of protein G, a barrier of approximately 10 kcal/mol between the unfolded and the folded state is consistent with folding times of the order of milliseconds.
The dynamics of complex systems with many degrees of freedom can be analyzed by projecting it onto one or few coordinates (collective variables). The dynamics is often described then as diffusion on a free energy landscape associated with the coordinates.Fep1d is a script for the analysis of such one-dimensional coordinates. The script allows one to construct conventional and cut-based free energy profiles, to assess the optimality of a reaction coordinate, to inspect whether the dynamics projected on the coordinate is diffusive, to transform (re-scale) the reaction coordinate to more convenient ones, and to compute such quantities as the mean first passage time, the transition path times, the coordinate dependent diffusion coefficient, etc. Here we describe the implemented functionality together with the underlying theoretical framework.
We present an efficient and numerically robust algorithm to follow diffusive processes on rough potential energy surfaces. The hierarchical nature of the algorithm (hierarchical discrete approximation or HDA) fully explores the fine- and coarse-grained structure of the underlying interaction potential. The present approach does not impose any restriction on the topology of the potential. The hierarchical grid allows to capture the roughness of the potential and achieve significant reduction of computational time using fewer grid points compared to other DA methods. HDA is shown to be accurate and efficient by comparing with results from the conventional DA and from the "mean first passage time" (MFPT) method. Using potential-optimized grids HDA monotonically converges to results from an analytical treatment for a very rough interaction potential (107 minima). Contrary to MFPT the solution from HDA is numerically stable. Because of the hierarchical structure of the method HDA can be extended to multidimensional problems.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.