Detecting intrinsic slow variables in stochastic dynamical systems by anisotropic diffusion maps

Singer, Amit; Erban, Radek; Kevrekidis, Ioannis G.; Coifman, Ronald R.

doi:10.1073/pnas.0905547106

Cited by 143 publications

(184 citation statements)

References 23 publications

Supporting

Mentioning

183

Contrasting


“…[37,38,51-56] Geometrically, this can be understood as the emergence of a small number of collective variables governing the long-time evolution of the system to which the remaining degrees of freedom are slaved. [37,38,57-59] We and others have previously demonstrated that these collective variables can be determined from molecular simulation trajectories using nonlinear manifold learning, [37-39,51,60-73] the particular variant of which we use here are diffusion maps. [60,65-68] In a nutshell, the diffusion map approach constructs a random walk over the high-dimensional simulation trajectory with hopping probabilities based on the structural similarity of the constituent snapshots, then performs a spectral analysis of the harmonics of the resultant discrete Markov process to ascertain the effective dimensionality of the underlying "intrinsic manifold" and nonlinear collective variables with which to parameterize it.…”

Section: Diffusion Maps Manifold Learning

mentioning

confidence: 99%
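The procedure summarized in the statement above (similarity-based hopping probabilities, then a spectral analysis of the resulting Markov process) can be sketched in a few lines. This is a minimal illustration only, assuming a plain Gaussian kernel on Euclidean distances with a hand-picked bandwidth; the function name `diffusion_map` and the parameters `epsilon` and `n_coords` are hypothetical, and the citing paper's actual similarity measure and normalization may differ.

```python
import numpy as np


def diffusion_map(X, epsilon, n_coords=2):
    """Basic diffusion map sketch (assumed Gaussian kernel, no density correction).

    X        : (n_samples, n_features) array of snapshots
    epsilon  : kernel bandwidth (an assumption; it must be tuned per data set)
    n_coords : number of nontrivial diffusion coordinates to return
    Returns (leading eigenvalues including the trivial one, coordinates).
    """
    # Pairwise squared Euclidean distances between snapshots.
    sq = np.sum(X**2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)

    # Similarity between snapshots: close pairs get weights near 1.
    K = np.exp(-d2 / (2.0 * epsilon**2))

    # Row-normalizing K would give the hopping probabilities M = D^{-1} K.
    # We diagonalize the symmetric conjugate S = D^{-1/2} K D^{-1/2} instead,
    # which has the same eigenvalues and is numerically better behaved.
    deg = K.sum(axis=1)
    S = K / np.sqrt(np.outer(deg, deg))
    evals, evecs = np.linalg.eigh(S)

    # Sort from largest to smallest eigenvalue.
    order = np.argsort(evals)[::-1]
    evals, evecs = evals[order], evecs[:, order]

    # Convert back to right eigenvectors of M and scale the trivial one to 1.
    psi = evecs / np.sqrt(deg)[:, None]
    psi = psi / psi[0, 0]

    # Skip the trivial constant eigenvector (eigenvalue 1).
    return evals[: n_coords + 1], psi[:, 1 : n_coords + 1]


if __name__ == "__main__":
    # Toy data: a straight segment embedded in three dimensions.
    t = np.linspace(0.0, 1.0, 200)
    X = np.outer(t, np.array([1.0, 2.0, 2.0]) / 3.0)
    evals, coords = diffusion_map(X, epsilon=0.1)
    print("leading eigenvalues:", evals)
```

A gap in the eigenvalue spectrum after the first few nontrivial eigenvalues is what suggests the effective dimensionality; the corresponding eigenvectors serve as the collective variables.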

A Study of the Morphology, Dynamics, and Folding Pathways of Ring Polymers with Supramolecular Topological Constraints Using Molecular Simulation and Nonlinear Manifold Learning

Wang

Ferguson

2018

Macromolecules


Ring polymers are prevalent in natural and engineered systems, including circular bacterial DNA, crown ethers for cation chelation, and mechanical nanoswitches. The morphology and dynamics of ring polymers are governed by the chemistry and degree of polymerization of the ring, and intramolecular and supramolecular topological constraints such as knots or mechanically-interlocked rings. In this study, we perform molecular dynamics simulations of polyethylene ring polymers at two different degrees of polymerization and in different topological states, including a trefoil knot, catenane state (two interlocked rings), and Borromean state (three interlocked rings). We employ nonlinear manifold learning to extract the low-dimensional free energy surface to which the structure and dynamics of the polymer chain are effectively restrained. The free energy surfaces reveal how degree of polymerization and topological constraints affect the thermally accessible conformations, chiral symmetry breaking, and folding and collapse pathways of the rings, and present a means to rationally engineer ring size and topology to stabilize particular conformational states and folding pathways. We compute the rotational diffusion of the ring in these various states as a crucial property required for the design of engineered devices containing ring polymer components.


“…Isomap (5,22) and LLE (23) have been successfully applied to peptide systems, and, although diffusion maps have been used to study phenomena as diverse as chemical reaction networks (24) and defect mobility at an interface (25), they have not been previously applied to systems of biophysical significance.…”

mentioning

confidence: 99%

“…In the case of C8 we find the dynamics to be governed by torsional motions. For C16 and C24 we extract three global order parameters with which we characterize the fundamental dynamics, and determine that the low free-energy pathway of globular collapse proceeds by a "kink and slide" mechanism, whereby a bend near the end of the linear chain migrates toward the middle to form a hairpin and, ultimately, a coiled helix. The low-dimensional representation is subtly perturbed in the solvated phase relative to the ideal gas, and its geometric structure is conserved between C16 and C24.…”

mentioning

confidence: 99%

Systematic determination of order parameters for chain dynamics using diffusion maps

Ferguson

Panagiotopoulos

Debenedetti

et al. 2010

Proc. Natl. Acad. Sci. U.S.A.

Self Cite


We employ the diffusion map approach as a nonlinear dimensionality reduction technique to extract a dynamically relevant, low-dimensional description of n-alkane chains in the ideal-gas phase and in aqueous solution. In the case of C8 we find the dynamics to be governed by torsional motions. For C16 and C24 we extract three global order parameters with which we characterize the fundamental dynamics, and determine that the low free-energy pathway of globular collapse proceeds by a "kink and slide" mechanism, whereby a bend near the end of the linear chain migrates toward the middle to form a hairpin and, ultimately, a coiled helix. The low-dimensional representation is subtly perturbed in the solvated phase relative to the ideal gas, and its geometric structure is conserved between C16 and C24. The methodology is directly extensible to biomolecular self-assembly processes, such as protein folding.

It has long been suspected that cooperative couplings between degrees of freedom render the effective dimensionality of biophysical systems far less than the 3R-dimensional coordinate space of the R constituent atoms (1-5). This has been framed in the projection operator formalism (6) as a separation of time scales in which the important dynamics reside in a "slow subspace" (7) and is associated with a smooth underlying free energy surface (8). For example, two-dimensional descriptions have been formulated for dialanine (9) and a coarse-grained model of the src homology 3 domain (5).

Calculation of the effective dimensionality of a dynamical system, and identification of order parameters describing the low-dimensional "intrinsic manifold" to which the system dynamics are effectively restrained, is a long-standing problem in as seemingly disparate fields as data visualization (10), speech recognition (11), semisupervised learning (12), and spectral clustering (13).

The fraction of native contacts (Q) (8, 14) and the folding probability (P_fold) (8, 15) have been used as reaction coordinates for protein folding, but such coarse variables may lump together structurally and kinetically disparate conformations and can prove inadequate for larger proteins with frustrated folding funnels (5, 8). Empirical order parameters also tend to perform poorly on landscapes exhibiting multiple local free-energy (FE) minima or lacking well-defined unfolded and folded basins. Principal components analysis (PCA) is a popular linear dimensionality reduction technique applied extensively to biophysical systems (1-4, 16) which seeks to describe the "essential subspace" (2) of the dynamics by a set of orthogonal vectors oriented along the directions of largest variance in the data. For the highly nonlinear intrinsic manifolds one expects for complex molecular systems (5), the linearity of this technique renders it appropriate in local regions, but results in a poor characterization of the global features (5, 17). This deficiency leads to poor PCA estimates of the effective dimensionality (17) far in excess of the dimensionality of the phas...


“…(d) the vectors of (resVar_l)_l for Q^10(1000, 0), S^9(1000, 0), Q^10(1000, 0.1), S^9(1000, 0.1): it seems hard to see a difference between the intrinsic dimensions 10 and 9, in both the noiseless and noisy cases. (e) The dimension of S^9(2000, 0.01) in R^100 estimated according to the heuristic in [74] yields the wrong dimension (∼ 8) even for small amounts of noise; this is of course not a rigorous test, and it is a heuristic procedure, not an algorithm, as described in [74].…”

Section: Kernel Methods

mentioning

confidence: 99%

“…We expect similar phenomena to be common to other manifold learning algorithms, and leave a complete investigation to future work. In [74] it is suggested that diffusion maps [7] may be used in order to estimate intrinsic dimension as well as a scale parameter, for example in the context of dynamical systems where a small number of slow variables are present. Rather than an automatic algorithm for dimension estimation, [74] suggests a criterion that involves eyeballing the function $\sum_{i,j} e^{-\|x_i - x_j\|^2/\epsilon^2}$, as a function of $\epsilon$, to find a region of linear growth, whose slope is an estimate of the intrinsic dimension.…”

Section: Kernel Methods

mentioning

confidence: 99%
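The criterion described in the statement above can be illustrated numerically. The sketch below evaluates the kernel sum over a range of bandwidths and computes its slope on a log-log scale; it assumes that "linear growth" refers to the log-log plot, and it replaces the visual inspection with a crude "take the largest slope" rule, which is an illustrative shortcut and not part of the cited heuristic. The function names `kernel_sum_curve` and `loglog_slopes` are hypothetical.

```python
import numpy as np


def kernel_sum_curve(X, epsilons):
    """Evaluate L(eps) = sum_{i,j} exp(-||x_i - x_j||^2 / eps^2) for each eps."""
    sq = np.sum(X**2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    return np.array([np.exp(-d2 / eps**2).sum() for eps in epsilons])


def loglog_slopes(epsilons, L):
    """Finite-difference slope of log L against log eps."""
    return np.gradient(np.log(L), np.log(epsilons))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy data: a flat 2-D square embedded in 10 ambient dimensions.
    X = np.zeros((1000, 10))
    X[:, :2] = rng.uniform(0.0, 1.0, size=(1000, 2))

    epsilons = np.logspace(-3, 1, 40)
    slopes = loglog_slopes(epsilons, kernel_sum_curve(X, epsilons))
    # Crude stand-in for eyeballing the linear region: take the largest slope.
    print("largest log-log slope:", slopes.max())
```

The slope flattens to zero at both ends (only the diagonal terms survive for very small bandwidths, and every term saturates at 1 for very large ones), so the informative region lies in between. Sampling noise and boundary effects pull the slope below the true dimension, which is consistent with the citing authors' caution that this is a heuristic and not an algorithm.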

Multiscale geometric methods for data sets I: Multiscale SVD, noise and curvature

Little

Maggioni

Rosasco

2017

Applied and Computational Harmonic Analysis


Large data sets are often modeled as being noisy samples from probability distributions µ in R^D, with D large. It has been noticed that oftentimes the support M of these probability distributions seems to be well-approximated by low-dimensional sets, perhaps even by manifolds. We shall consider sets that are locally well approximated by k-dimensional planes, with k ≪ D, with k-dimensional manifolds isometrically embedded in R^D being a special case. Samples from µ are furthermore corrupted by D-dimensional noise. Certain tools from multiscale geometric measure theory and harmonic analysis seem well-suited to be adapted to the study of samples from such probability distributions, in order to yield quantitative geometric information about them. In this paper we introduce and study multiscale covariance matrices, i.e. covariances corresponding to the distribution restricted to a ball of radius r, with a fixed center and varying r, and under rather general geometric assumptions we study how their empirical, noisy counterparts behave. We prove that in the range of scales where these covariance matrices are most informative, the empirical, noisy covariances are close to their expected, noiseless counterparts. In fact, this is true as soon as the number of samples in the balls where the covariance matrices are computed is linear in the intrinsic dimension of M. As an application, we present an algorithm for estimating the intrinsic dimension of M.
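The multiscale covariance matrices mentioned in this abstract can be illustrated with a short sketch: restrict the samples to a ball of radius r around a fixed center, compute the empirical covariance, and track its singular values as r varies. This is only an illustration of that one idea, not the paper's dimension-estimation algorithm; the function name `multiscale_singular_values` and the toy data are assumptions.

```python
import numpy as np


def multiscale_singular_values(X, center, radii):
    """Square roots of the local covariance eigenvalues at each radius.

    Returns an array of shape (len(radii), n_features); rows are NaN when
    the ball contains fewer than two samples.
    """
    dist = np.linalg.norm(X - center, axis=1)
    n_features = X.shape[1]
    out = np.full((len(radii), n_features), np.nan)
    for i, r in enumerate(radii):
        pts = X[dist <= r]
        if len(pts) < 2:
            continue  # not enough samples to form a covariance
        cov = np.cov(pts, rowvar=False)
        evals = np.linalg.eigvalsh(cov)[::-1]  # largest first
        out[i] = np.sqrt(np.clip(evals, 0.0, None))
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Toy data: a 2-D patch in 5 ambient dimensions plus isotropic noise.
    X = np.zeros((2000, 5))
    X[:, :2] = rng.uniform(-1.0, 1.0, size=(2000, 2))
    X += 0.01 * rng.standard_normal(X.shape)

    radii = np.array([0.1, 0.2, 0.3, 0.4, 0.5])
    sv = multiscale_singular_values(X, center=np.zeros(5), radii=radii)
    for r, row in zip(radii, sv):
        print(f"r = {r:.1f}:", np.round(row, 3))
```

In this toy setting the two leading values grow with r while the remaining three stay near the noise level, so the number of values that scale with r matches the dimension of the patch.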

