2016
DOI: 10.1063/1.4967809
|View full text |Cite
|
Sign up to set email alerts
|

Optimized parameter selection reveals trends in Markov state models for protein folding

Abstract: As molecular dynamics simulations access increasingly longer time scales, complementary advances in the analysis of biomolecular time-series data are necessary. Markov state models offer a powerful framework for this analysis by describing a system's states and the transitions between them. A recently established variational theorem for Markov state models now enables modelers to systematically determine the best way to describe a system's dynamics. In the context of the variational theorem, we analyze ultra-l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
119
2

Year Published

2017
2017
2020
2020

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 70 publications
(122 citation statements)
references
References 91 publications
1
119
2
Order By: Relevance
“…69,70,71 The simulation trajectories are clustered into micro-states using a hybrid k-centers k-medoids algorithm 68 . The generalized matrix Rayleigh quotient (GMRQ) method 71,72 is applied to optimize the hyperparameters as described in previous studies 73,74 .…”
Section: Discussionmentioning
confidence: 99%
“…69,70,71 The simulation trajectories are clustered into micro-states using a hybrid k-centers k-medoids algorithm 68 . The generalized matrix Rayleigh quotient (GMRQ) method 71,72 is applied to optimize the hyperparameters as described in previous studies 73,74 .…”
Section: Discussionmentioning
confidence: 99%
“…We conducted approximately 500×1 µs explicit-solvent MD simulations from each seed and accumulated 5 milliseconds of aggregate data in 10 million conformational snapshots for apo-SETD8 (Figures S16, Table S3). To identify functionally relevant conformational states and their transitions, we built MSMs using a pipeline that employs machine learning and extensive hyperparameter optimization to identify slow degrees of freedom and structural and kinetic criteria to cluster conformational snapshots into discrete conformational states (Figures S17-24, Tables S4, S5) 38 . This approach identified 24 kinetically metastable conformations (macrostates) from an optimized, cross-validated set of 100 microstates (Figures 2c, S25-30, Tables S6, S7).…”
Section: Figure 2 Markov State Models and Conformational Landscapes mentioning
confidence: 99%
“…To determine the optimal number of microstates, we again used variational scoring [94][95][96][97] combined with cross-validation 38 to evaluate model quality. The full dataset (4.931 ms, 0.5 ns/frame, 9,862,657 frames) was separately featurized with the top-scoring feature sets: 6,567 distances (featurization a above) and 920 dihedral angles (featurization c above).…”
Section: Final Featurization and Microstate Number Selectionmentioning
confidence: 99%
See 1 more Smart Citation
“…All-atom molecular dynamics (MD) simulations have the potential to offer atomistic-level detail in the dynamics of IDPs; however, modelling the dynamics of a protein using MD simulations requires describing the conformation space of the protein in a way that lends insight into states and processes of interest to the researcher. Significant study has gone into how to partition this phase space in the field of modelling the dynamics of structured proteins (3), and use of methods for featurization and time-lagged dimensionality reduction (6,7) have greatly improved the modelling process (8) for proteins that inhabit a few structured conformations. However, in practice, when we use these same tools to analyze MD simulation data of IDPs, we find that these methods face challenges when used to analyze the conformational landscape of an IDP.…”
mentioning
confidence: 99%