2021
DOI: 10.1021/acs.chemrev.0c01195
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Learning Methods for Molecular Simulation Data

Abstract: Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of data produced by atomistic and molecular simulations, in material science, solid state physics, biophysics, and biochemistry. In this Review, we provide a comprehensive overview of the methods of unsupervised learning that have been most commonly used to investigate simulation data and indicate likely directions for further developments in the field. In particular, we discuss feature representation … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
244
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 243 publications
(263 citation statements)
references
References 380 publications
(736 reference statements)
0
244
0
Order By: Relevance
“…The conventional machine learning estimators for dimension reduction supported by deeptime are detailed below. For more thorough introductions to available methods and overviews of their relationships, we refer the reader to [11,67,68]. Most of the following methods seek a matrix K ∈ R m×m , a finite-dimensional approximation of a transfer operator that should fulfill…”
Section: Conventional Dimension Reduction and Decompositionmentioning
confidence: 99%
“…The conventional machine learning estimators for dimension reduction supported by deeptime are detailed below. For more thorough introductions to available methods and overviews of their relationships, we refer the reader to [11,67,68]. Most of the following methods seek a matrix K ∈ R m×m , a finite-dimensional approximation of a transfer operator that should fulfill…”
Section: Conventional Dimension Reduction and Decompositionmentioning
confidence: 99%
“…Two main groups of clustering approaches can be distinguished, namely partitional and hierarchical, both of which can be carried out in the bottom-up agglomerative way or using a top-down divisive approach ( Kaufman and Rousseeuw, 1990 ). Another group of data grouping methods are density-based schemes, in which the clusters refer to the peaks of the probability distribution (or free energy minima) from which the data are collected ( Sander, 2011 ; Glielmo et al, 2021 ). In MD simulations, such probability peaks typically correspond to metastable states of the system.…”
Section: Clustering and Reduction Of Data Dimensionalitymentioning
confidence: 99%
“…In the following we apply to the set of inferred interactions {J} two general methods usually applied to the analysis of correlation matrices, i.e. HC and PCA [13,14]. Fig.…”
Section: The Inferred Interactions Networkmentioning
confidence: 99%
“…In order to assess the consistency of the inferred interactions and to apply to them cluster-ization methods, their properties have been contrasted with the ones of RMCs [10][11][12]. Finally, we apply to the inferred interactions set algorithms usually exploited in the analysis of cross-correlations matrices, Hierarchical Clustering (HC) and Principal Component Analysis (PCA) [13,14]. We obtain a division in clusters of different countries based on their interactions profile with all the other countries.…”
Section: Introductionmentioning
confidence: 99%