2022
DOI: 10.48550/arxiv.2205.08437
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Information-theoretical measures identify accurate low-resolution representations of protein configurational space

Abstract: A steadily growing computational power is employed to perform molecular dynamics simulations of biological macromolecules, which represents at the same time an immense opportunity and a formidable challenge. In fact, large amounts of data are produced, from which useful, synthetic, and intelligible information has to be extracted to make the crucial step from knowing to understanding. Here we tackled the problem of coarsening the conformational space sampled by proteins in the course of molecular dynamics simu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 43 publications
(51 reference statements)
0
2
0
Order By: Relevance
“…Distance in dynamics After all the pairwise alignments between the elements of the dataset are performed, a distance matrix that expresses differences in the large-scale dynamics is obtained (Figure 1c); then, the dataset undergoes hierarchical clustering [52] based on this distance matrix, in order to identify groups of dynamics-related proteins (Figure 1d). The optimal number of clusters is identified from the interplay between resolution and relevance [53][54][55][56][57]. These two quantities are entropies that are related to each other and depend on the clusterization procedure adopted.…”
Section: Dynamics-based Alignmentmentioning
confidence: 99%
“…Distance in dynamics After all the pairwise alignments between the elements of the dataset are performed, a distance matrix that expresses differences in the large-scale dynamics is obtained (Figure 1c); then, the dataset undergoes hierarchical clustering [52] based on this distance matrix, in order to identify groups of dynamics-related proteins (Figure 1d). The optimal number of clusters is identified from the interplay between resolution and relevance [53][54][55][56][57]. These two quantities are entropies that are related to each other and depend on the clusterization procedure adopted.…”
Section: Dynamics-based Alignmentmentioning
confidence: 99%
“…After all the pairwise alignments between the elements of the dataset are performed, a distance matrix that expresses differences in the large-scale dynamics is obtained; then the dataset undergoes hierarchical clustering [46] based on this distance matrix, in order to identify groups of dynamics-related proteins. The optimal number of clusters is identified from the interplay between resolution and relevance [47][48][49][50][51]. These two quantities, which are defined in more detail in the Methods section, are entropies that are related to each other and depend on the clusterization procedure adopted.…”
Section: Dynamics-based Alignmentmentioning
confidence: 99%