Information-theoretical measures identify accurate low-resolution representations of protein configurational space

Mele, Margherita; Covino, Roberto; Potestio, Raffaello

doi:10.48550/arxiv.2205.08437

Cited by 2 publications

(2 citation statements)

References 43 publications

(51 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Distance in dynamics After all the pairwise alignments between the elements of the dataset are performed, a distance matrix that expresses differences in the large-scale dynamics is obtained (Figure 1c); then, the dataset undergoes hierarchical clustering [52] based on this distance matrix, in order to identify groups of dynamics-related proteins (Figure 1d). The optimal number of clusters is identified from the interplay between resolution and relevance [53][54][55][56][57]. These two quantities are entropies that are related to each other and depend on the clusterization procedure adopted.…”

Section: Dynamics-based Alignmentmentioning

confidence: 99%

In Search of a Dynamical Vocabulary: A Pipeline to Construct a Basis of Shared Traits in Large-Scale Motions of Proteins

et al. 2022

Self Cite

View full text Add to dashboard Cite

The paradigmatic sequence–structure–dynamics–function relation in proteins is currently well established in the scientific community; in particular, a large effort has been made to probe the first connection, indeed providing convincing evidence of its strength and rationalizing it in a quantitative and general framework. In contrast, however, the role of dynamics as a link between structure and function has eluded a similarly clear-cut verification and description. In this work, we propose a pipeline aimed at building a basis for the quantitative characterization of the large-scale dynamics of a set of proteins, starting from the sole knowledge of their native structures. The method hinges on a dynamics-based clusterization, which allows a straightforward comparison with structural and functional protein classifications. The resulting basis set, obtained through the application to a group of related proteins, is shown to reproduce the salient large-scale dynamical features of the dataset. Most interestingly, the basis set is shown to encode the fluctuation patterns of homologous proteins not belonging to the initial dataset, thus highlighting the general applicability of the pipeline used to build it.

show abstract

Section: Dynamics-based Alignmentmentioning

confidence: 99%

In Search of a Dynamical Vocabulary: A Pipeline to Construct a Basis of Shared Traits in Large-Scale Motions of Proteins

et al. 2022

Self Cite

View full text Add to dashboard Cite

show abstract

“…After all the pairwise alignments between the elements of the dataset are performed, a distance matrix that expresses differences in the large-scale dynamics is obtained; then the dataset undergoes hierarchical clustering [46] based on this distance matrix, in order to identify groups of dynamics-related proteins. The optimal number of clusters is identified from the interplay between resolution and relevance [47][48][49][50][51]. These two quantities, which are defined in more detail in the Methods section, are entropies that are related to each other and depend on the clusterization procedure adopted.…”

Section: Dynamics-based Alignmentmentioning

confidence: 99%

In search of a dynamical vocabulary: a pipeline to construct a basis of shared traits in large-scale motions of proteins

Tarenzi

Mattiotti

Rigoli

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

The paradigmatic sequence-structure-dynamics-function relation in proteins is nowadays well established in the scientific community; in particular, large effort has been spent to probe the first connection, indeed providing convincing evidence of its strength and rationalising it in a quantitative and general framework. In contrast, however, the role of dynamics as a link between structure and function has eluded a similarly clear-cut verification and description. In this work, we propose a pipeline aimed at building a basis for the quantitative characterisation of the large-scale dynamics of a set of proteins, starting from the sole knowledge of their native structures. The method hinges on a dynamics-based clusterization, which allows a straightforward comparison with structural and functional protein classifications. The resulting basis set, obtained through the application to a group of related proteins, is shown to reproduce the salient large-scale dynamical features of the dataset. Most interestingly, the basis set is shown to encode the fluctuation patterns of homologous proteins not belonging to the initial dataset, thus highlighting the general applicability of the pipeline used to build it.

show abstract

Information-theoretical measures identify accurate low-resolution representations of protein configurational space

Cited by 2 publications

References 43 publications

In Search of a Dynamical Vocabulary: A Pipeline to Construct a Basis of Shared Traits in Large-Scale Motions of Proteins

In Search of a Dynamical Vocabulary: A Pipeline to Construct a Basis of Shared Traits in Large-Scale Motions of Proteins

In search of a dynamical vocabulary: a pipeline to construct a basis of shared traits in large-scale motions of proteins

Contact Info

Product

Resources

About