2022
DOI: 10.1039/d2sm00636g
|View full text |Cite
|
Sign up to set email alerts
|

Information-theoretical measures identify accurate low-resolution representations of protein configurational space

Abstract: Different clustering strategies can produce qualitatively different low-resolution representations of a protein’s conformational space. The resolution-relevance framework pinpoints those that better preserve important, biologically relevant features.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 55 publications
(72 reference statements)
0
4
0
Order By: Relevance
“…Conversely, CG representations that identified sites with α carbons along the protein backbone resulted in relatively high information loss (i.e., high H map ). These C-α mappings, which are widely adopted in CG protein models, presumably correspond to highly structured conditioned distributions, p r|R , since the atomic structure of the protein backbone can be accurately reconstructed from the α carbon coordinates. , Subsequently, Potestio and co-workers , related the mapping entropy to “resolution” and “relevance” metrics that have been employed to characterize deep learning . As they have emphasized, fundamental insights into the mapping operator may prove useful not only for determining CG representations but also for identifying order parameters to analyze and bias molecular simulations, and even much more generally for understanding complex interacting systems, such as economic markets or social networks. , …”
Section: Coarse-grained Representationmentioning
confidence: 99%
See 1 more Smart Citation
“…Conversely, CG representations that identified sites with α carbons along the protein backbone resulted in relatively high information loss (i.e., high H map ). These C-α mappings, which are widely adopted in CG protein models, presumably correspond to highly structured conditioned distributions, p r|R , since the atomic structure of the protein backbone can be accurately reconstructed from the α carbon coordinates. , Subsequently, Potestio and co-workers , related the mapping entropy to “resolution” and “relevance” metrics that have been employed to characterize deep learning . As they have emphasized, fundamental insights into the mapping operator may prove useful not only for determining CG representations but also for identifying order parameters to analyze and bias molecular simulations, and even much more generally for understanding complex interacting systems, such as economic markets or social networks. , …”
Section: Coarse-grained Representationmentioning
confidence: 99%
“…The large majority of these studies have focused on small molecules with relatively little conformational flexibility , or systems that fluctuate about a well-defined equilibrium conformation. , ,,,, Clementi and co-workers have provided important insights for modeling more complex systems that transition between diverse conformational states . By employing diffusion maps and Markov state methods, they identified coherent domains that persist in microsecond protein simulations with global folding and unfolding transitions. , In such systems, the optimal mapping may dynamically vary as the system transitions among diverse metastable conformations.…”
Section: Coarse-grained Representationmentioning
confidence: 99%
“…The critical variable selection method, also known as resolution-relevance 71 76 , has been successful in identifying optimal clustering for the reduction of complexity in the representation of biomolecules 77 or for a protein conformational landscape 78 .…”
Section: Daily Transition Matrixmentioning
confidence: 99%
“…The relevance is the Shannon entropy associated with this second empirical probability. For both limit cases of 1 and N clusters, , the relevance being non-negative otherwise 45 , 78 . The maximum relevance thus corresponds to an optimal clustering, i.e.…”
Section: Daily Transition Matrixmentioning
confidence: 99%