2022
DOI: 10.1021/acs.jpcb.2c04532
|View full text |Cite
|
Sign up to set email alerts
|

Molecular Information Theory Meets Protein Folding

Abstract: We propose an application of molecular information theory to analyze the folding of single domain proteins. We analyze results from various areas of protein science, such as sequence-based potentials, reduced amino acid alphabets, backbone configurational entropy, secondary structure content, residue burial layers, and mutational studies of protein stability changes. We found that the average information contained in the sequences of evolved proteins is very close to the average information needed to specify a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 117 publications
(294 reference statements)
1
4
0
Order By: Relevance
“…This value, corresponding to an average of 22.66 possible amino acids in each position, or from 5 to 8 considering the predominant range of S0 values estimated in Ref. 6, turns out to be reasonably consistent with previous direct estimates, 15–17 as seen above. There would be other conformations with similar sequence‐dependent energy for these folding sequences but they are expected to have higher sequence‐independent energy, not being native‐like.…”
Section: Discussionsupporting
confidence: 89%
See 1 more Smart Citation
“…This value, corresponding to an average of 22.66 possible amino acids in each position, or from 5 to 8 considering the predominant range of S0 values estimated in Ref. 6, turns out to be reasonably consistent with previous direct estimates, 15–17 as seen above. There would be other conformations with similar sequence‐dependent energy for these folding sequences but they are expected to have higher sequence‐independent energy, not being native‐like.…”
Section: Discussionsupporting
confidence: 89%
“…This value appears to be smaller than expected since it would correspond to an average of just two different amino acids in each position. Deep exploration of sequence space for individual structures 15,16 and recent analyses of large sets of sequence alignments 17 have both provided higher estimates that happen to be closer to the present estimate for SseqTg than for SseqTsel. In any case, regardless of any specific estimate, for each sequence that folds to a particular native structure, having energy close to EnormalN in that structure, there are many other sequences, corresponding to an entropy of SseqTgSseqTsel, that are unable to fold to that structure nor to any other structure.…”
Section: Theoretical Analysismentioning
confidence: 99%
“…In these margins, too many minimally frustrated regions might hinder functional evolution, while the presence of too many highly frustrated regions will prevent folding from happening 14 . Frustration conservation analysis within protein families can be used to define the theoretical limits to preserve function throughout evolution revealing the interplay between sequence, structure(s), dynamic and function 46 . Now that high-quality protein structure models can be obtained for the members of any protein family, our frustration conservation analysis strategy stands as a valuable tool to increase the level of functional annotation in biological databases 47 .…”
Section: Discussionmentioning
confidence: 99%
“…In these margins, too many minimally frustrated regions might hinder functional evolution while the presence of too many highly frustrated regions will prevent folding from happening (39). Frustration conservation analysis within protein families can be used to define the theoretical limits to preserve function throughout evolution revealing the interplay between sequence, structure(s), dynamic and function(s) (40). Now that high quality protein structure models can be obtained for the members of any protein family, our frustration conservation analysis strategy stands as a valuable tool to increase the level of functional annotation in biological databases (41).…”
Section: Discussionmentioning
confidence: 99%