2016
DOI: 10.1021/acs.jctc.6b00090
|View full text |Cite
|
Sign up to set email alerts
|

An Atomistic Statistically Effective Energy Function for Computational Protein Design

Abstract: Shortcomings in the definition of effective free-energy surfaces of proteins are recognized to be a major contributory factor responsible for the low success rates of existing automated methods for computational protein design (CPD). The formulation of an atomistic statistically effective energy function (SEEF) suitable for a wide range of CPD applications and its derivation from structural data extracted from protein domains and protein-ligand complexes are described here. The proposed energy function compris… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(11 citation statements)
references
References 171 publications
(519 reference statements)
0
11
0
Order By: Relevance
“…As of July 2017, there are ~132,000 structures in the protein data bank (PDB) 29 with a yearly increase of ~10,000, but the number of unique folds has not changed in the past few years, suggesting more data are accumulated on each fold, and therefore statistical learning and utilizing the existing structures are likely able to improve the design methods. 30,31 Recently, two statistical potentials for protein design have been developed, 32,33 and the ABACUS potential 34 has been successfully used in designing proteins. 33,35 While these statistical potentials have a physical basis, machine learning especially deep-learning neural network has recently become a popular method to analyze big data sets, extract complex features, and make accurate predictions.…”
Section: Introductionmentioning
confidence: 99%
“…As of July 2017, there are ~132,000 structures in the protein data bank (PDB) 29 with a yearly increase of ~10,000, but the number of unique folds has not changed in the past few years, suggesting more data are accumulated on each fold, and therefore statistical learning and utilizing the existing structures are likely able to improve the design methods. 30,31 Recently, two statistical potentials for protein design have been developed, 32,33 and the ABACUS potential 34 has been successfully used in designing proteins. 33,35 While these statistical potentials have a physical basis, machine learning especially deep-learning neural network has recently become a popular method to analyze big data sets, extract complex features, and make accurate predictions.…”
Section: Introductionmentioning
confidence: 99%
“…That protein structure can be designed computationally was established a long time ago (1) and has been demonstrated numerous times since (23,26). It is also true that reliance on prior structural data has been broadly explored, both in terms of various statistics-based methods (46,49,52,54) as well as in the creation of chimeras that fused domains or larger fragments from existing structures (64)(65)(66)(67). What is new and exciting about our results here is the marriage between the generality of our approach (i.e., its ability to design sequences for arbitrarily-defined structures) and its reliance on motif-based structural data.…”
Section: Discussionmentioning
confidence: 99%
“…PANENERGY is an example of atomistic statistical energy function built for CPD and related applications (49). It builds upon a traditional atom-pair PDB-derived statistical interaction energy, but applies a novel "connectivity scaling factor" to modulate inter-atomic interactions.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations