2019
DOI: 10.1073/pnas.1908723117
|View full text |Cite
|
Sign up to set email alerts
|

A general-purpose protein design framework based on mining sequence–structure relationships in known protein structures

Abstract: Current state-of-the-art approaches to computational protein design (CPD) aim to capture the determinants of structure from physical principles. While this has led to many successful designs, it does have strong limitations associated with inaccuracies in physical modeling, such that a reliable general solution to CPD has yet to be found. Here, we propose a design framework—one based on identifying and applying patterns of sequence–structure compatibility found in known proteins, rather than approximating them… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
115
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 92 publications
(117 citation statements)
references
References 84 publications
2
115
0
Order By: Relevance
“…This outcome led us to speculate whether structural motifs other than RxxxR could mediate the lateral interaction between helical stacks and promote nanotube formation. The 36-31-3 filament was employed as the starting point for a computational optimization using the design engine dTERMen 74 , 75 . A solubility constraint was applied in which the occupancy of the solvent-contacting b/e/g residues within the heptad repeats was constrained to polar residues.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…This outcome led us to speculate whether structural motifs other than RxxxR could mediate the lateral interaction between helical stacks and promote nanotube formation. The 36-31-3 filament was employed as the starting point for a computational optimization using the design engine dTERMen 74 , 75 . A solubility constraint was applied in which the occupancy of the solvent-contacting b/e/g residues within the heptad repeats was constrained to polar residues.…”
Section: Resultsmentioning
confidence: 99%
“…dTERMen 74 , 75 was used to perform design. dTERMen is a general-purpose method for computational protein design that is based on the concept of structural degeneracy of proteins.…”
Section: Methodsmentioning
confidence: 99%
“…In practice, the generation of diverse sequences with Rosetta often requires manually manipulating amino acid identities at key positions [7,9], adjusting the energy function [43,44], or explicitly modeling perturbations of the protein backbone [45,46,47,48]. Design methods that account for flexibility of the protein backbone often use "softer" potentials [44,49], such as statistical potentials that are derived from data [50,51,50,52,53,54], but these methods often do not perform as favorably as well-parameterized "hard" molecular mechanics force fields.…”
Section: Introductionmentioning
confidence: 99%
“…However, DF and PS1 have distinct bundle architectures with respect to their interhelical packing, helical offsets, and helical registers, which together presented challenges for structural design. To address these challenges, we extended previous fragment-based approaches ( 43 48 ) and designed artificial multidomain proteins with allosterically communicating sites.…”
mentioning
confidence: 99%