2021
DOI: 10.1101/2021.03.10.434454
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Sampling of Structure and Sequence Space of Small Protein Folds

Abstract: Nature only samples a small fraction in sequence space, yet many more amino acid combinations can fold into stable proteins. Furthermore, small structural variations in a single fold, which may only be a few amino acids different from the next homolog, define their molecular function. Hence, to design proteins with novel molecular functionalities, such as molecular recognition, methods to control and sample shape diversity are necessary. To explore this space, we developed and experimentally validated a comput… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(13 citation statements)
references
References 25 publications
(59 reference statements)
0
12
0
Order By: Relevance
“…This model was trained on a corpus of 113,681 proteins designed by experts using Rosetta ( Alford et al (2017) ) or dTERMen ( Zhou et al (2020) ) software ( Figure 2–Figure Supplement 1, Figure 2–Figure Supplement 2 ), and achieved high performance on held-out test data. The training corpus comprised the designs reported in Rocklin et al (2017) , in Linsky et al (2021) , as well as previously unpublished designs.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…This model was trained on a corpus of 113,681 proteins designed by experts using Rosetta ( Alford et al (2017) ) or dTERMen ( Zhou et al (2020) ) software ( Figure 2–Figure Supplement 1, Figure 2–Figure Supplement 2 ), and achieved high performance on held-out test data. The training corpus comprised the designs reported in Rocklin et al (2017) , in Linsky et al (2021) , as well as previously unpublished designs.…”
Section: Resultsmentioning
confidence: 99%
“…Library 2 consists of designs from Linsky et al (2021) , which span several topologies (4h: four-helix bundles; HHH: three-helix bundles; beta_grasp: beta-grasp fold; coil: helical bundle; ferredoxin: ferredoxin fold; thio: thioredoxin fold; fold2 and fold4: new folds from Linsky et al (2021) ). Figure 2 – Figure Supplement 2 B shows stability scores for these designs, along with scrambled controls generated from a subset of designs by randomly shuffling the entire designed sequence.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The structure models of these scaffolds must be quite accurate so that the positioning is correct. Using fragment assembly 7 , piecewise fragment assembly 8 , and helical extension 9 , we designed a large set of miniproteins ranging in length from 50 to 65 amino acids containing larger hydrophobic cores than previous miniprotein scaffold libraries 1 , which makes the protein more stable and more tolerant to introduction of the designed binding surfaces. 84,690 scaffolds spanning 5 different topologies with structural metrics predictive of folding were encoded in large oligonucleotide arrays and 34,507 were found to be stable using a high-throughput proteolysis based protein stability assay 10 .…”
Section: By Reducing Algorithmic Complexity From O(n)mentioning
confidence: 99%