2022
DOI: 10.1371/journal.pone.0265020
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale design and refinement of stable proteins using sequence-only models

Abstract: Engineered proteins generally must possess a stable structure in order to achieve their designed function. Stable designs, however, are astronomically rare within the space of all possible amino acid sequences. As a consequence, many designs must be tested computationally and experimentally in order to find stable ones, which is expensive in terms of time and resources. Here we use a high-throughput, low-fidelity assay to experimentally evaluate the stability of approximately 200,000 novel proteins. These incl… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
3

Relationship

4
4

Authors

Journals

citations
Cited by 20 publications
(18 citation statements)
references
References 76 publications
0
18
0
Order By: Relevance
“…Proteases typically cleave unfolded proteins more quickly than folded ones, and proteolysis assays have been used for decades to measure folding stability (37) and select for high stability proteins (38,39). In 2017, we introduced the high-throughput yeast display proteolysis method for measuring folding stability using next generation sequencing (29,(40)(41)(42)(43)(44)(45)(46). To improve the scale, precision, speed, and cost of stability measurements, we developed cDNA display proteolysis.…”
Section: Massively Parallel Measurement Of Folding Stability By Cdna ...mentioning
confidence: 99%
“…Proteases typically cleave unfolded proteins more quickly than folded ones, and proteolysis assays have been used for decades to measure folding stability (37) and select for high stability proteins (38,39). In 2017, we introduced the high-throughput yeast display proteolysis method for measuring folding stability using next generation sequencing (29,(40)(41)(42)(43)(44)(45)(46). To improve the scale, precision, speed, and cost of stability measurements, we developed cDNA display proteolysis.…”
Section: Massively Parallel Measurement Of Folding Stability By Cdna ...mentioning
confidence: 99%
“…Despite these limitations, our study demonstrates how large-scale experimental testing can be applied to solve a challenging design problem and to quantify the biophysical features that influence design stability. In contrast to other studies that use mutagenesis to study determinants of folding stability (19,(42)(43)(44), our method examines the strengths of different biophysical features across thousands of different protein contexts, although these contexts are all related by the αββα fold and design procedure. Simplified low-resolution models like our linear regression are valuable for building biophysical intuition about the strengths of different interactions (45,46) as well as for guiding the construction of high-resolution models like the Rosetta energy function, which is also an additive model (47).…”
Section: Discussionmentioning
confidence: 99%
“…In our approach, we designed thousands of de novo proteins and measured their folding stabilities using a yeast display-based proteolysis assay coupled to next-generation sequencing (1). Several new studies have applied our methodology (16)(17)(18)(19) as it has several advantages. First, measuring folding stability for thousands of proteins makes it possible to statistically quantify biophysical features that contribute to stability.…”
Section: Introductionmentioning
confidence: 99%
“…The stability score was calculated using the approach in Singer et al, 22 a refinement of the Rocklin et al approach. In brief, fluorescently tagged proteins are expressed on the surface of yeast cells.…”
Section: ■ Computational Methodsmentioning
confidence: 99%