2022
DOI: 10.1101/2022.12.13.520346
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

De novo protein design by inversion of the AlphaFold structure prediction network

Abstract: De novo protein design enhances our understanding of the principles that govern protein folding and interactions, and has the potential to revolutionize biotechnology through the engineering of novel protein functionalities. Despite recent progress in computational design strategies, de novo design of protein structures remains challenging, given the vast size of the sequence-structure space. AlphaFold2 (AF2), a state-of-the-art neural network architecture, achieved remarkable accuracy in predicting protein st… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

1
22
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(23 citation statements)
references
References 58 publications
(75 reference statements)
1
22
0
Order By: Relevance
“…We started with macrocycles composed of 710 amino acids and enumerated 48,000 hallucinated models for each size. We clustered the resulting structures from these large sampling runs using torsion bin-based clustering described earlier, and identified 9941, 13405, 19705, and 22206 unique structural clusters for 7-mers, 8-mers, 9-mers, and 10-mer cyclic peptides, respectively (Figure 3A).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…We started with macrocycles composed of 710 amino acids and enumerated 48,000 hallucinated models for each size. We clustered the resulting structures from these large sampling runs using torsion bin-based clustering described earlier, and identified 9941, 13405, 19705, and 22206 unique structural clusters for 7-mers, 8-mers, 9-mers, and 10-mer cyclic peptides, respectively (Figure 3A).…”
Section: Introductionmentioning
confidence: 99%
“…Many of these hallucinated sequences demonstrated promising folding propensity in these calculations, with 114 7-mers, 186 8-mers, 139 9-mers, and 76 10-mer sequences showing Rosetta P near values greater than 0.6. We selected one hallucinated design model per size between 710 amino acids with AlphaFold pLDDT > 0.9 and Rosetta P near > 0.9 for further experimental validation and structural characterization. All four selected designs lack regular secondary structures but are stabilized by extensive intramolecular backbone-to-backbone and backbone-to-sidechain hydrogen bonding.…”
Section: Introductionmentioning
confidence: 99%
“…The copyright holder for this this version posted February 25, 2023. ; https://doi.org/10.1101/2023.02.24.529906 doi: bioRxiv preprint in the sequence update loop 5,12,14 . Though this approach worked well for models predicting distribution of distances for every pair of positions, such as TrRosetta, the approach did not work well in more recent models such as AlphaFold, where single amino acid changes could radically alter the predicted structure, resulting in unstable and inefficient optimization.…”
Section: Introductionmentioning
confidence: 99%
“…This approach may also fail to identify a solution, similar to issues faced by numerical solutions to mathematical problems in a random Monte Carlo fashion. To improve the likelihood of generating accurate predictions, researchers have proposed using sequence gradients obtained by inverting structure prediction networks 12 . However, when updating the gradients, a major issue arises due to the discrete, one-hot encoded representation of sequences.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation