2023
DOI: 10.1101/2023.12.22.573145
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Unexplored regions of the protein sequence-structure map revealed at scale by a library of foldtuned language models

Arjuna M. Subramanian,
Matt Thomson

Abstract: Nature has likely sampled only a fraction of all protein sequences and structures allowed by the laws of biophysics. However, the combinatorial scale of amino-acid sequence-space has traditionally precluded substantive study of the full protein sequence-structure map. In particular, it remains unknown how much of the vast uncharted landscape of far-from-natural sequences consists of alternate ways to encode the familiar ensemble of natural folds; proteins in this category also represent an opportunity to diver… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 16 publications
0
1
0
Order By: Relevance
“…It is also possible to design desired enzyme scaffolds/structures (Figure E). One approach is hallucination, where a search algorithm uses a structure predictor to find a sequence that folds to the right structure. ,, Luciferases with high luminescence and selectivity were engineered using deep-learning-assisted protein design, by combining hallucination with Rosetta sequence design . One of the wet-lab-validated designs demonstrated catalytic activity comparable to natural luciferases, with much higher substrate selectivity: the active site and the enzyme scaffold were both entirely different from naturally occurring luciferases.…”
Section: Discovery Of Functional Enzymes With Machine Learningmentioning
confidence: 99%
“…It is also possible to design desired enzyme scaffolds/structures (Figure E). One approach is hallucination, where a search algorithm uses a structure predictor to find a sequence that folds to the right structure. ,, Luciferases with high luminescence and selectivity were engineered using deep-learning-assisted protein design, by combining hallucination with Rosetta sequence design . One of the wet-lab-validated designs demonstrated catalytic activity comparable to natural luciferases, with much higher substrate selectivity: the active site and the enzyme scaffold were both entirely different from naturally occurring luciferases.…”
Section: Discovery Of Functional Enzymes With Machine Learningmentioning
confidence: 99%