2023
DOI: 10.1101/2023.06.21.545880
Preprint

FLOP: Tasks for Fitness Landscapes Of Protein wildtypes

Abstract: Protein engineering has the potential to create optimized protein variants with improved properties and function. An initial step in the protein optimization process typically consists of a search among natural (wildtype) sequences to find the naturally occurring proteins with the most desirable properties. Promising candidates from this initial discovery phase then form the basis of the second step: a more local optimization procedure, exploring the space of variants separated from this candidate by a number …

citations
Cited by 4 publications
(3 citation statements)
references
References 55 publications
“…54,55 Overall, there is also still room to develop better benchmarks for enzyme discovery, to measure the effectiveness of various models and representations. 56 Recently, there has been an explosion in protein structure data from ML-enabled protein structure prediction tools such as AlphaFold2 and others 57−62 and databases of unannotated protein structures. Clustering similar structures is one way to annotate for function.…”
Section: Annotation Of Enzyme Activity Among Known Proteins (mentioning)
confidence: 99%
“…EC numbers do not capture a quantitative notion of similarity between reactions, so enzyme activity prediction would benefit from a learned continuous representation of the similarity between activities, where reactions, substrates, and products are numerically encoded. This could resemble current efforts to encode chemical structures and predict the outcomes of reactions in synthetic organic chemistry. Databases will be useful for the curation and standardization of enzyme reaction data. Overall, there is also still room to develop better benchmarks for enzyme discovery, to measure the effectiveness of various models and representations …”
Section: Discovery Of Functional Enzymes With Machine Learning (mentioning)
confidence: 99%
“…Let 20 be defined as in Section 3.3, where G is the space of probability distributions over the 20 amino acids. $f_{\mathrm{IF}}(x)$ is the probability distribution over the mutated site of x given by an inverse folding model.…”
Section: A5 Kernel Proof (mentioning)
confidence: 99%