2022
DOI: 10.1016/j.copbio.2022.102713
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning to navigate fitness landscapes for protein engineering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 61 publications
(51 citation statements)
references
References 40 publications
0
44
0
Order By: Relevance
“…Effectively exploring rough (i.e., not smooth) landscapes with limited experimental throughput is a challenge for DE [911]. Thankfully, the confluence of advancements in DNA synthesis (bespoke oligonucleotide sequences), DNA sequencing (high throughput screens), and machine learning have enabled a new paradigm of machine learning guided directed evolution (MLDE) [1215] which can address the challenge of exploring rough landscapes with limited data. In each round of MLDE, a set of (variant, activity) pairs are collected, which a practitioner can use to train a genotype-phenotype model to predict the effect of variants.…”
Section: Introductionmentioning
confidence: 99%
“…Effectively exploring rough (i.e., not smooth) landscapes with limited experimental throughput is a challenge for DE [911]. Thankfully, the confluence of advancements in DNA synthesis (bespoke oligonucleotide sequences), DNA sequencing (high throughput screens), and machine learning have enabled a new paradigm of machine learning guided directed evolution (MLDE) [1215] which can address the challenge of exploring rough landscapes with limited data. In each round of MLDE, a set of (variant, activity) pairs are collected, which a practitioner can use to train a genotype-phenotype model to predict the effect of variants.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, deep representation learning has emerged as an effective method for protein featurization. This approach, which generates a representation of a given sequence, is based on information extracted from the known protein sequence universe ( So and Karplus, 1996a ; Biswas et al, 2021 ; Garruss et al, 2021 ; Freschlin et al, 2022 ; Alley et al, 2019 ). In contrast, our approach is based on features derived from structural and physicochemical properties of amino acids.…”
Section: Resultsmentioning
confidence: 99%
“…In recent years, deep representation learning has emerged as an effective method for protein featurization. This approach, which generates a representation of a given sequence, is based on information extracted from the known protein sequence universe (50,(55)(56)(57)(58). In contrast, our approach is based on features derived from structural and physicochemical properties of amino acids.…”
Section: Comparison With Sequence-based Featurizationmentioning
confidence: 99%