2021
DOI: 10.1038/s41587-020-00793-4
|View full text |Cite
|
Sign up to set email alerts
|

Deep diversification of an AAV capsid protein by machine learning

Abstract: Nature provides abundant examples of protein families with highly diverged sequences. The ability to design new protein homologs has many applications, yet synthetic approaches have been unable to generate similarly diverse protein sequences with functional activity in the lab [1, 2]. New technologies offer a solution: high-throughput DNA synthesis and sequencing technologies allow thousands of designed sequences to be assayed in parallel, enabling deep diversification guided by machine learning (ML) models th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
221
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 197 publications
(252 citation statements)
references
References 33 publications
2
221
0
Order By: Relevance
“…Another challenging future direction in protein sequence-function modeling will be assessing how well trained models can extrapolate to sequences outside the training distribution, such as variants with higher-order mutations [32, 33]. All of our quantitative evaluations involved test set variants that had similar characteristics to the training set variants.…”
Section: Discussionmentioning
confidence: 99%
“…Another challenging future direction in protein sequence-function modeling will be assessing how well trained models can extrapolate to sequences outside the training distribution, such as variants with higher-order mutations [32, 33]. All of our quantitative evaluations involved test set variants that had similar characteristics to the training set variants.…”
Section: Discussionmentioning
confidence: 99%
“…The trained models were used to design icosahedral capsids based on highly diverse protein sequences. In total, 201,426 variants were predicted of which 110,689 were experimentally verified to be viable as AAV capsid [ 333 ]. This approach unlocked vast areas of functional but previously unreachable sequence space, providing new opportunities, not only for viral vector design but also in general for the design of protein-based (nano-) materials.…”
Section: Discussionmentioning
confidence: 99%
“…Although immune response against exogenous Cas13 protein could reduce its therapeutic activity in repeated dosing, the diversity of CRISPR-Cas13 and recent advances in protein engineering may help alleviate this issue. [34][35][36] Viral proteins of several human RNA viruses and interferon stimulated genes (ISGs) were shown to dampen RNAi activity. 37 In contrast, since CRISPR-Cas13 is of bacterial origin, it is likely that mammalian viruses have never interacted with it in nature and are likely not able to counter it right away.…”
Section: Discussionmentioning
confidence: 99%