2024
DOI: 10.1101/2024.04.22.590591
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences

Jeffrey A. Ruffolo,
Stephen Nayfach,
Joseph Gallagher
et al.

Abstract: Gene editing has the potential to solve fundamental challenges in agriculture, biotechnology, and human health. CRISPR-based gene editors derived from microbes, while powerful, often show significant functional tradeoffs when ported into non-native environments, such as human cells. Artificial intelligence (AI) enabled design provides a powerful alternative with potential to bypass evolutionary constraints and generate editors with optimal properties. Here, using large language models (LLMs) trained on biologi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 89 publications
(130 reference statements)
0
2
0
Order By: Relevance
“…1b ) and generating 20 sequences per parameter set. We tested a sampling generative procedure 26 , with a temperature of 1, max_length of 1024, top_p of 1, repetition penalty 1.2 and 1.3, and top_k for the values from 5 to 20, and 30, 50, 100, 200, and 458. Accuracy to match the natural distribution was computed as the sum of the absolute differences between all amino acid pairs.…”
Section: Functional Prediction Analysesmentioning
confidence: 99%
See 1 more Smart Citation
“…1b ) and generating 20 sequences per parameter set. We tested a sampling generative procedure 26 , with a temperature of 1, max_length of 1024, top_p of 1, repetition penalty 1.2 and 1.3, and top_k for the values from 5 to 20, and 30, 50, 100, 200, and 458. Accuracy to match the natural distribution was computed as the sum of the absolute differences between all amino acid pairs.…”
Section: Functional Prediction Analysesmentioning
confidence: 99%
“…Protein language models (pLMs) have likewise showcased impressive success, including predicting the effect of mutations 14 and protein structures 15,16 by learning statistics of coevolving residues 17 and allowing the step-wise design of de novo proteins 18 . In the context of enzyme design, pLMs and other generative models have provided artificial variants for several enzyme families 19,20 , including artificial TEM-1 β-lactamases 21 , lysozymes 22 , luciferases 2 , malate dehydrogenases 23 , superoxide dismutases 24 , chorismate mutases 25 , and CRISPR-Cas genome editors 26 in the last three years alone 27 . While these works highlight the potential of AI architectures in the protein realm, the implementation of a model that produces highly active enzymes without the need for further training and with high success rates remains a longstanding goal in the field.…”
Section: Introductionmentioning
confidence: 99%