2023
DOI: 10.1101/2023.10.24.563881
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

TRILL: Orchestrating Modular Deep-Learning Workflows for Democratized, Scalable Protein Analysis and Engineering

Zachary A Martinez,
Richard M. Murray,
Matt W. Thomson

Abstract: Deep-learning models have been rapidly adopted by many fields, partly due to the deluge of data humanity has amassed. In particular, the petabases of biological sequencing data enable the unsupervised training of protein language models that learn the "language of life." However, due to their prohibitive size and complexity, contemporary deep-learning models are often unwieldy, especially for scientists with limited machine learning backgrounds. TRILL (TRaining and Inference using the Language of Life) is a pl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 42 publications
0
2
0
Order By: Relevance
“…LLMs could power these automated systems, with AI flexibly adapting to perform new types of syntheses and screens with robotic scripts written on the fly. At the same time, multiple desirable properties and activity for multiple reactions could be optimized simultaneously during protein engineering campaigns, powered by generalized ML models that can utilize multimodal representations of proteins. With ever increasing amounts of data on protein structures and sequence-fitness pairs, and new tools to conduct experiments and make ML methods for proteins more accessible to the broader community, the future of ML-assisted protein engineering is bright.…”
Section: Conclusion: Toward General Self-driven Protein Engineeringmentioning
confidence: 99%
“…LLMs could power these automated systems, with AI flexibly adapting to perform new types of syntheses and screens with robotic scripts written on the fly. At the same time, multiple desirable properties and activity for multiple reactions could be optimized simultaneously during protein engineering campaigns, powered by generalized ML models that can utilize multimodal representations of proteins. With ever increasing amounts of data on protein structures and sequence-fitness pairs, and new tools to conduct experiments and make ML methods for proteins more accessible to the broader community, the future of ML-assisted protein engineering is bright.…”
Section: Conclusion: Toward General Self-driven Protein Engineeringmentioning
confidence: 99%
“…Other ideas correspond to tools that do not exist yet but could be developed in the near future, in particular given the release of the TRILL toolkit for interfacing existing tools, which in turn could facilitate the contribution of experimentalists: A meta-tool that combines all modeling programs in one pass, ranks them globally and suggests an ideal composite model. Although a jury method has been proposed for the prediction of protein–protein complexes, this is still a largely unexplored possibility. A predictor of post-translational modifications.…”
Section: Wish Listmentioning
confidence: 99%