2022
DOI: 10.1101/2022.05.27.493774
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

CELL-E: A Text-To-Image Transformer for Protein Localization Prediction

Abstract: Predicting the cellular activities of proteins from their primary amino acid sequences is a highly desirable capability that could greatly enhance our functional understanding of the proteome. Here, we demonstrate CELL-E, a text-to-image transformer architecture, which given a protein sequence and a reference image for cell (or nucleus) morphology, can generate a 2D probability density map of the protein distribution within cells. Unlike previous in silico methods, which rely on existing, discrete class annota… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 104 publications
(179 reference statements)
0
1
0
Order By: Relevance
“…Recent advances in machine learning have enabled the study of protein properties using either protein sequences or cellular images. Models have been developed that infer different properties of a protein, such as its localization and structure, from the protein sequence [7][8][9][10][11][12] . While the prediction of protein localization based on its sequence allows generalizing to unseen proteins, the localization prediction task cannot capture relative protein abundance among different cellular compartments, contextual differences in localization among single cells, or cell-type specific localization differences among cell lines.…”
Section: Introductionmentioning
confidence: 99%
“…Recent advances in machine learning have enabled the study of protein properties using either protein sequences or cellular images. Models have been developed that infer different properties of a protein, such as its localization and structure, from the protein sequence [7][8][9][10][11][12] . While the prediction of protein localization based on its sequence allows generalizing to unseen proteins, the localization prediction task cannot capture relative protein abundance among different cellular compartments, contextual differences in localization among single cells, or cell-type specific localization differences among cell lines.…”
Section: Introductionmentioning
confidence: 99%