2023
DOI: 10.1101/2023.04.07.536023
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Prediction of virus-host associations using protein language models and multiple instance learning

Abstract: Predicting virus-host association is essential to understand how viruses interact with host species, and discovering new therapeutics for viral diseases across humans and animals. Currently, the host of the majority of viruses is unknown. Here, we introduce EvoMIL, a deep learning method that predicts virus-host association at the species level from viral sequence only. The method combines a pre-trained large protein language model and attention-based multiple instance learning (MIL) to allow protein-orientate… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 25 publications
0
3
0
Order By: Relevance
“…One explanation is infectivity changes driven by limited mutations, mainly in the spike protein 23,24 . Utilizing protein language models, known for extracting feature vectors that reflect the structural and functional properties of proteins 11 , could enhance model performance by detecting key changes at the amino acid level relevant to viral infectivity 25 .…”
Section: Discussionmentioning
confidence: 99%
“…One explanation is infectivity changes driven by limited mutations, mainly in the spike protein 23,24 . Utilizing protein language models, known for extracting feature vectors that reflect the structural and functional properties of proteins 11 , could enhance model performance by detecting key changes at the amino acid level relevant to viral infectivity 25 .…”
Section: Discussionmentioning
confidence: 99%
“…Predictive modeling is increasingly relevant in virus-host interaction research with regard to functional predictions of virus proteins and their interactions with host proteins [1][2][3][4]. The information gained from prediction models can be used to better understand the nature of virus-virus, virus-host, and host-host protein interactions during viral infection.…”
Section: Introductionmentioning
confidence: 99%
“…The information gained from prediction models can be used to better understand the nature of virus-virus, virus-host, and host-host protein interactions during viral infection. For example, powerful computational methods can predict protein structure and protein functions from viral amino acid sequences [2][3][4]. In cases where the structure of a similar protein to a candidate viral protein is experimentally determined, algorithms based on modeling can provide accurate predictions of the protein structure.…”
Section: Introductionmentioning
confidence: 99%