2021
DOI: 10.1186/s12915-021-01180-4
|View full text |Cite
|
Sign up to set email alerts
|

Predicting the hosts of prokaryotic viruses using GCN-based semi-supervised learning

Abstract: Background Prokaryotic viruses, which infect bacteria and archaea, are the most abundant and diverse biological entities in the biosphere. To understand their regulatory roles in various ecosystems and to harness the potential of bacteriophages for use in therapy, more knowledge of viral-host relationships is required. High-throughput sequencing and its application to the microbiome have offered new opportunities for computational approaches for predicting which hosts particular viruses can inf… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
27
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 21 publications
(27 citation statements)
references
References 41 publications
0
27
0
Order By: Relevance
“…Several state-of-the-art tools are available for phage–host association prediction, including PHP ( Lu et al , 2021 ), HoPhage (consisting of HoPhage-G and HoPhage-S) ( Tan et al , 2022 ), VPF-Class ( Pons et al , 2021 ), VHM-Net ( Wang et al , 2020 ), vHULK ( Amgarten et al , 2020 ), RaFAH ( Coutinho et al , 2021 ) and HostG ( Shang and Sun, 2021 ). Each of these tools has its own limitations making it unsuitable for the question of predicting phage–host contig associations in natural metagenomic settings.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…Several state-of-the-art tools are available for phage–host association prediction, including PHP ( Lu et al , 2021 ), HoPhage (consisting of HoPhage-G and HoPhage-S) ( Tan et al , 2022 ), VPF-Class ( Pons et al , 2021 ), VHM-Net ( Wang et al , 2020 ), vHULK ( Amgarten et al , 2020 ), RaFAH ( Coutinho et al , 2021 ) and HostG ( Shang and Sun, 2021 ). Each of these tools has its own limitations making it unsuitable for the question of predicting phage–host contig associations in natural metagenomic settings.…”
Section: Discussionmentioning
confidence: 99%
“…On the other hand, ContigNet can also take the whole phage and host genomes as input because it supports contigs with any length. We compared its performance with other methods on the whole genomes according to the experimental steps described in Shang and Sun (2021) by testing the methods on the whole dataset and the dataset with only phage–host pairs without alignment results. And the results are shown in Figures 11 and 12 .…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…In this work, we present a method, named PhaMer, to identify phage contigs from metagenomic data. Because previous works have shown the importance of protein composition for phage classification [ 24 , 25 ], we employ a contextualized embedding model from natural language processing (NLP) to learn protein-associated patterns in phages. Specifically, by converting a sequence into a sentence composed of protein-based tokens, we employ the embedding model to learn both the protein composition and also their associations in phage sequences.…”
Section: Introductionmentioning
confidence: 99%