2021
DOI: 10.1093/bioinformatics/btab451
|View full text |Cite
|
Sign up to set email alerts
|

efam: an expanded, metaproteome-supported HMM profile database of viral protein families

Abstract: Motivation Viruses infect, reprogram, and kill microbes, leading to profound ecosystem consequences, from elemental cycling in oceans and soils to microbiome-modulated diseases in plants and animals. Although metagenomic datasets are increasingly available, identifying viruses in them is challenging due to poor representation and annotation of viral sequences in databases. Results Here we establish efam, an expanded collectio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

3
6

Authors

Journals

citations
Cited by 20 publications
(25 citation statements)
references
References 61 publications
0
17
0
Order By: Relevance
“…Since amino acid sequences of RdRp varied greatly among all RNA viruses, various bioinformatics attempts have been made to find RdRp genes from metatranscriptome data (Wolf et al, 2018;Bigot et al, 2019;Babaian and Edgar, 2021;Zayed et al, 2021). In this study, RdRp genes were comprehensively obtained (named NeoRdRp-seq), clustered based on amino acid similarity, and HMM profiles were constructed (named NeoRdRp-HMM) (Fig.…”
Section: Discussionmentioning
confidence: 99%
“…Since amino acid sequences of RdRp varied greatly among all RNA viruses, various bioinformatics attempts have been made to find RdRp genes from metatranscriptome data (Wolf et al, 2018;Bigot et al, 2019;Babaian and Edgar, 2021;Zayed et al, 2021). In this study, RdRp genes were comprehensively obtained (named NeoRdRp-seq), clustered based on amino acid similarity, and HMM profiles were constructed (named NeoRdRp-HMM) (Fig.…”
Section: Discussionmentioning
confidence: 99%
“…The prokaryotic Viruses Orthologous Groups (pVOGs) database (Grazziotin et al 2017), the Virus Orthologous Group (VOGDB) database (Kiening et al 2019), and the Prokaryotic virus Remote Homologous Groups (PHROGS) database were searched using hhsearch (Steinegger et al 2019). Finally, the efam database (Zayed et al 2021) was searched using hmmscan (Eddy 2011). At the end, the annotations of each protein were manually curated and compared with the annotations of the other proteins in the same protein cluster.…”
Section: Methodsmentioning
confidence: 99%
“…This is facilitated by the recent creation of free servers, such as PVPredn, that can identify phage virion proteins from nucleotide sequences [38]; further assistance is provided by databases, such as IMG/VR v3 (created in 2016) that, although not specific for bacteriophages, represents the largest collection of viral sequences so far compiled [39]. Additional, more specialized, bioinformatic tools include efam, established by Zayed et al in 2021, and described by the authors as "an expanded collection of Hidden Markov Model (HMM) profiles that represent viral protein families conservatively identified from the Global Ocean Virome 2.0 dataset" [40].…”
Section: Exploiting Bacteriophage Proteomesmentioning
confidence: 99%