2020
DOI: 10.1101/2020.07.24.219899
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

What the Phage: A scalable workflow for the identification and analysis of phage sequences

Abstract: Phages are among the most abundant and diverse biological entities on earth. Identification from sequence data is a crucial first step to understand their impact on the environment. A variety of bacteriophage identification tools have been developed over the years. They differ in algorithmic approach, results and ease of use. We, therefore, developed “What the Phage” (WtP), an easy-to-use and parallel multitool approach for phage identification combined with an annotation and classification downstream strategy… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(16 citation statements)
references
References 35 publications
0
16
0
Order By: Relevance
“…The identified contigs from each assembly were then compiled and clustered at 95% nucleotide identity using CD-HIT v4.8.1 (parameters: -c 0.95 -d 400 -T 20 -M 20000 -n 5) [44], producing 2,885 viral OTUs (vOTUs). Viral contigs identified by (3) were further validated using "What the Phage" (WtP) [45] and VIBRANT (v1.2.1, virome mode) [46], both with default settings. WtP v1.0.0 combines 12 established viral identification tools for the identification and analysis of viral sequences.…”
Section: Identification Of Viral Contigsmentioning
confidence: 99%
“…The identified contigs from each assembly were then compiled and clustered at 95% nucleotide identity using CD-HIT v4.8.1 (parameters: -c 0.95 -d 400 -T 20 -M 20000 -n 5) [44], producing 2,885 viral OTUs (vOTUs). Viral contigs identified by (3) were further validated using "What the Phage" (WtP) [45] and VIBRANT (v1.2.1, virome mode) [46], both with default settings. WtP v1.0.0 combines 12 established viral identification tools for the identification and analysis of viral sequences.…”
Section: Identification Of Viral Contigsmentioning
confidence: 99%
“…To support a transparent and reproducible metagenomics workflow, all reads that cannot be mapped back to the existing high-quality bins (after the refinement) are available as an output for further analysis. These "unused" reads could be further analyzed by other tools such as Kraken2 [ 40 ], Kaiju [ 41 ], or centrifuge [ 42 ] for read classification, "What the Phage" [ 43 ] to search for phages, mi-faser [ 44 ] for functional annotation of the reads or even use these reads as a new input to run MUFFIN.…”
Section: Introductionmentioning
confidence: 99%
“…The same procedure was employed against the mMGE and GPD databases (44,51). Further, phage prediction using the genomic neighborhood information was performed using the upstream and downstream nucleotide sequences (+-10kb) of every representative sequence as a query for the metasoftware WhatThePhage (52). Only sequences predicted to be encoded in phages/prophages by at least 3 different tools included in WhatThePhage were labeled as so.…”
Section: Methodsmentioning
confidence: 99%
“…In addition, UG15 members have been also found within the Helicobacter pylori accessory genome (HPSJM_07740 gene of H. pylori SJM180) (92). To find which UG/Abi RTs were encoded in MGEs, we analyzed the genomic context of every representative sequence in search of phage signatures using the metasoftware WhatThePhage (52) and searched MGE databases (50) with all prokaryotic RTs HMM profiles (Methods).…”
Section: Presence Of Ug/abi Rts In Mobile Genetic Elements and Viral Genomesmentioning
confidence: 99%