2021
DOI: 10.3390/v13020150
|View full text |Cite
|
Sign up to set email alerts
|

ViralRecall—A Flexible Command-Line Tool for the Detection of Giant Virus Signatures in ‘Omic Data

Abstract: Giant viruses are widespread in the biosphere and play important roles in biogeochemical cycling and host genome evolution. Also known as nucleo-cytoplasmic large DNA viruses (NCLDVs), these eukaryotic viruses harbor the largest and most complex viral genomes known. Studies have shown that NCLDVs are frequently abundant in metagenomic datasets, and that sequences derived from these viruses can also be found endogenized in diverse eukaryotic genomes. The accurate detection of sequences derived from NCLDVs is th… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
53
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
8
1
1

Relationship

5
5

Authors

Journals

citations
Cited by 58 publications
(60 citation statements)
references
References 49 publications
(67 reference statements)
0
53
0
Order By: Relevance
“…From each cluster, we chose the genome with the highest N50 contig length as the representative. We then decontaminated the genomes through analysis with ViralRecall v.2.0 [62] (-c parameter), with all contigs with negative scores removed on the grounds that they represent non-Nucleocytoviricota contamination or highly unusual gene composition that cannot be validated by our present knowledge of Nucleocytoviricota genomic content. We only considered contigs >10 kbp, given the inherent difficulty in eliminating contamination derived from short contigs.…”
Section: Nucleocytoviricota Genome Setmentioning
confidence: 99%
“…From each cluster, we chose the genome with the highest N50 contig length as the representative. We then decontaminated the genomes through analysis with ViralRecall v.2.0 [62] (-c parameter), with all contigs with negative scores removed on the grounds that they represent non-Nucleocytoviricota contamination or highly unusual gene composition that cannot be validated by our present knowledge of Nucleocytoviricota genomic content. We only considered contigs >10 kbp, given the inherent difficulty in eliminating contamination derived from short contigs.…”
Section: Nucleocytoviricota Genome Setmentioning
confidence: 99%
“…From each cluster, we chose the genome with the highest N50 contig length as the representative. We then decontaminated the genomes through analysis with ViralRecall v.2.0 [52] (-c parameter), with all contigs with negative scores removed on the grounds that they represent non-Nucleocytoviricota contamination or highly unusual gene composition that cannot be validated by our present knowledge of Nucleocytoviricota genomic content. We only considered contigs > 10 kbp given the inherent difficulty in eliminating contamination derived from short contigs.…”
Section: Nucleocytoviricota Genome Setmentioning
confidence: 99%
“…For this study, we analyzed 35 Asfarvirus MAGs generated in two previous studies (Moniruzzaman et al, 2020a;Schulz et al, 2020) and complete genomes of five Asfarviruses (Reteno et al, 2015;Silva et al, 2015;Bajrai et al, 2016;Andreani et al, 2017;Matsuyama et al, 2020). MAGs were quality-checked using ViralRecall v. 2.0 (default parameters), with results manually inspected to ensure that no large non-NCLDV contigs were present (Aylward and Moniruzzaman, 2021). We used Seqkit v0.12.0 (Shen et al, 2016) for FASTA/Q file manipulation to generate the statistics of the genomes and proteins.…”
Section: Comparative Analysis and Protein Annotationmentioning
confidence: 99%