2022
DOI: 10.1093/bioinformatics/btac468
|View full text |Cite
|
Sign up to set email alerts
|

3CAC: improving the classification of phages and plasmids in metagenomic assemblies using assembly graphs

Abstract: Motivation Bacteriophages and plasmids usually coexist with their host bacteria in microbial communities and play important roles in microbial evolution. Accurately identifying sequence contigs as phages, plasmids and bacterial chromosomes in mixed metagenomic assemblies is critical for further unraveling their functions. Many classification tools have been developed for identifying either phages or plasmids in metagenomic assemblies. However, only two classifiers, PPR-Meta and viralVerify, w… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(13 citation statements)
references
References 48 publications
(58 reference statements)
0
12
0
Order By: Relevance
“…Despite these limitations, we hope the developed benchmark may be informative to users and would be further developed to include new computational challenges. It should be noted that the results presented here are limited to those tools that could be installed and run by July 2021, and since then many more tools have been published [we are aware of 3CAC ( Pu and Shamir, 2022 ), DeepMicrobeFinder ( Hou et al, 2021 ), INHERIT ( Bai et al, 2022 ), PHAMB ( Johansen et al, 2022 ), PhaMer ( Shang et al, 2022 ), VirMine 2.0 ( Johnson and Putonti, 2022 ), and virSearcher ( Liu Q. et al, 2022 )]. Additionally, modular pipelines such as the IMG/VR viral discovery pipeline ( Paez-Espino et al, 2017 ) and computational pipelines combining several tools presented here, were not evaluated in this work but could be assessed using the same benchmark datasets developed here.…”
Section: Conclusion and Recommendationsmentioning
confidence: 99%
“…Despite these limitations, we hope the developed benchmark may be informative to users and would be further developed to include new computational challenges. It should be noted that the results presented here are limited to those tools that could be installed and run by July 2021, and since then many more tools have been published [we are aware of 3CAC ( Pu and Shamir, 2022 ), DeepMicrobeFinder ( Hou et al, 2021 ), INHERIT ( Bai et al, 2022 ), PHAMB ( Johansen et al, 2022 ), PhaMer ( Shang et al, 2022 ), VirMine 2.0 ( Johnson and Putonti, 2022 ), and virSearcher ( Liu Q. et al, 2022 )]. Additionally, modular pipelines such as the IMG/VR viral discovery pipeline ( Paez-Espino et al, 2017 ) and computational pipelines combining several tools presented here, were not evaluated in this work but could be assessed using the same benchmark datasets developed here.…”
Section: Conclusion and Recommendationsmentioning
confidence: 99%
“…In the 4CAC algorithm, we exploit the assembly graph to improve the initial classification by the following two steps. The description here follows [33].…”
Section: Methodsmentioning
confidence: 99%
“…PlasFlow [17] and PlasClass [13] are two previously published ML plasmid classifiers trained using logistic regression and a neural network, respectively. Both of the models perform best for fragments greater than 10kb, but are also compatible with 5-kb fragments classification.…”
Section: Comparison To Published Methodsmentioning
confidence: 99%
“…Another promising approach to differentiating plasmid and chromosomal contigs is the use of machine learning (ML) techniques [14]. ML models can potentially learn unique sequence features that differentiate plasmids and chromosomes, and recently, several studies that use ML models to distinguish plasmid and chromosome sequences have been published [15][16][17][18]. For example, PlasFlow employs a neural network for identifying bacterial plasmid sequences in environmental samples and achieved accuracies of up to 96%.…”
Section: Introductionmentioning
confidence: 99%