2017
DOI: 10.1093/bioinformatics/btx520
|View full text |Cite
|
Sign up to set email alerts
|

MetaCache: context-aware classification of metagenomic reads using minhashing

Abstract: Supplementary data are available at Bioinformatics online.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
59
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 50 publications
(59 citation statements)
references
References 22 publications
0
59
0
Order By: Relevance
“…File S1. Other improvements may be possible, for example allowing inexact (approximate) matching between the query and the stored k-mers, as explored in several k-mer based sequence classification tools (Müller et al, 2017;Wood and Salzberg, 2014) or in tools for sequence comparison (Horwege et al, 2014). These or other solutions, adapted to a probabilistic setting, will be examined in the future to reduce the sensitivity of RAPPAS to gaps.…”
Section: Accuracy Limitationsmentioning
confidence: 99%
See 2 more Smart Citations
“…File S1. Other improvements may be possible, for example allowing inexact (approximate) matching between the query and the stored k-mers, as explored in several k-mer based sequence classification tools (Müller et al, 2017;Wood and Salzberg, 2014) or in tools for sequence comparison (Horwege et al, 2014). These or other solutions, adapted to a probabilistic setting, will be examined in the future to reduce the sensitivity of RAPPAS to gaps.…”
Section: Accuracy Limitationsmentioning
confidence: 99%
“…Improvements impacting the computational efficiency (and not just accuracy) of RAPPAS are also conceivable. For example, tailored k-mer indexing techniques can be developed similarly to other recent work (Müller et al, 2017;Liu et al, 2018), and the memory footprint of the pkDB can be reduced by limiting storage to the most discriminant phylokmers (Ounit et al, 2015). Despite the many potential improvements to RAPPAS, it is already faster than other PP implementations on real datasets (Figure 4).…”
Section: Applying Rappas To "Portable" Metagenomicsmentioning
confidence: 99%
See 1 more Smart Citation
“…These methods allow very fast identification but require huge amounts of random-access-memory (RAM), sometimes more than 100 GB, especially for comprehensive databases. Other solutions try to balance between time, accuracy and memory consumption like Centrifuge (8), mash (9), sourmash (10), MetaCache (11), Ganon (12), Kaiju (13) and many more. However, due to the still growing number of reference genomes in, e.g., the NCBIs nucleotide sequence database (14), the amount of primary memory required by almost all of these tools finally scales beyond the scope of a conventional notebook.…”
Section: Introductionmentioning
confidence: 99%
“…Development in this area is continuing in order to increase analysis speed while reducing memory footprint. Currently, DIAMOND is one of the fastest local aligners that has a sensitivity comparable to BLAST (Buchfink et al, 2015), and MetaCache is one of the fastest and most memory e cient k-mer based classifiers, using only a discriminatory subset of available k-mers (Müller et al, 2017). All of these approaches, however, are based on sequence similarity, which can be incongruent with the true phylogenetic relationship of sequences (Smith and Pease, 2017).…”
Section: Introductionmentioning
confidence: 99%