2018
DOI: 10.1186/s12859-018-2094-5
|View full text |Cite
|
Sign up to set email alerts
|

Fast and efficient short read mapping based on a succinct hash index

Abstract: BackgroundVarious indexing techniques have been applied by next generation sequencing read mapping tools. The choice of a particular data structure is a trade-off between memory consumption, mapping throughput, and construction time.ResultsWe present the succinct hash index – a novel data structure for read mapping which is a variant of the classical q-gram index with a particularly small memory footprint occupying between 3.5 and 5.3 GB for a human reference genome for typical parameter settings. The succinct… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 20 publications
(22 citation statements)
references
References 25 publications
(44 reference statements)
0
17
0
Order By: Relevance
“…Hash indexes are efficient with precise values searching queries, but the returned values are not sorted. Hash indexes are optimized for queries that use the equality operator and they support full index scans [39].…”
Section: Figurementioning
confidence: 99%
“…Hash indexes are efficient with precise values searching queries, but the returned values are not sorted. Hash indexes are optimized for queries that use the equality operator and they support full index scans [39].…”
Section: Figurementioning
confidence: 99%
“…We choose an error rate of 10 (which is two errors at most for a read or size 20), and discarded reads with more that 2 erros. Other all mappers, such as FEM (Zhang et al, 2018), Hobbes (Ahmadi et al, 2011), and BitMapper2 (Cheng et al, 2019), could not be used, because the reads were too short for an edit distance of 2.…”
Section: Resultsmentioning
confidence: 99%
“…Briefly explained, FM-index alignment tools are derived from the Burrows-Wheeler Transform [ 68 ]—a method to sufficiently compress large amount of data and finding approximate matches of sequences in the reference genome [ 69 ]. Hash table-based aligners uses the seed-and-extend method in combination with additional alignment algorithms [ 68 , 70 , 71 ].…”
Section: Precautions Of Data Output From Sequencingmentioning
confidence: 99%