2018 Proceedings of the Twentieth Workshop on Algorithm Engineering and Experiments (ALENEX) 2018
DOI: 10.1137/1.9781611975055.1
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid Indexing Revisited

Abstract: Hybrid indexing is a recent approach to text indexing that allows the space-usage of conventional text indexes (e.g., suffix trees, suffix arrays, FM-indexes) to scale well with the text size, n, when z, the size of the Lempel-Ziv parsing of the text, is small relative to n. The price for this improved scalability is that an upper bound M on the pattern length that can be searched for must be declared at index construction time. Because the size of the resulting index contains an O(M z) term, M must be kept re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 21 publications
(21 reference statements)
0
10
0
Order By: Relevance
“…We compared r-index with the state-of-the-art index for each compressibility measure: lzi 23 [73,24] (z), slp 23 [25,24] (g), rlcsa 24 [79,80] (r), and cdawg 25 [9] (e). We also included hyb 26 [30,31], which combines a Lempel-Ziv index with an FM-index, with parameter M = 8, which is optimal for our experiment. We tested rlcsa using three suffix array sample rates per dataset: the rate X resulting in the same size for rlcsa and r-index, plus rates X/2 and X/4.…”
Section: Methodsmentioning
confidence: 99%
“…We compared r-index with the state-of-the-art index for each compressibility measure: lzi 23 [73,24] (z), slp 23 [25,24] (g), rlcsa 24 [79,80] (r), and cdawg 25 [9] (e). We also included hyb 26 [30,31], which combines a Lempel-Ziv index with an FM-index, with parameter M = 8, which is optimal for our experiment. We tested rlcsa using three suffix array sample rates per dataset: the rate X resulting in the same size for rlcsa and r-index, plus rates X/2 and X/4.…”
Section: Methodsmentioning
confidence: 99%
“…5 We use Patricia trees sampling values s = 4, 16, 64. lz-index and lze-index: Two variants of the Lempel-Ziv based index [15]. 6 hyb-index: A hybrid between a Lempel-Ziv and a BWT-based index [7]. 7 We build it with parameters M = 8, 16, the best for this case.…”
Section: Tested Indexesmentioning
confidence: 99%
“…6 hyb-index: A hybrid between a Lempel-Ziv and a BWT-based index [7]. 7 We build it with parameters M = 8, 16, the best for this case.…”
Section: Tested Indexesmentioning
confidence: 99%
See 1 more Smart Citation
“…This has a major impact on the final size of the index. For this reason, the hybrid index is suitable mainly for short-read alignment, although there have been recent heuristic modifications to allow for longer alignments (Ferrada et al, 2018). In contrast, the r-index, of which we provide an implementation in this work, has no such length limitation.…”
Section: Related Workmentioning
confidence: 99%