2018
DOI: 10.1016/j.tcs.2017.12.021
|View full text |Cite
|
Sign up to set email alerts
|

Time–space trade-offs for Lempel–Ziv compressed indexing

Abstract: Given a string S, the compressed indexing problem is to preprocess S into a compressed representation that supports fast substring queries. The goal is to use little space relative to the compressed size of S while supporting fast queries. We present a compressed index based on the Lempel-Ziv 1977 compression scheme. We obtain the following time-space trade-offs: For constant-sized alphabetswhere n and m are the length of the input string and query string respectively, z is the number of phrases in the LZ77 pa… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
38
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 41 publications
(41 citation statements)
references
References 37 publications
1
38
0
1
Order By: Relevance
“…4. Self-indexes with efficient extraction require Ω(z log(n/z)) space [105,21,43,10,15], Ω(g) space [17,14], or Ω(e) space [111,7]. 5.…”
Section: Indexmentioning
confidence: 99%
“…4. Self-indexes with efficient extraction require Ω(z log(n/z)) space [105,21,43,10,15], Ω(g) space [17,14], or Ω(e) space [111,7]. 5.…”
Section: Indexmentioning
confidence: 99%
“…On the other hand, there exists a "universal" set Γ P,i of 4(k − 1) positions within the occurrence P i in T C that covers all substrings of P i of length ≤ k. 8 In particular, Γ P,i covers the strings x i for j ∈ {2, . .…”
Section: Computational Complexitymentioning
confidence: 99%
“…Extracting text from Lempel-Ziv compressed text is a notoriously difficult problem. No efficient solution is known within O(z) space (they all require time proportional to the parse's height), although efficient queries can be supported by raising the space by a logarithmic factor [8,6]. Grammars, on the other hand, allow for more compact and timeefficient extraction strategies.…”
mentioning
confidence: 99%
“…see e.g. [10,3,9,8,17,16,13]. This problem is highly relevant as the amount of highlyrepetitive data increases rapidly, and thus it is possible to handle greater amounts of data by compressing it.…”
Section: Introductionmentioning
confidence: 99%
“…Bille et al [3] O(z(lg(n/z) + lg ǫ z)) O(m + occ(lg ǫ n + lg lg n)) n O(1) Bille et al [3] O(z lg(n/z) lg lg z) O(m + occ lg lg n) O(1) Bille et al [3] O(z lg(n/z)) O(m(1 + lg ǫ z lg(n/z) ) + occ(lg ǫ n + lg lg n)) O(1)…”
Section: Introductionunclassified