2006
DOI: 10.1007/11780441_29

Reducing the Space Requirement of LZ-Index

Abstract: The LZ-index is a compressed full-text self-index able to represent a text $T_{1 \ldots u}$, over an alphabet of size $\sigma$ and with $k$-th order empirical entropy $H_k(T)$, using $4uH_k(T) + o(u \log \sigma)$ bits for any $k = o(\log_\sigma u)$. It can report all the $occ$ occurrences of a pattern $P_{1 \ldots m}$ in $T$ in $O(m^3 \log \sigma + (m + occ) \log u)$ worst-case time. This is the only existing data structure of size $O(uH_k(T))$ able to spend $O(\log u)$ time per occurrence reported. Its main drawback is the factor 4 in its space compl…

Cited by 28 publications (50 citation statements)
References 21 publications (53 reference statements)
“…We plan to try out others that have mild dependence on the alphabet size. In particular, adapting the LZ-index [23,2] should offer fast locating of occurrences.…”
Section: Discussion (mentioning)
confidence: 99%
“…The case when $P$ overlaps three or more words is a generalization of the case with two words. Recently, Arroyuelo et al. [1] improved the original data structure in [10]. For any text $T$, the improved index uses $(2 + \epsilon)|T|H_k(T) + o(|T| \log |\Sigma|)$ bits of space, where $H_k(T)$ is the $k$-th order empirical entropy of $T$, and reports all $occ$ occurrences of $P$ in $T$ in $O(|P|^2 \log |P| + (|P| + occ) \log |T|)$ time.…”
Section: Compressed Full-text Indexing (mentioning)
confidence: 99%
“…A new LZ-index, the Inverted LZ-index (ILZI for short) [38], has appeared independently and simultaneously with our work [4]. The ILZI is faster than our data structures since it can report the pattern occurrences in $O((m + occ) \log u)$ time, but at the price of a higher space requirement: $(5 + \epsilon)uH_k(T) + o(u \log \sigma)$ bits.…”
Section: Our Contribution (mentioning)
confidence: 99%