2021
DOI: 10.1109/tit.2020.3038147
|View full text |Cite
|
Sign up to set email alerts
|

The Smallest Grammar Problem Revisited

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 39 publications
1
6
0
Order By: Relevance
“…Other encodings were recently studied by Ganczorz [5]. Since Re-Pair is a so-called irreducible grammar, its grammar size, i.e., the sum of the symbols on the right-hand side of all rules, is upper bounded by O(n/ log σ n) ([3], Lemma 2), which matches the information-theoretic lower bound on the size of a grammar for a string of length n. Comparing this size with the size of the smallest grammar, its approximation ratio has O((n/ lg n) 2/3 ) as an upper bound [6] and Ω(lg n/ lg lg n) as a lower bound [7]. On the practical side, Yoshida and Kida [8] presented an efficient fixed-length code for compressing the Re-Pair grammar.…”
Section: Introductionsupporting
confidence: 58%
“…Other encodings were recently studied by Ganczorz [5]. Since Re-Pair is a so-called irreducible grammar, its grammar size, i.e., the sum of the symbols on the right-hand side of all rules, is upper bounded by O(n/ log σ n) ([3], Lemma 2), which matches the information-theoretic lower bound on the size of a grammar for a string of length n. Comparing this size with the size of the smallest grammar, its approximation ratio has O((n/ lg n) 2/3 ) as an upper bound [6] and Ω(lg n/ lg lg n) as a lower bound [7]. On the practical side, Yoshida and Kida [8] presented an efficient fixed-length code for compressing the Re-Pair grammar.…”
Section: Introductionsupporting
confidence: 58%
“…To this end, one either has to improve the upper bound z b = O(bz log n z ) or has to provide a more elaborate series of examples improving the lower bound z b = Ω(bz) from Section 3 (obviously, the examples must deal with non-phrase-aligned parsings). We point out, however, that the tightness of the bound from Theorem 3 would necessarily imply the tightness of the currently best upper bound g = O(z log n z ) [4,18] from Lemma 3 that relates the size g of the minimal grammar generating the string and the size z of the LZ77 parsing for the string (the best lower bound up-to-date is g = Ω(z log n log log n ) [8,10]). Indeed, for a constant b > 1, if there exists a string whose LZ77 parsing has size z and whose b-block contraction can have only LZ77 parsings of size at least Ω(z log n z ), then the minimal grammar of such string must have a size of at least g = Ω(z log n z ) since, by Lemma 3, the string has a phrase-aligned LZ77 parsing of size g, and thus, by Theorem 2, the b-block contraction has an LZ77 parsing of size O(bg), which is O(g) as b is constant.…”
Section: Discussionmentioning
confidence: 96%
“…By a non-constructive argument [8,10,16], one can show that the converse equivalent reduction from LZ77 parsings to SLP grammars is not possible: in some cases, the size of the minimal SLP grammar can be Ω( log n log log n )-times larger than the size of the greedy (i.e., minimal) LZ77 parsing. For completeness, let us show this by repeating here the counting argument essentially used in [10,17].…”
Section: Lz77 Parsingsmentioning
confidence: 99%
See 1 more Smart Citation
“…The α-balanced grammar of Charikar et al [9] produces a (non-SLP) grammar of size O(g * log(N/g * )), where g * denotes the size of the smallest (non-SLP) grammar. Upper bounds and lower bounds for the approximation ratios of other practical grammar compressors including LZ78 [41], BISECTION [26], RePair [29], SEQUEN-TIAL [39], LONGEST MATCH [26], and GREEDY [1], are also known [9,2]. Charikar et al [9] showed that the approximation ratio of RePair to the smallest (non-SLP) grammar is at most O((N/ log N ) 2/3 ) and is at least Ω( √ log N ).…”
Section: Related Workmentioning
confidence: 99%