2000
DOI: 10.1109/18.850665
|View full text |Cite
|
Sign up to set email alerts
|

Universal lossless compression via multilevel pattern matching

Abstract: A universal lossless data compression code called the multilevel pattern matching code (MPM code) is introduced. In processing a finite-alphabet data string of length , the MPM code operates at (log log) levels sequentially. At each level, the MPM code detects matching patterns in the input data string (substrings of the data appearing in two or more nonoverlapping positions). The matching patterns detected at each level are of a fixed length which decreases by a constant factor from level to level, until this… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
45
0
3

Year Published

2002
2002
2019
2019

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 82 publications
(48 citation statements)
references
References 13 publications
0
45
0
3
Order By: Relevance
“…LZW [21] including LZ78 [24], and BISECTION [8] are considered as algorithms that computes straight-line programs, CFGs formed from Chomsky normal form formulas. Also algorithms for restricted CFGs have been presented in [6], [10], [15], [16], [22].…”
Section: Introductionmentioning
confidence: 99%
“…LZW [21] including LZ78 [24], and BISECTION [8] are considered as algorithms that computes straight-line programs, CFGs formed from Chomsky normal form formulas. Also algorithms for restricted CFGs have been presented in [6], [10], [15], [16], [22].…”
Section: Introductionmentioning
confidence: 99%
“…MPM code was developed by Kieffer et al [11] for efficient grammar-based text compression. Since MPM grammar sizes can be exponentially small with respect to the original string sizes, it is a rather hard task to solve the pattern matching problem in time proportional only to the grammar sizes.…”
Section: Conclusion and Further Discussionmentioning
confidence: 99%
“…Since computing a minimal SLP that generates a given string is known to be NP-complete, it is of great significance to develop approximative algorithms for generating small grammars [19,2]. One of those algorithms is the multilevel pattern matching code (MPM code) introduced by Kieffer et al [11]. MPM code is attractive in that it performs in linear time with respect to the input string size, and is capable of exponential compression -the generated grammar size can be exponentially small with respect to the input string size.…”
Section: Introductionmentioning
confidence: 99%
“…Following the idea of XMill, AXECHOP divides the source XML document into structural and data parts. However, for the structure part, AXECHOP uses a byte tokenization scheme that preserves the original structure of the document and then uses the MPM compression algorithm [33] to generate a context-free grammar which is represents this structure information. The generated grammar is then passed through an adaptive arithmetic coder before being written to the compressed file.…”
Section: Schema Independent Compression Schemesmentioning
confidence: 99%