2007
DOI: 10.1002/asi.20496

Lempel‐Ziv compression of highly structured documents

Abstract: The authors describe Lempel-Ziv to Compress Structure (LZCS), a novel Lempel-Ziv approach suitable for compressing structured documents. LZCS takes advantage of repeated substructures that may appear in the documents, by replacing them with a backward reference to their previous occurrence. The result of the LZCS transformation is still a valid structured document, which is human-readable and can be transmitted by ASCII channels. Moreover, LZCS transformed documents are easy to search, display, access at rando…
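
The idea in the abstract, replacing a repeated substructure with a backward reference to its previous occurrence, can be illustrated with a small sketch. This is not the authors' implementation: the nested-tuple document model, the `Ref` marker, and the use of a preorder node index (rather than a position in the text) are assumptions made only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Ref:
    """Hypothetical backward reference: preorder index of an earlier element."""
    index: int


def compress(tree):
    """Replace every repeated subtree with a Ref to its first occurrence.

    A document is a nested tuple (label, child, ...); text leaves are str.
    """
    seen = {}        # subtree -> preorder index of its first occurrence
    next_index = 0   # counts elements (and references) in output preorder

    def visit(node):
        nonlocal next_index
        if isinstance(node, str):      # text is copied unchanged in this sketch
            return node
        index = next_index
        next_index += 1
        if node in seen:               # whole subtree seen before: point back
            return Ref(seen[node])
        seen[node] = index
        label, *children = node
        return (label, *[visit(c) for c in children])

    return visit(tree)


def expand(tree):
    """Invert compress() by resolving each Ref against already-expanded nodes."""
    by_index = {}
    next_index = 0

    def visit(node):
        nonlocal next_index
        if isinstance(node, str):
            return node
        index = next_index
        next_index += 1
        if isinstance(node, Ref):
            result = by_index[node.index]
        else:
            label, *children = node
            result = (label, *[visit(c) for c in children])
        by_index[index] = result
        return result

    return visit(tree)


invoice = ("invoice", ("item", "pen"), ("item", "pen"))
doc = ("invoices", invoice, invoice)
packed = compress(doc)
assert packed == ("invoices", ("invoice", ("item", "pen"), Ref(2)), Ref(1))
assert expand(packed) == doc
```

In this sketch the output is still a tree of the same kind as the input, which loosely mirrors the abstract's point that the transformed document remains a valid structured document.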

Cited by 8 publications (6 citation statements)
References 29 publications
“…The operations Access, Depth, Height, Size, Parent, Firstchild, and NextSibling all take a node v in T as input and return its label, its depth, its height, the size of its subtree, its parent, its first child, and its sibling to the right, respectively. The LevelAncestor returns an ancestor at a specified distance from v, and NCA returns the nearest common ancestor to a given pair of nodes.…”
Section: Our Results (mentioning)
confidence: 99%
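
As a reading aid for the operations named in the statement above, here is a minimal sketch of their semantics on a plain, uncompressed pointer-based tree. The `Node` class and the function names are assumptions for illustration; the citing paper concerns supporting such queries on a compressed representation, which this sketch does not attempt, and the running times here are naive.

```python
class Node:
    """Hypothetical uncompressed tree node with parent and child pointers."""

    def __init__(self, label, children=()):
        self.label = label
        self.parent = None
        self.children = list(children)
        for child in self.children:
            child.parent = self


def access(v):
    return v.label


def parent(v):
    return v.parent


def first_child(v):
    return v.children[0] if v.children else None


def next_sibling(v):
    if v.parent is None:
        return None
    siblings = v.parent.children
    i = next(k for k, s in enumerate(siblings) if s is v)
    return siblings[i + 1] if i + 1 < len(siblings) else None


def depth(v):
    d = 0
    while v.parent is not None:
        v, d = v.parent, d + 1
    return d


def height(v):
    if not v.children:
        return 0
    return 1 + max(height(c) for c in v.children)


def size(v):
    return 1 + sum(size(c) for c in v.children)


def level_ancestor(v, distance):
    """Ancestor `distance` steps above v, or None if the root is passed."""
    while distance > 0 and v is not None:
        v, distance = v.parent, distance - 1
    return v


def nca(u, v):
    """Nearest common ancestor: level the depths, then climb together."""
    du, dv = depth(u), depth(v)
    while du > dv:
        u, du = u.parent, du - 1
    while dv > du:
        v, dv = v.parent, dv - 1
    while u is not v:
        u, v = u.parent, v.parent
    return u


d, e = Node("d"), Node("e")
b, c = Node("b", [d, e]), Node("c")
a = Node("a", [b, c])
assert (depth(e), height(a), size(b)) == (2, 2, 3)
assert next_sibling(d) is e and level_ancestor(e, 2) is a and nca(d, c) is a
```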
“…Its size can be exponentially smaller than n. DAG representation of trees are broadly used for identifying and sharing common subexpressions, e.g., in programming languages [25] and binary decision diagrams [24]. Compression based on DAGs has also been studied more recently in [7,14,22] and a Lempel-Ziv analog of subtree repeats was suggested in [1]. It is possible to support navigational queries [6] and path queries [7] directly on the DAG representation in logarithmic time.…”
Section: Previous Work (mentioning)
confidence: 99%
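
The DAG representation mentioned in the statement above shares identical subtrees so that each distinct subtree is stored once. A minimal sketch of that sharing follows, assuming a nested-tuple tree model and hypothetical helper names; it illustrates why the DAG can be exponentially smaller than the tree, and it is not taken from any of the cited works.

```python
def to_dag(tree):
    """Share identical subtrees of a nested tuple (label, child, ...).

    Returns (root_id, nodes), where nodes[i] is (label, tuple_of_child_ids).
    Each distinct subtree gets exactly one entry in `nodes`.
    """
    ids = {}     # (label, child ids) -> node id
    nodes = []

    def visit(node):
        label, *children = node
        key = (label, tuple(visit(c) for c in children))
        if key not in ids:
            ids[key] = len(nodes)
            nodes.append(key)
        return ids[key]

    return visit(tree), nodes


def complete_binary(h):
    """Complete binary tree of height h with every node labeled 'x'."""
    if h == 0:
        return ("x",)
    sub = complete_binary(h - 1)
    return ("x", sub, sub)


def tree_size(node):
    return 1 + sum(tree_size(c) for c in node[1:])


tree = complete_binary(10)
root, nodes = to_dag(tree)
assert tree_size(tree) == 2047   # 2**11 - 1 nodes in the tree
assert len(nodes) == 11          # one DAG node per level
assert nodes[root] == ("x", (9, 9))
```

The complete binary tree is the extreme case: all subtrees at the same level are identical, so a tree with 2047 nodes collapses to 11 DAG nodes.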
“…The research [8] presents a new lossless image compression algorithm based on a linear prediction, modified Golomb-Rice code family, and a very fast prediction error modeling method for large images, for natural images of high bit depths, and for noisy images. Lempel-Ziv to Compress Structure [9] focuses on the compression of structured text such as highly structured data including XML forms, invoices, e-commerce, and Web-service exchange documents. The research in [10] describes the operating principle of content-addressable memory as a compression accelerator and the control of the Lempel-Ziv data compression circuit as an application for an explanation of the hardware.…”
Section: Other Compression Methods (mentioning)
confidence: 99%
“…LZCS (Adiego et al, 2004; Adiego et al, 2006) This compressor uses an idea similar to LZ77, restricted to replacing whole subtrees. This permits the compressed text being accessed, searched, and navigated without decompressing it.…”
Section: Structured Text Compression (mentioning)
confidence: 99%