Proceedings 14th International Conference on Data Engineering
DOI: 10.1109/icde.1998.655800
|View full text |Cite
|
Sign up to set email alerts
|

Compressing relations and indexes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
97
0

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 114 publications
(97 citation statements)
references
References 9 publications
0
97
0
Order By: Relevance
“…In general, the list-adaptive codecs demonstrated a good tradeoff in inverted index size and query response time. As our best practice recommendation, we found that the simpler Frame of Reference (FOR) [24] codec results in fast average response times for all types of posting list payload, slightly outperforming the more complex but state-of-the-art PFOR implementations. Finally, we note that the gains obtainable from the compression of the various information within a posting list are cumulative.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…In general, the list-adaptive codecs demonstrated a good tradeoff in inverted index size and query response time. As our best practice recommendation, we found that the simpler Frame of Reference (FOR) [24] codec results in fast average response times for all types of posting list payload, slightly outperforming the more complex but state-of-the-art PFOR implementations. Finally, we note that the gains obtainable from the compression of the various information within a posting list are cumulative.…”
Section: Discussionmentioning
confidence: 99%
“…Indeed, the contributions of this paper are three-fold: Firstly, we separately analyse the compression benefits of document ids, term frequencies, field frequencies and positions, in terms of average query response time and index size; Secondly, we analyse how different term distributions caused by anchor text affect the achievable compression; Finally, we use these thorough experiments on two standard corpora, namely GOV2 and ClueWeb09 to derive best practices for index compression. The results of our study demonstrate that the contribution of compression in time-and space-efficiency varies greatly depending on the type of posting payload information, while leading us to suggest the use of the simpler Frame of reference (FOR) [24] list-adaptive algorithms for the best benefit to average query response times. This paper is structured as follows: Section 2 provides a background on efficient ranked retrieval; Section 3 summarises the compression codecs we analyse later within an IR system; Section 4 describes our experimental setup.…”
Section: Introductionmentioning
confidence: 97%
See 1 more Smart Citation
“…In the past years, compression techniques have focussed on CPU optimised compression algorithms [64,65,66,67]. It has been shown in [65,66,67] that the decompression performance depends on the complexity of the execution flow of the algorithm.…”
Section: High-performance Compression For Node Indexing Schemementioning
confidence: 99%
“…FOR determines the range of possible values in a block, called a frame, and maps each value into this range by storing just enough bits to distinguish between the values [64]. Given a frame [min, max], FOR needs log 2 (max − min + 1) bits, that we call bit frame in the rest of the paper, to encode each integer in a block.…”
Section: Frame Of Reference (For)mentioning
confidence: 99%