2007 IEEE 23rd International Conference on Data Engineering 2007
DOI: 10.1109/icde.2007.367889

The Haar+ Tree: A Refined Synopsis Data Structure

Abstract: We introduce the Haar+ tree: a refined, wavelet-inspired data structure for synopsis construction. The advantages of this structure are twofold. First, it achieves higher synopsis quality than state-of-the-art histogram and Haar wavelet techniques when summarizing data sets with sharp discontinuities. Second, thanks to its search space delimitation capacity, Haar+ synopsis construction operates in time linear in the size of the data set for any monotonic distributive error metric. Through experimen…

Cited by 22 publications (72 citation statements)
References 29 publications (49 reference statements)
“…The synopsis construction model based on the Haar+ tree [26]. This model supersedes previous wavelet-based techniques [9,13].…”
mentioning
confidence: 91%
“…For example, if we occupy node c4 in Figure 1, then it is allowed to occupy any of its descendant nodes, as well as nodes that either fully contain, or are disjoint from, range R4, i.e., nodes c0, c1, c2, c28 and c35. The approximation of a data value di represented by an LH is constructed as the value of the lowest occupied node affecting di, by means of an interval tree; hence, data reconstruction requires O(log B) time (as for other summarization techniques [32,16,26,24]). An optimal LH synopsis of D in space B should achieve the minimum error * achievable in B space for the employed error metric.…”
Section: The Lattice Histogram
mentioning
confidence: 99%
“…The former is presented in [52]. Following the same approach used in the experiments of [54], and coherently with the approximate nature of [45,48], we compare our method with [49] (since the chosen metrics is the relative error). This obviously works as (indirect) comparison with [45,48], which present the best solution of the trade-off between feasibility and closeness to the optimal accuracy.…”
Section: Related Work
mentioning
confidence: 99%