2009
DOI: 10.14778/1687627.1687703
|View full text |Cite
|
Sign up to set email alerts
|

Optimality and scalability in lattice histogram construction

Abstract: The Lattice Histogram is a recently proposed data summarization technique that achieves approximation quality preferable to that of an optimal plain histogram. Like other hierarchical synopsis methods, a lattice histogram (LH) aims to approximate data using a hierarchical structure. Still, this structure is not defined a priori; it consists an unknown, not a given, of the problem. Past work has defined the properties that an LH needs to obey and developed general-purpose approximation algorithms for the constr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2009
2009
2017
2017

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 42 publications
0
4
0
Order By: Relevance
“…For instance, c-Tree could be used for clustering [69] and classification tasks, with (hopefully) no appreciable difference in the quality of classification. It would also be interesting to investigate if our bit-saving representation could be applied to other synopses using a hierarchical structure, like the very recent Lattice Histogram [53]. Finally, the high capability of c-Tree to work properly in small space suggests us to study the effectiveness of its hardware implementation, which could be a profitable solution in the field of sensor mining.…”
Section: Discussionmentioning
confidence: 99%
“…For instance, c-Tree could be used for clustering [69] and classification tasks, with (hopefully) no appreciable difference in the quality of classification. It would also be interesting to investigate if our bit-saving representation could be applied to other synopses using a hierarchical structure, like the very recent Lattice Histogram [53]. Finally, the high capability of c-Tree to work properly in small space suggests us to study the effectiveness of its hardware implementation, which could be a profitable solution in the field of sensor mining.…”
Section: Discussionmentioning
confidence: 99%
“…A simplified variant of the Haar + tree, called the compact hierarchical histogram (CHH), is introduced in [17,26]. The concept of winning intervals used in our algorithms was previously investigated in [14,17].…”
Section: Related Workmentioning
confidence: 99%
“…For a set S of synopses with the same size, we define the winning interval of a synopsis s in S as the incoming value interval such that s has the smallest error among all synopses in S. Note that the concept of winning intervals was previously investigated for a hierarchical structure, such as CHH or LH, in [14,17]. Meanwhile, we compute the winning interval of each synopsis in the synopses constructed in each node of a coe cient tree.…”
Section: A Required Extended Synopsis Setmentioning
confidence: 99%
“…An algorithm that aims to minimize L2 in practice works on the sum-of-squared-errors (SSE) i |di − di| 2 . Previous studies [20,5,23,9,36,24,25,26,21,22] have generalized their results into wider classes of maximum, distributive, Minkowskidistance, and relative-error metrics. Still, the Euclidean error L2 remains an important error metric for several applications, such as database query optimization [16], context recognition [12], and time series mining [4].…”
Section: Background and Related Workmentioning
confidence: 99%