Optimality and scalability in lattice histogram construction

Karras, Panagiotis

doi:10.14778/1687627.1687703

Cited by 6 publications

(4 citation statements)

References 42 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For instance, c-Tree could be used for clustering [69] and classification tasks, with (hopefully) no appreciable difference in the quality of classification. It would also be interesting to investigate if our bit-saving representation could be applied to other synopses using a hierarchical structure, like the very recent Lattice Histogram [53]. Finally, the high capability of c-Tree to work properly in small space suggests us to study the effectiveness of its hardware implementation, which could be a profitable solution in the field of sensor mining.…”

Section: Discussionmentioning

confidence: 99%

Approximating sliding windows by cyclic tree-like histograms for efficient range queries

Buccafurri

Lax

2010

Data & Knowledge Engineering

View full text Add to dashboard Cite

Section: Discussionmentioning

confidence: 99%

Approximating sliding windows by cyclic tree-like histograms for efficient range queries

Buccafurri

Lax

2010

Data & Knowledge Engineering

View full text Add to dashboard Cite

“…A simplified variant of the Haar + tree, called the compact hierarchical histogram (CHH), is introduced in [17,26]. The concept of winning intervals used in our algorithms was previously investigated in [14,17].…”

Section: Related Workmentioning

confidence: 99%

“…For a set S of synopses with the same size, we define the winning interval of a synopsis s in S as the incoming value interval such that s has the smallest error among all synopses in S. Note that the concept of winning intervals was previously investigated for a hierarchical structure, such as CHH or LH, in [14,17]. Meanwhile, we compute the winning interval of each synopsis in the synopses constructed in each node of a coe cient tree.…”

Section: A Required Extended Synopsis Setmentioning

confidence: 99%

Efficient Haar ⁺ synopsis construction for the maximum absolute error measure

2017

View full text Add to dashboard Cite

Several wavelet synopsis construction algorithms were previously proposed based on dynamic programming for unrestricted Haar wavelet synopses as well as Haar + synopses. However, they find an optimal synopsis for every incoming value in each node of a coe cient tree, even if di↵erent incoming values share an identical optimal synopsis. To alleviate the limitation, we present novel algorithms, which keep only a minimal set of the distinct optimal synopses in each node of the tree, for the error-bounded synopsis problem. Furthermore, we propose the methods to restrict coe cient values to be considered to compute the optimal synopses in each node. In addition, by partitioning all optimal synopses in each node into a set of groups, such that every group can be represented by a compact representation, we significantly improve the performance of the proposed algorithms.

show abstract

“…An algorithm that aims to minimize L2 in practice works on the sum-of-squared-errors (SSE) i |di − di| 2 . Previous studies [20,5,23,9,36,24,25,26,21,22] have generalized their results into wider classes of maximum, distributive, Minkowskidistance, and relative-error metrics. Still, the Euclidean error L2 remains an important error metric for several applications, such as database query optimization [16], context recognition [12], and time series mining [4].…”

Section: Background and Related Workmentioning

confidence: 99%

Fast and effective histogram construction

Halim

Karras

Yap

2009

Proceedings of the 18th ACM Conference on Information and Knowledge Management

Self Cite

View full text Add to dashboard Cite

Histogram construction or sequence segmentation is a basic task with applications in database systems, information retrieval, and knowledge management. Its aim is to approximate a sequence by line segments. Unfortunately, the quadratic algorithm that derives an optimal histogram for Euclidean error lacks the desired scalability. Therefore, sophisticated approximation algorithms have been recently proposed, while several simple heuristics are used in practice. Still, these solutions fail to resolve the efficiency-quality tradeoff in a satisfactory manner. In this paper we take a fresh view on the problem. We propose conceptually clear and scalable algorithms that efficiently derive high-quality histograms. We experimentally demonstrate that existing approximation schemes fail to deliver the desired efficiency and conventional heuristics do not fare well on the side of quality. On the other hand, our schemes match or exceed the quality of the former and the efficiency of the latter.

show abstract

Optimality and scalability in lattice histogram construction

Cited by 6 publications

References 42 publications

Approximating sliding windows by cyclic tree-like histograms for efficient range queries

Approximating sliding windows by cyclic tree-like histograms for efficient range queries

Efficient Haar ⁺ synopsis construction for the maximum absolute error measure

Fast and effective histogram construction

Contact Info

Product

Resources

About

Optimality and scalability in lattice histogram construction

Cited by 6 publications

References 42 publications

Approximating sliding windows by cyclic tree-like histograms for efficient range queries

Approximating sliding windows by cyclic tree-like histograms for efficient range queries

Efficient Haar + synopsis construction for the maximum absolute error measure

Fast and effective histogram construction

Contact Info

Product

Resources

About

Efficient Haar ⁺ synopsis construction for the maximum absolute error measure