2007 IEEE 23rd International Conference on Data Engineering 2007
DOI: 10.1109/icde.2007.368961
|View full text |Cite
|
Sign up to set email alerts
|

Space Efficient Streaming Algorithms for the Maximum Error Histogram

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
54
0
2

Year Published

2008
2008
2019
2019

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 42 publications
(56 citation statements)
references
References 15 publications
0
54
0
2
Order By: Relevance
“…The algorithm they propose is based on using a fixed-length sliding window of data points. In [4], Buragohain et al also address the histogram construction problem. However they represent each bucket by a line segment rather than a single value.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The algorithm they propose is based on using a fixed-length sliding window of data points. In [4], Buragohain et al also address the histogram construction problem. However they represent each bucket by a line segment rather than a single value.…”
Section: Related Workmentioning
confidence: 99%
“…We refer to the endpoints of the line segments as the recordings. If g (k-1) and g k are disconnected, k∈ [2,K] (3,5) = (0,0,5,0), and (9,9,9,9)-V 4 (3,5) = (9,9,4,9). Figure 1 shows a sample signal and a possible piece-wise linear approximation illustrating most of the notations described above.…”
Section: Problem Statement and Notationsmentioning
confidence: 99%
“…Thresholded approximation has been used in the context of histograms before, in the context of "dual" problems where the summary size is minimized to achieve a predetermined error [4,20,11]. Concretely, recall that the maximum error histogram construction problem is: given a set of numbers X = x 1 , x 2 , .…”
Section: The Setup: Requirementsmentioning
confidence: 99%
“…We use three main ideas: (i) we use the notion of a "thresholded approximation" where the goal is to minimize the error assuming we know the optimum error 1 , (ii) we run multiple copies (but controlled in number) of the algorithm corresponding to different estimates of the final error and, (iii) we use a "streamstrapping" procedure to use partially completed summarization for a certain estimate to create summarization for a different estimate of error. The first two ideas have been explicitly used in the context of summarization before, see [4,9,10,20,11] among many others. We are unaware of the use of the third idea in any previous work and we believe that this notion will be useful in a variety of different problems.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation