2003
DOI: 10.1109/tkde.2003.1198389
|View full text |Cite
|
Sign up to set email alerts
|

One-pass wavelet decompositions of data streams

Abstract: We present techniques for computing small space representations of massive data streams. These are inspired by traditional wavelet-based approximations that consist of specific linear projections of the underlying data. We present general "sketch"based methods for capturing various linear projections and use them to provide pointwise and rangesum estimation of data streams. These methods use small amounts of space and per-item time while streaming through the data and provide accurate representation as our exp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
97
0
4

Year Published

2003
2003
2017
2017

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 110 publications
(102 citation statements)
references
References 37 publications
0
97
0
4
Order By: Relevance
“…Wavelets have been used extensively for approximate answering of different query types and/or in different environments: multidimensional aggregate queries (range-sum queries) in OLAP environments [75,76], aggregate and non-aggregate relational queries with computations directly on the stored wavelet coefficients [14], and selection and aggregate queries over streams [28]. As with histograms, there have also been efforts to devise wavelet-based techniques whose approximate query answers are provided with error guarantees [24], as well as to construct and maintain the most important wavelet coefficients dynamically [57].…”
Section: Competitors Of Histogramsmentioning
confidence: 99%
“…Wavelets have been used extensively for approximate answering of different query types and/or in different environments: multidimensional aggregate queries (range-sum queries) in OLAP environments [75,76], aggregate and non-aggregate relational queries with computations directly on the stored wavelet coefficients [14], and selection and aggregate queries over streams [28]. As with histograms, there have also been efforts to devise wavelet-based techniques whose approximate query answers are provided with error guarantees [24], as well as to construct and maintain the most important wavelet coefficients dynamically [57].…”
Section: Competitors Of Histogramsmentioning
confidence: 99%
“…The online update of such structures in a dynamic scenario is also a required property. Sampling [39], hot lists [11,30], wavelets [9,18,24], sketches [10] and histograms [21][22][23] are examples of synopses methods to obtain fast and approximated answers.…”
Section: Data Summarizationmentioning
confidence: 99%
“…The difficulty is that it does not seem easy to compose with the 2-wise independent functions f and g. An example 4-wise independent function would be s(x) = x T Ax+Bx+c for certain random matrices A, B, and c from a special family of second-order Reed-Muller codes [26]. We do not know how to quickly count the number of solutions x to the equation s(x) = d for general matrices A, B and vector c, while we can count such x for heavilystructured A, B, and c coming from a special family based on Reed-Muller codes.…”
Section: Other Approachesmentioning
confidence: 99%