2007
DOI: 10.1007/s00446-007-0048-7
|View full text |Cite
|
Sign up to set email alerts
|

Sketching asynchronous data streams over sliding windows

Abstract: We study the problem of maintaining a sketch of recent elements of a data stream. Motivated by applications involving network data, we consider streams that are asynchronous, in which the observed order of data is not the same as the time order in which the data was generated. The notion of recent elements of a stream is modeled by the sliding timestamp window, which is the set of elements with timestamps that are close to the current time. We design algorithms for maintaining sketches of all elements within t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
43
0

Year Published

2009
2009
2019
2019

Publication Types

Select...
3
3
1

Relationship

5
2

Authors

Journals

citations
Cited by 25 publications
(43 citation statements)
references
References 22 publications
(38 reference statements)
0
43
0
Order By: Relevance
“…Stream sampling has a long history of research, starting from the popular reservoir sampling algorithm, attributed to Waterman (see Algorithm R from [13]) that has been known since the 1960s. Follow-up work includes speeding up reservoir sampling [13], weighted reservoir sampling [14], sampling over a sliding window and stream evolution [15], [16], [17], [18], [19]. Stream sampling has been used extensively in large scale data mining applications, see for example [20], [21], [22], [23].…”
Section: Related Workmentioning
confidence: 99%
“…Stream sampling has a long history of research, starting from the popular reservoir sampling algorithm, attributed to Waterman (see Algorithm R from [13]) that has been known since the 1960s. Follow-up work includes speeding up reservoir sampling [13], weighted reservoir sampling [14], sampling over a sliding window and stream evolution [15], [16], [17], [18], [19]. Stream sampling has been used extensively in large scale data mining applications, see for example [20], [21], [22], [23].…”
Section: Related Workmentioning
confidence: 99%
“…To our knowledge, all previous work on computing time-decayed aggregates on streams, including [14,6,18,22,19,4,34,10,27,35], considered decomposable decay functions. For example, exponential decay, sliding window decay, and polynomial decay are all decomposable.…”
Section: Definition 13 a Decay Function F (W X) Is A Decomposable mentioning
confidence: 99%
“…This algorithm was generalized by Pavan and Tirthapura [32] to compute the duplicate insensitive sum as well as other aggregates such as max-dominance norm. Xu, Tirthapura, and Busch [34] proposed the concept of asynchronous streams and gave a randomized algorithm to approximate the sum and median over a sliding window. Here, we extend this line of work to handle both general decay and duplicate arrivals.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations