2016
DOI: 10.1016/j.neucom.2016.01.072
|View full text |Cite
|
Sign up to set email alerts
|

CVS: Fast cardinality estimation for large-scale data streams over sliding windows

Abstract: Estimating the cardinality of data streams over a sliding window is an important problem in many applications, such as network traffic monitoring, web access log analysis and database. The problem becomes more difficult in largescale data streams when time and space complexity is taken into account. In this paper, we present a novel randomized data structure to address the problem. The significant contributions are as follows. (1) A space-efficient counter vector sketch (CVS) are proposed, which extends the we… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 30 publications
(51 reference statements)
0
4
0
Order By: Relevance
“…In order to improve the speed, Jingsong et al proposed CVS [22] algorithm and LRU-Sketch [23] algorithm. CVS adopts the strategy of random updating, and randomly selects a part of the counter to update each time.…”
Section: Sliding Time Window and Distinct Time Windowmentioning
confidence: 99%
“…In order to improve the speed, Jingsong et al proposed CVS [22] algorithm and LRU-Sketch [23] algorithm. CVS adopts the strategy of random updating, and randomly selects a part of the counter to update each time.…”
Section: Sliding Time Window and Distinct Time Windowmentioning
confidence: 99%
“…Therefore, we use the sliding time window to cope with these problems by adopting fine-grained moving steps [31]: the size of steps is sufficiently small so that the process can be viewed as a continuously sliding time window as suggested by its name. But detecting super points under the sliding time window is more difficult considering the intense observing frequency [32]. What's more, to work under the sliding time window, an algorithm must preserve hosts' state of a previous period [33].…”
Section: A Discrete Time Window and Sliding Time Windowmentioning
confidence: 99%
“…Let H (x, n, A) represent a hash function that can randomly map an integer x to a number between 0 and n − 1 according to random seed parameter A [19], where n is an positive integer. For each bip in ST (t, 1), RE 0 maps ''bip'' randomly to an integer between 0 and 2 32 − 1 using the hash function H (bip, 2 32 , A 0 ) with random seed parameter A 0 . The hash function H (bip, g, A 1 ) with random seed parameter A 1 is used to map ''bip'' to an integer of 32 , A 0 ))).…”
Section: B Rough Estimatormentioning
confidence: 99%
See 1 more Smart Citation