2013 IEEE 12th International Symposium on Network Computing and Applications 2013
DOI: 10.1109/nca.2013.11
|View full text |Cite
|
Sign up to set email alerts
|

Sketch ⋆-Metric: Comparing Data Streams via Sketching

Abstract: In this paper, we consider the problem of estimating the distance between any two large data streams in smallspace constraint. This problem is of utmost importance in data intensive monitoring applications where input streams are generated rapidly. These streams need to be processed on the fly and accurately to quickly determine any deviance from nominal behavior. We present a new metric, the Sketch ⋆-metric, which allows to define a distance between updatable summaries (or sketches) of large data streams. An … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 30 publications
0
4
0
Order By: Relevance
“…Papapetrou et al present a novel sketching technique ECM-sketch that allows effective summarization of distributed data streams over sliding windows with probabilistic accuracy guarantees [16]. For measuring the distance between updatable sketches, Anceaume et al present a novel metric Sketch *-metric that reflects the relationships between any two discrete probability distributions in the massive data streams [3].…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Papapetrou et al present a novel sketching technique ECM-sketch that allows effective summarization of distributed data streams over sliding windows with probabilistic accuracy guarantees [16]. For measuring the distance between updatable sketches, Anceaume et al present a novel metric Sketch *-metric that reflects the relationships between any two discrete probability distributions in the massive data streams [3].…”
Section: Related Workmentioning
confidence: 99%
“…Since the data of objects are stored in the EU-sketches, the computation of similarity should be designed based on the EU-sketch. We combine the Sketch *-metric [3] and the KL divergence [2] to quantify the similarity between two uncertain objects.…”
Section: B Divergence-based Similaritymentioning
confidence: 99%
See 2 more Smart Citations