2022
DOI: 10.1109/tdsc.2021.3111328
|View full text |Cite
|
Sign up to set email alerts
|

ExtendedSketch: Fusing Network Traffic for Super Host Identification With a Memory Efficient Sketch

Abstract: Super host refers to the host that has a high cardinality or exhibits a big change in a network. Facing big-volume network traffic, sketches have been widely applied to identify super hosts in an efficient and accurate way. However, most sketches cannot flexibly balance memory usage and accuracy in host cardinality estimation. Setting an inappropriate counter size for a sketch could either lead to inaccurate host cardinality estimation or cause memory waste. In order to solve this issue, we propose a novel ext… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 39 publications
0
1
0
Order By: Relevance
“…This is consistent with the theoretical analysis in Eq. (20). We find that vHLL and AROMA+ are also unbiased, but rSkt1 has −5% bias due to hash collision.…”
Section: Flow Spread Estimation Errormentioning
confidence: 73%
See 3 more Smart Citations
“…This is consistent with the theoretical analysis in Eq. (20). We find that vHLL and AROMA+ are also unbiased, but rSkt1 has −5% bias due to hash collision.…”
Section: Flow Spread Estimation Errormentioning
confidence: 73%
“…Perhaps due to its higher memory cost, per-flow spread estimation problem has a relatively smaller number of existing works than per-flow size estimation, such as virtual Bitmap (vBitmap) [4], virtual HyperLogLog (vHLL) [5], WavingSketch [19], ExtendedSketch [20], Self-Morphing Bitmaps [21] and randomized error-reduction sketch (rSkt) [14]. As mentioned before, vBitmap and vHLL have high query time cost, preventing them from online query.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Thus, how to provide services with a large amount of data in a limited space is challenging. To tackle the challenge, effective data compression methods (e.g., bloom filters [47], cuckoo filters [48] and sketch [49,50]) become crucially important. We design a cache query structure based on a new probabilistic data structure to efficiently perform presence detection and feature query, thereby supporting big data processing.…”
Section: Enclave Cache Query Modulementioning
confidence: 99%