2018
DOI: 10.1145/3264427
|View full text |Cite
|
Sign up to set email alerts
|

An Optimal Algorithm for ℓ 1 -Heavy Hitters in Insertion Streams and Related Problems

Abstract: We give the first optimal bounds for returning the ℓ1-heavy hitters in a data stream of insertions, together with their approximate frequencies, closing a long line of work on this problem. For a stream of m items in {1, 2, . . . , n} and parameters 0 < ε < ϕ 1, let fi denote the frequency of item i, i.e., the number of times item i occurs in the stream. With arbitrarily large constant probability, our algorithm returns all items i for which fi ϕm, returns no items j for which fj (ϕ − ε)m, and returns approxim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(11 citation statements)
references
References 61 publications
0
11
0
Order By: Relevance
“…Trivially, one can treat an update (i, ∆) as ∆ updates of the form (i, 1), but this does not maintain the time bound (even in an amortized sense) if ∆ = ω(1) on average. We now sketch an adaptation that has the same accuracy guarantees as the simple algorithm of Bhattacharyya et al [3], and runs in constant amortized time per stream update with high probability, assuming that the number of stream updates n is Ω(ε −2 log(1/δ)).…”
Section: Other Prior Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Trivially, one can treat an update (i, ∆) as ∆ updates of the form (i, 1), but this does not maintain the time bound (even in an amortized sense) if ∆ = ω(1) on average. We now sketch an adaptation that has the same accuracy guarantees as the simple algorithm of Bhattacharyya et al [3], and runs in constant amortized time per stream update with high probability, assuming that the number of stream updates n is Ω(ε −2 log(1/δ)).…”
Section: Other Prior Workmentioning
confidence: 99%
“…If t samples are required before the sum of the variables is more than ∆, then the algorithm feeds update (i, t) into an instance of any counter-based algorithm capable handling weighted updates. The remainder of the details, and the error analysis, are similar to [3], and we omit them for brevity.…”
Section: Other Prior Workmentioning
confidence: 99%
See 2 more Smart Citations
“…For example, Jayaram and Woodruff showed that for p ∈ (0, 1] an approximate counter can be used effectively as a subroutine in an algorithm for approximating the pth moment of an insertion-only stream up to 1 + ε in Õ(1/ε 2 + log n) bits of space [JW19], improving over a derandomization of an algorithm of Indyk that uses O(ε −2 log n) bits [Ind06,KNW10]. Approximate counting also finds use in approximating large frequency moments [AMS99, GS09], approximate reservoir sampling [GS09], approximating the number of inversion when streaming over a permutation [AJKS02], and 1 heavy hitters in insertion-only streams [BDW19].…”
Section: Introductionmentioning
confidence: 99%