2015
DOI: 10.1016/j.tcs.2015.01.026

Efficient sampling of non-strict turnstile data streams

Abstract: We study the problem of generating a large sample from a data stream S of elements (i, v), where i is a positive integer key, v is an integer equal to the count of key i, and the sample consists of pairs (i, C_i) for C_i = Σ_{(i,v)∈S} v. We consider strict turnstile streams and general non-strict turnstile streams, in which C_i may be negative. Our sample is useful for approximating both forward and inverse distribution statistics, within an additive error and provable success probability 1 − δ. Our sampling metho…

Cited by 9 publications (14 citation statements). References 29 publications.
“…Constructions of k-sample recovery mechanisms are known which require space Õ(k) and fail only with probability polynomially small in n [5]. We apply this algorithm to the neighborhood of vertices: for each node v, we can maintain an instance of the k-sample recovery sketch (or algorithm) to the vector corresponding to the row of the adjacency matrix for v. Note that as edges are inserted or deleted, we can propagate these to the appropriate k-sample recovery algorithms, without needing knowledge of the full neighborhood of nodes.…”
Section: Preliminaries
confidence: 99%
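The excerpt above describes keeping one k-sample recovery structure per vertex and routing each edge insertion or deletion to the two affected rows of the adjacency matrix. A minimal Python sketch of that bookkeeping follows; a toy exact dictionary stands in for the actual Õ(k)-space randomized sketch, and all class and method names here are illustrative, not from the cited papers:

```python
# Toy stand-in for a k-sample recovery sketch: exact, not sublinear in
# space, but exposing the same update/recover interface.
from collections import defaultdict

class ToyKSampleRecovery:
    def __init__(self, k):
        self.k = k
        self.counts = defaultdict(int)   # key -> signed count

    def update(self, key, delta):
        self.counts[key] += delta
        if self.counts[key] == 0:
            del self.counts[key]         # drop cancelled keys

    def recover(self):
        # Return up to k keys with nonzero count, as the sketch would.
        return list(self.counts)[: self.k]

class DynamicGraph:
    """One recovery structure per vertex row of the adjacency matrix."""
    def __init__(self, n, k):
        self.rows = [ToyKSampleRecovery(k) for _ in range(n)]

    def insert_edge(self, u, v):
        # Each edge update touches exactly two rows; no knowledge of the
        # full neighborhood is needed.
        self.rows[u].update(v, +1)
        self.rows[v].update(u, +1)

    def delete_edge(self, u, v):
        self.rows[u].update(v, -1)
        self.rows[v].update(u, -1)

g = DynamicGraph(n=5, k=2)
g.insert_edge(0, 1); g.insert_edge(0, 3); g.delete_edge(0, 1)
print(g.rows[0].recover())  # [3]
```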
“…Some increase in space usage seems inevitable as we have to store each individual sample; however, for better update times there are alternative solutions with faster processing times than the naive solution. In this direction, Barkay et al [9] have shown an L_0 sampler with O(log(s/δ)) update time and O(s log(s/δ)) sample extraction time, at the expense of relaxing the independence requirement of the samples. The extracted samples are guaranteed to be O(log(1/δ))-wise independent which is sufficient for most applications. (§ This approach of sampling and looking for duplicates may be folklore; it was described to the authors by T. S. Jayram.)…”
Section: Sampling Multiple Items
confidence: 99%
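An L_0 sampler of the kind mentioned above can be built from geometric subsampling plus 1-sparse recovery. The toy Python version below illustrates that standard template; it is not Barkay et al.'s construction, it omits the fingerprint check that rules out rare false positives, and every name in it is made up:

```python
import random

class ToyL0Sampler:
    def __init__(self, max_key, seed=0):
        self.levels = max_key.bit_length() + 1
        self.salt = random.Random(seed).getrandbits(64)
        self.s0 = [0] * self.levels  # per level: sum of counts
        self.s1 = [0] * self.levels  # per level: sum of key * count

    def update(self, key, delta):
        # A key survives to level j with probability about 2^-j,
        # determined by trailing zero bits of a salted hash.
        h = (hash((self.salt, key)) & 0xFFFFFFFF) | (1 << 32)
        lvl = min((h & -h).bit_length() - 1, self.levels - 1)
        for j in range(lvl + 1):
            self.s0[j] += delta
            self.s1[j] += key * delta

    def sample(self):
        # Scan from the sparsest level down for a vector that passes the
        # 1-sparse test; a real sampler confirms with a fingerprint.
        for j in range(self.levels - 1, -1, -1):
            if self.s0[j] != 0 and self.s1[j] % self.s0[j] == 0:
                key = self.s1[j] // self.s0[j]
                if key > 0:
                    return key, self.s0[j]
        return None  # failure, as real samplers allow with prob. delta

sampler = ToyL0Sampler(max_key=1000)
sampler.update(7, 3); sampler.update(42, 5); sampler.update(7, -3)
print(sampler.sample())  # (42, 5): key 7 cancelled, leaving key 42
```

Because updates only touch the per-level counters, update time is proportional to the number of levels, which is the kind of trade-off the excerpt contrasts against naive per-sample storage.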
“…Such algorithms have been designed to operate deterministically, and require O(k polylog n) space [1].…”
Section: Maximal Matching
confidence: 99%
“…Randomized constructions of k-sample algorithms are known (which use k-sparse recovery algorithms within them), and require O(k polylog n) space [1].…”
Section: Maximal Matching
confidence: 99%
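Both excerpts above lean on k-sparse recovery as the O(k polylog n)-space primitive inside k-sample algorithms. A hedged Python sketch of one standard randomized construction follows, hashing keys into O(k) buckets of 1-sparse testers across a few independent rows; the bucket and row counts are illustrative choices, not the parameters of [1]:

```python
import random

class KSparseRecovery:
    """Toy randomized k-sparse recovery: hash keys into 2k buckets per
    row and run a 1-sparse test per bucket."""
    def __init__(self, k, rows=4, seed=1):
        self.buckets = 2 * k
        rng = random.Random(seed)
        self.salts = [rng.getrandbits(64) for _ in range(rows)]
        self.s0 = [[0] * self.buckets for _ in range(rows)]  # sums of counts
        self.s1 = [[0] * self.buckets for _ in range(rows)]  # sums of key*count

    def update(self, key, delta):
        for r, salt in enumerate(self.salts):
            b = hash((salt, key)) % self.buckets
            self.s0[r][b] += delta
            self.s1[r][b] += key * delta

    def recover(self):
        # A bucket holding exactly one surviving key passes the 1-sparse
        # test; re-hashing the candidate filters most collisions. Real
        # constructions add fingerprints to rule out false positives.
        found = {}
        for r, salt in enumerate(self.salts):
            for b in range(self.buckets):
                c = self.s0[r][b]
                if c != 0 and self.s1[r][b] % c == 0:
                    key = self.s1[r][b] // c
                    if key > 0 and hash((salt, key)) % self.buckets == b:
                        found[key] = c
        return found

rec = KSparseRecovery(k=3)
for key, delta in [(5, 2), (9, 1), (5, -2), (12, 7)]:
    rec.update(key, delta)
print(rec.recover())  # expected {9: 1, 12: 7}: key 5's updates cancelled
```

With at most k surviving keys and 2k buckets per row, each key is isolated in some row with good probability, which is why a constant number of independent rows suffices in such constructions.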