2013
DOI: 10.1007/978-3-642-41527-2_2

When Distributed Computation Is Communication Expensive

Abstract: We consider a number of fundamental statistical and graph problems in the message-passing model, where we have k machines (sites), each holding a piece of data, and the machines want to jointly solve a problem defined on the union of the k data sets. The communication is point-to-point, and the goal is to minimize the total communication among the k machines. This model captures all point-to-point distributed computational models with respect to minimizing communication costs. Our analysis shows that exact com…

Cited by 35 publications (27 citation statements) · References 25 publications
“…Notice that in the Congested Clique model the input graph G is tightly coupled with the communication network N and the graph is distributed among the machines via a vertex partition. This is not the case in other related models for distributed graph processing, such as [1,30,19]. In these papers the input graph can be much larger than the machine network and the distribution of the graph among machines is via an edge partition.…”
Section: The Model
confidence: 94%
“…In [19] this edge partition is assumed to be random (initially), in [30] the edge partition can be worst case, whereas in [1] the edge partition is worst case, but with the requirement that each processor has the same number of edges. It is worth noting that [30] does prove message-complexity lower bounds for problems such as GC, but these lower bounds make crucial use of the worst-case distribution of edges and do not apply in our model. Similarly, the lower bounds in the setting of [1] do not seem to directly apply in the Congested Clique model.…”
Section: The Model
confidence: 99%
“…In distributed computing, the total amount of communication is often the most relevant complexity measure. For example, Woodruff and Zhang [44] and Klauck et al. [25] identify models and problems for which there is no algorithm that beats the communication benchmark of sending the entire input to a single machine. Because massively parallel systems are designed to send a potentially large amount of data in a single round, such communication lower bounds do not generally imply lower bounds for round complexity.…”
Section: Related Work
confidence: 99%
“…The coordinator model has attracted a lot of attention in recent years [1,25,41,45]. At a high level, it is similar to the congested clique model [16,35,36,40] and the k-machine model [32].…”
Section: Introduction
confidence: 99%
“…The k sites would like to jointly compute some statistical function f defined on S by treating items from the same group as the same item. For example, the distinct elements function is defined to be F0(S) = |G| = n. We always allow a (1 + ε)-approximation since for exact computation, in the worst case, there is often no better way than shipping all items to one site (for many statistical problems, this holds even in the noise-free case, see [45]). The precise meaning of the (1 + ε)-approximation depends on specific problems.…”
Section: Introduction
confidence: 99%
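The trade-off described in the citation above, where exact distinct-elements counting (F0) may require shipping all items to one site while a (1 + ε)-approximation can be far cheaper, can be illustrated with a standard KMV (k-minimum-values) sketch. This is a minimal sketch under assumptions, not the algorithm from the cited paper; the function names (`kmv_sketch`, `merge_and_estimate`) are hypothetical. Each site ships only its k smallest hash values to the coordinator, so per-site communication is O(k) rather than proportional to the site's data size.

```python
import hashlib

def hash01(item):
    # Stable hash mapping an item to a pseudo-random value in [0, 1).
    digest = hashlib.sha256(str(item).encode()).digest()
    return int.from_bytes(digest[:8], "big") / 2**64

def kmv_sketch(items, k):
    # A site keeps only the k smallest hash values of its distinct items;
    # this small list is all it ships to the coordinator.
    return sorted({hash01(x) for x in items})[:k]

def merge_and_estimate(sketches, k):
    # Coordinator merges per-site sketches and estimates F0 over the union.
    merged = sorted(set().union(*map(set, sketches)))[:k]
    if len(merged) < k:
        return len(merged)            # fewer than k distinct items overall: exact
    return int((k - 1) / merged[-1])  # standard KMV estimator (k - 1) / v_k

# Three sites whose union has 8 distinct items (F0 = 8).
sites = [[1, 2, 3, 4], [3, 4, 5, 6], [5, 6, 7, 8]]
print(merge_and_estimate([kmv_sketch(s, 16) for s in sites], 16))  # exact branch: 8
```

Because hash values of the same item collide across sites, duplicated items contribute only once to the merged sketch, which is exactly the "treat items from the same group as the same item" semantics; the estimator's accuracy improves as k grows, matching the (1 + ε) guarantee at k = Θ(1/ε²).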