The Communication Complexity of Set Intersection and Multiple Equality Testing

Huang, Dawei; Pettie, Seth; Zhang, Yixiang; Zhang, Zhijun

doi:10.1137/1.9781611975994.105

Cited by 11 publications

(6 citation statements)

References 33 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…An additional algorithm for deterministic triangle listing is given by Huang, Pettie, Zhang, and Zhang [HPZZ20]. The complexity of this algorithm, given in terms of the maximum degree ∆, is O(∆/ log n + log log ∆) rounds, w.h.p.…”

Section: Triangle Finding In the Congest Modelmentioning

confidence: 99%

See 1 more Smart Citation

Distributed Subgraph Finding: Progress and Challenges

Censor-Hillel¹

2022

Preprint

View full text Add to dashboard Cite

This is a survey of the exciting recent progress made in understanding the complexity of distributed subgraph finding problems. It overviews the results and techniques for assorted variants of subgraph finding problems in various models of distributed computing, and states intriguing open questions. This version contains some updates over the ICALP 2021 version, and I will try to keep updating it as additional progress is made.However, it is possible to do better, as we overview in this section. Triangle Finding in the CLIQUE ModelWe begin with the CLIQUE model.Triangle listing in the CLIQUE model. The first non-trivial algorithm for triangle finding is due to Dolev, Lenzen, and Peled [DLP12]. This is a deterministic triangle listing algorithm for the CLIQUE model, which has a complexity of O(n 1/3 / log n) rounds. The simplicity of this algorithm turned out to be a huge advantage for later additional results, as we will see. The algorithm works as follows: The vertices of the graph are partitioned into n 1/3 subsets S 1 , . . . , S n 1/3 , each of n 2/3 nodes. Each of the n nodes receives a different tuple of three of these subsets. A node that receives S i 1 , S i 2 , S i 3 for indices 1 ≤ i 1 , i 2 , i 3 ≤ n 1/3 (that are not necessarily different) collects all edges with one endpoint in one of the three subsets and one endpoint in another, that is, this node collects all edges in E(S i 1 , S i 2 ) ∪ E(S i 1 , S i 3 ) ∪ E(S i 2 , S i 3 ), and reports all triangles that it finds. It is straightforward to see that all triangles are listed by this algorithm since the number of 3-tuples of subsets is n and so each is handled by some node.The round complexity of the algorithm follows by proving that each node needs to send and receive O(n 4/3 ) edges in total, which are to and from locations that are known to all nodes (we will discuss this knowledge property later), since the partition to subsets is hardcoded and so it is known to all nodes. Sending: Take a node v and assume that it is in the subset S i . There can be at most n 2/3 edges between v and nodes in S j and these edges need to be sent to all nodes that have S i and S j in their 3-tuple. Since there are n 1/3 such 3-tuples, these n 2/3 edges need to be sent to n 1/3 nodes. Repeating this for all n 1/3 possibilities for j gives a total of n 2/3+1/3+1/3 = n 4/3 edges that v has to send. Receiving: Each node needs to learn 3 subsets of edges, each containing at most n 2/3 • n 2/3 = n 4/3 edges. To conclude the complexity analysis, one can use the simple claim that [DLP12] proves, which states that a routing task in which each node needs to send and receive n messages in a known pattern can be done in 2 rounds. This means that the O(n 4/3 ) sent and received messages per node are divided by n, yielding a complexity of O(n 1/3 ) rounds. Noticing that the partition and routing are fixed, one can refrain from sending actual edge identifiers and replace them with a bit mask, which saves a logarithmic factor and results in a complexity of O(n 1/3 / log n) rounds....

show abstract

Section: Triangle Finding In the Congest Modelmentioning

confidence: 99%

“…Since the complexities of both [CS20] and [HPZZ20] do not reach yet the lower bound of Ω(n 1/3 ) rounds, we note the following open question.…”

Section: Triangle Finding In the Congest Modelmentioning

confidence: 99%

Distributed Subgraph Finding: Progress and Challenges

Censor-Hillel¹

2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Subsequently, [CPSZ21] showed a breakthrough by listing in the optimal complexity of ˜ ( 1/3 ) rounds, w.h.p., using an expander decomposition together with the routing techniques of [GKS17,GL18]. [HPZZ21] show an algorithm which takes (Δ/log + log log Δ) rounds, where Δ is the maximal degree in the graph. following: Subsequently, [CPZ19] showed a breakthrough by listing in ˜ ( 1/2 ) rounds, w.h.p., using an expander decomposition, which was later improved to get the optimal, yet randomized, ˜ ( 1/3 )-round algorithm of [CS19].…”

Section: Simulating Partial-pass Streaming Algorithms In Cmentioning

confidence: 99%

Deterministic Near-Optimal Distributed Listing of Cliques

Censor-Hillel,

Leitersdorf,

Vulakh

2022

Preprint

View full text Add to dashboard Cite

The importance of classifying connections in large graphs has been the motivation for a rich line of work on distributed subgraph finding that has led to exciting recent breakthroughs. A crucial aspect that remained open was whether deterministic algorithms can be as efficient as their randomized counterparts, where the latter are known to be tight up to polylogarithmic factors.We give deterministic distributed algorithms for listing cliques of size in 1−2/ + (1) rounds in the C model. For triangles, our 1/3+ (1) round complexity improves upon the previous state of the art of 2/3+ (1) rounds [Chang and Saranurak, FOCS 2020].For cliques of size ≥ 4, ours are the first non-trivial deterministic distributed algorithms. Given known lower bounds, for all values ≥ 3 our algorithms are tight up to a (1) subpolynomial factor, which comes from the deterministic routing procedure we use.

show abstract

“…Subsequently, [13] showed a breakthrough by listing in the optimal complexity of Õ (𝑛 1/3 ) rounds, w.h.p., using an expander decomposition together with the routing techniques of [25,26]. [29] show an 𝑂 (Δ/log 𝑛 + log log Δ)-round algorithm, where Δ is the maximal degree in the graph. The first non-trivial deterministic algorithm was given by [16], taking 𝑂 (𝑛 0.58 ) rounds for detection and 𝑛 2/3+𝑜 (1) for listing.…”

Section: Further Related Workmentioning

confidence: 99%

Deterministic Near-Optimal Distributed Listing of Cliques

Censor-Hillel

Leitersdorf

Vulakh

2022

Proceedings of the 2022 ACM Symposium on Principles of Distributed Computing

View full text Add to dashboard Cite

The importance of classifying connections in large graphs has been the motivation for a rich line of work on distributed subgraph finding that has led to exciting recent breakthroughs. A crucial aspect that remained open was whether deterministic algorithms can be as efficient as their randomized counterparts, where the latter are known to be tight up to polylogarithmic factors.We give deterministic distributed algorithms for listing cliques of size 𝑝 in 𝑛 1−2/𝑝+𝑜 (1) rounds in the Congest model. For triangles, our 𝑛 1/3+𝑜 (1) round complexity improves upon the previous state of the art of 𝑛 2/3+𝑜 (1) rounds [Chang and Saranurak, FOCS 2020]. For cliques of size 𝑝 ≥ 4, ours are the first non-trivial deterministic distributed algorithms. Given known lower bounds, for all values 𝑝 ≥ 3 our algorithms are tight up to a 𝑛 𝑜 (1) subpolynomial factor, which comes from the deterministic routing procedure we use. CCS CONCEPTS• Theory of computation → Distributed algorithms.

show abstract

The Communication Complexity of Set Intersection and Multiple Equality Testing

Cited by 11 publications

References 33 publications

Distributed Subgraph Finding: Progress and Challenges

Distributed Subgraph Finding: Progress and Challenges

Deterministic Near-Optimal Distributed Listing of Cliques

Deterministic Near-Optimal Distributed Listing of Cliques

Contact Info

Product

Resources

About