Proceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems 2018
DOI: 10.1145/3196959.3196964
|View full text |Cite
|
Sign up to set email alerts
|

Distributed Statistical Estimation of Matrix Products with Applications

Abstract: We consider statistical estimations of a matrix product over the integers in a distributed setting, where we have two parties Alice and Bob; Alice holds a matrix A and Bob holds a matrix B, and they want to estimate statistics of A · B. We focus on the well-studied ℓ p -norm, distinct elements (p = 0), ℓ 0 -sampling, and heavy hitter problems. The goal is to minimize both the communication cost and the number of rounds of communication.This problem is closely related to the fundamental set-intersection join pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
3
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 42 publications
0
3
0
Order By: Relevance
“…Namely, nearly all lower bounds for the space complexity of randomized streaming algorithms are derived via reductions from communication problems. For an incomplete list of such reductions, see [58,61,45,42,46,10,16,57,48,49,40] and the references therein. Now nearly all such lower bounds (and all of the ones that were just cited) hold in either the 2-party setting (G has 2 vertices), the coordinator model, or the black-board model.…”
Section: Multi-party Communicationmentioning
confidence: 99%
See 1 more Smart Citation
“…Namely, nearly all lower bounds for the space complexity of randomized streaming algorithms are derived via reductions from communication problems. For an incomplete list of such reductions, see [58,61,45,42,46,10,16,57,48,49,40] and the references therein. Now nearly all such lower bounds (and all of the ones that were just cited) hold in either the 2-party setting (G has 2 vertices), the coordinator model, or the black-board model.…”
Section: Multi-party Communicationmentioning
confidence: 99%
“…The black-board model is also considered frequently for designing communication upper bounds, such as those for set disjointness [6,16,30]. Finally, there is substantial literature which considers numerical linear algebra and clustering problems in the coordinator model [61,20,5,62]. Thus, our upper bounds can be seen as a new and useful contribution to these bodies of literature as well.…”
Section: Multi-party Communicationmentioning
confidence: 99%
“…Statistics of a matrix product. In [45], an algorithm was given for estimating \| A \cdot B\| p for integer matrices A and B with O(log n) bit integer entries (see Algorithm 1 in [45] for the general algorithm). When p = 0, this estimates the number of nonzero entries of A \cdot B, which may be useful since there are faster algorithms for matrix product when the output is sparse; see [34] and the references therein.…”
Section: Problemmentioning
confidence: 99%
“…More generally, norms of the product A \cdot B can be used to determine how correlated the rows of A are with the columns of B. The bit complexity of this problem was studied in [42,45]. In [42] a lower bound of \Omega (\varepsi - 2 n) bits was shown for estimating \| AB\| 0 for n \times n matrices A, B up to a (1 + \varepsi ) factor, assuming n \geq 1/\varepsi 2 (this lower bound holds already for binary matrices A and B).…”
Section: Problemmentioning
confidence: 99%
See 1 more Smart Citation