1995
DOI: 10.1147/rd.395.0575
|View full text |Cite
|
Sign up to set email alerts
|

A three-dimensional approach to parallel matrix multiplication

Abstract: A three-dimensional (3D) matrix multiplication algorithm for massively parallel processing systems is presented. The P processors are con gured as a \virtual" processing cube with dimensions p 1 , p 2 , and p 3 proportional to the matrices' dimensions|M, N, and K. Each processor performs a single local matrix multiplication of size M=p 1 N=p 2 K=p 3. Before the local computation can be carried out, each subcube must receive a single submatrix of A and B. After the single matrix multiplication has completed, K=… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
128
0

Year Published

1997
1997
2020
2020

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 122 publications
(128 citation statements)
references
References 13 publications
(2 reference statements)
0
128
0
Order By: Relevance
“…If the matrices are square and of size S, and there is enough memory for only O(1) copies of the matrices (S = Θ(M · p)), matrix multiplication can be performed on a 2D grid of processors in a communication-optimal fashion. In particular, blocked Cannon's algorithm [9] and SUMMA [1,53] achieve the bandwidth cost…”
Section: Matrix Multiplication Algorithmsmentioning
confidence: 99%
See 3 more Smart Citations
“…If the matrices are square and of size S, and there is enough memory for only O(1) copies of the matrices (S = Θ(M · p)), matrix multiplication can be performed on a 2D grid of processors in a communication-optimal fashion. In particular, blocked Cannon's algorithm [9] and SUMMA [1,53] achieve the bandwidth cost…”
Section: Matrix Multiplication Algorithmsmentioning
confidence: 99%
“…Given an unlimited amount of memory, it is possible to replicate the data to reduce the communication cost, which is realized by a 3D matrix multiplication algorithm [32,12,1,2,23] with a resulting bandwidth cost of…”
Section: Matrix Multiplication Algorithmsmentioning
confidence: 99%
See 2 more Smart Citations
“…We do not consider algorithms that replicate data (e.g. 3D matrix multiplication [1]). However, the extension is natural, given the replication approach presented in [11].…”
Section: Matrix Layoutmentioning
confidence: 99%