2013
DOI: 10.1007/978-3-642-39958-9_1

Strassen’s Communication-Avoiding Parallel Matrix Multiplication Algorithm for All-Port 2D Torus Networks

Cited by 1 publication (2 citation statements)
References 9 publications

“…Optimizations of matrix multiplication have been extensively studied in distributed computing, scientific computing and high-performance computing [3]-[8]. In recent studies, it has been argued that communication costs are and will continue to be the limiting factor in the design of efficient parallel algorithms [9]. As a result, some significant communication-avoiding matrix multiplication algorithms have been proposed to address the bottleneck in HPC domain [3], [9]-[12].…”
Section: Introduction (mentioning)
confidence: 99%
“…In recent studies, it has been argued that communication costs are and will continue to be the limiting factor in the design of efficient parallel algorithms [9]. As a result, some significant communication-avoiding matrix multiplication algorithms have been proposed to address the bottleneck in HPC domain [3], [9]-[12]. The state-of-the-art one is the Communication-Avoiding Parallel Strassen (CAPS) algorithm which reportedly performs asymptotically better than any previous classical or Strassen-based parallel algorithm and obtains 24% to 184% speedups on Cray XT4 [10].…”
Section: Introduction (mentioning)
confidence: 99%