2008
DOI: 10.1109/ipdps.2008.4536305
|View full text |Cite
|
Sign up to set email alerts
|

Avoiding communication in sparse matrix computations

Abstract: The performance of sparse iterative solvers is typically limited by sparse matrix-vector multiplication, which is itself limited by memory system and network performance. As the gap between computation and communication speed continues to widen, these traditional sparse methods will suffer. In this paper we focus on an alternative building block for sparse iterative solvers, the "matrix powers ker-, and show that by organizing computations around this kernel, we can achieve nearminimal communication costs. We … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
76
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 96 publications
(76 citation statements)
references
References 16 publications
0
76
0
Order By: Relevance
“…Unstructured cache blocking [20] is another approach. The communication-avoiding algorithms of Demmel et al [18] also create sparse tiles, but those tiles overlap so as to enable parallel execution of the tiles and minimal communication between tiles.…”
Section: Input Abstract Relations Transformation Specificationmentioning
confidence: 99%
“…Unstructured cache blocking [20] is another approach. The communication-avoiding algorithms of Demmel et al [18] also create sparse tiles, but those tiles overlap so as to enable parallel execution of the tiles and minimal communication between tiles.…”
Section: Input Abstract Relations Transformation Specificationmentioning
confidence: 99%
“…ClouDiA works for two large classes of data-driven applications. The first class, which contains many HPC applications, is sensitive to the worst-link latency, as this latency can significantly affect total time-to-solution in a variety of scientific applications [1,12,22,35]. The second class, represented by search engines as well as web services and portals, is sensitive to the longest path between application nodes, as this cost models the network links with the highest potential impact on application response time.…”
Section: Contributions Of This Papermentioning
confidence: 99%
“…As the end of a time step is a logical barrier, worst-link latency essentially determines communication cost [1,12,35,72]. Similar communication patterns are common in multiple linear algebra computations [22]. Another example of an HPC application where time-to-solution is critical is dynamic traffic assignment [64].…”
Section: Goal: Time-to-solutionmentioning
confidence: 99%
“…Some studies [31] proposed to avoid communication at the expense of performing some redundant computation. While communicating less often certainly helps, this technique alone cannot deal with latency spikes.…”
Section: Related Workmentioning
confidence: 99%