2011
DOI: 10.1137/100788926
|View full text |Cite
|
Sign up to set email alerts
|

CALU: A Communication Optimal LU Factorization Algorithm

Abstract: Abstract. Since the cost of communication (moving data) greatly exceeds the cost of doing arithmetic on current and future computing platforms, we are motivated to devise algorithms that communicate as little as possible, even if they do slightly more arithmetic, and as long as they still get the right answer. This paper is about getting the right answer for such an algorithm.It discusses CALU, a communication avoiding LU factorization algorithm based on a new pivoting strategy, that we refer to as ca-pivoting… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
76
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 47 publications
(76 citation statements)
references
References 21 publications
0
76
0
Order By: Relevance
“…Second, the distant data accesses has an impact on the overall performance. Adapting the communication avoiding techniques of [19] in the framework of our recursive algorithm is highly relevant. Over a finite field, tournament pivoting seem to reduce to computing the union of the non-zero pivots found in concurrent eliminations.…”
Section: Resultsmentioning
confidence: 99%
“…Second, the distant data accesses has an impact on the overall performance. Adapting the communication avoiding techniques of [19] in the framework of our recursive algorithm is highly relevant. Over a finite field, tournament pivoting seem to reduce to computing the union of the non-zero pivots found in concurrent eliminations.…”
Section: Resultsmentioning
confidence: 99%
“…When the factorization in Equation (AA4) is computed in a communication-avoiding way using the tall-and-skinny LU factorization [13] (TSLU), L is still bounded, but the bound is 2 bh , where h is a parameter of TSLU that normally satisfies h = O(log n). This can obviously be much larger than 1, although experiments indicate that L is usually much smaller.…”
Section: 4mentioning
confidence: 99%
“…SMLU uses partial pivoting and incurs a slight bandwidth cost overhead compared to RLU (an extra logarithmic factor). Another algorithm which reduces latency cost even further is the communication-avoiding tall-skinny LU algorithm (TSLU) [13]. The algorithm can be applied to general matrices, but the main innovation focuses on tall-skinny matrices.…”
Section: Algorithm Wordsmentioning
confidence: 99%
“…As examples of very recent activities on the error analysis of GE, I discuss here very briefly the research presented in the papers Grigori, Demmel and Xiang (2011) and Dopico and Molera (2012) published in the last two years. Grigori, Demmel and Xiang (2011) consider the LU factorization in the context of one of the hottest topics of numerical computations of the last years: "communication avoiding algorithms".…”
Section: Research On Error Analysis Of Gaussian Elimination Is Still mentioning
confidence: 99%
“…Grigori, Demmel and Xiang (2011) consider the LU factorization in the context of one of the hottest topics of numerical computations of the last years: "communication avoiding algorithms". In current and future computers the cost of communication (moving data between different levels of memory or between different processors) greatly exceeds the cost of performing arithmetic operations, therefore there is a strong motivation for developing new algorithms that communicate as little as possible, even if they do more arithmetic.…”
Section: Research On Error Analysis Of Gaussian Elimination Is Still mentioning
confidence: 99%