2012
DOI: 10.1137/080731992
|View full text |Cite
|
Sign up to set email alerts
|

Communication-optimal Parallel and Sequential QR and LU Factorizations

Abstract: We present parallel and sequential dense QR factorization algorithms that are both optimal (up to polylogarithmic factors) in the amount of communication they perform, and just as stable as Householder QR. Our first algorithm, Tall Skinny QR (TSQR), factors m × n matrices in a one-dimensional (1-D) block cyclic row layout, and is optimized for m n. Our second algorithm, CAQR (Communication-Avoiding QR), factors general rectangular matrices distributed in a two-dimensional block cyclic layout. It invokes TSQR f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

3
408
0
1

Year Published

2012
2012
2021
2021

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 320 publications
(416 citation statements)
references
References 53 publications
3
408
0
1
Order By: Relevance
“…However, it is not numerically stable if A is ill conditioned [1]. A good comparison regarding parallel performance and numerical stability of the cited algorithms can be found in [9]. Table 1 summarizes this work.…”
mentioning
confidence: 98%
See 2 more Smart Citations
“…However, it is not numerically stable if A is ill conditioned [1]. A good comparison regarding parallel performance and numerical stability of the cited algorithms can be found in [9]. Table 1 summarizes this work.…”
mentioning
confidence: 98%
“…Except of lower order terms, the first step Parallel algorithm Flops on critical path Messages Comm. volume PDGEQRF 2mn 2 P + n 2 2 log(P ) 2n log(P ) n 2 2 log(P ) TSQR 2mn 2 P + 2 3 n 3 log(P ) log(P ) n 2 2 log(P ) CholeskyQR 2mn 2 P + n 3 3 log(P ) log(P ) n 2 2 log(P ) Table 1: Out of [9]: Performance model of selected parallel QR-decomposition algorithms. n × m stands for the size of the matrix, P for the number of parallel processes.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…We consider a variant of this algorithm, so-called "Communication-Avoiding QR" (CAQR) [14]. As in ScaLAPACK, the basic operation of CAQR is the factorization of a panel followed by the update of the trailing submatrix.…”
Section: Qr Factorization Of Tall and Skinny Matricesmentioning
confidence: 99%
“…TSQR is particularly well adapted to the factorization of tall and skinny matrices, i.e., matrices satisfying M ≫ N (see [14]). TSQR is an important kernel for two reasons.…”
Section: Qr Factorization Of Tall and Skinny Matricesmentioning
confidence: 99%