1998
DOI: 10.1137/s1064827595296732
|View full text |Cite
|
Sign up to set email alerts
|

A BLAS-3 Version of the QR Factorization with Column Pivoting

Abstract: The QR factorization with column pivoting (QRP), originally suggested by Golub [Numer. Math., 7 (1965), 206-216], is a popular approach to computing rank-revealing factorizations. Using Level 1 BLAS, it was implemented in LINPACK, and, using Level 2 BLAS, in LAPACK. While the Level 2 BLAS version delivers superior performance in general, it may result in worse performance for large matrix sizes due to cache effects. We introduce a modification of the QRP algorithm which allows the use of Level 3 BLAS kernels … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
70
0

Year Published

2005
2005
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 92 publications
(70 citation statements)
references
References 23 publications
0
70
0
Order By: Relevance
“…dgeqp3. The implementation of blocked HQRP that is part of the netlib implemenation of LAPACK, based on [31], modified so that the block size can be controlled. HQRRPbasic.…”
Section: Performance Experimentsmentioning
confidence: 99%
See 2 more Smart Citations
“…dgeqp3. The implementation of blocked HQRP that is part of the netlib implemenation of LAPACK, based on [31], modified so that the block size can be controlled. HQRRPbasic.…”
Section: Performance Experimentsmentioning
confidence: 99%
“…Widely used current implementations of the level-3 BLAS are based on techniques exposed by Goto [15,14] The fundamental problem with the classical approach to HQRP is that only half of the computation can be cast in terms of gemm, as described in the paper [31] that…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…This factorization was introduced in [16], and the first algorithm to compute it was proposed in [6] and is based on the QR factorization with column pivoting (QRCP). A BLAS-3 version of this algorithm [25] is implemented in LAPACK [1], and its parallel version in ScaLAPACK [5].…”
mentioning
confidence: 99%
“…In classic QR factorization with column pivoting, at each step i of the factorization, the remaining unselected column of maximum norm is selected and exchanged with the i-th column, its subdiagonal elements are annihilated, using for example a Householder transformation, and then the trailing matrix is updated. A block version of this algorithm is described in [25]. The main difficulty in reducing communication in rank revealing QR factorization lies in identifying b pivot columns at each step of the block algorithm.…”
mentioning
confidence: 99%