2015
DOI: 10.1137/140952429
|View full text |Cite
|
Sign up to set email alerts
|

A Hierarchically Blocked Jacobi SVD Algorithm for Single and Multiple Graphics Processing Units

Abstract: Abstract. We present a hierarchically blocked one-sided Jacobi algorithm for the singular value decomposition (SVD), targeting both single and multiple graphics processing units (GPUs). The blocking structure reflects the levels of GPU's memory hierarchy. The algorithm may outperform MAGMA's dgesvd, while retaining high relative accuracy. To this end, we developed a family of parallel pivot strategies on GPU's shared address space, but applicable also to inter-GPU communication. Unlike common hybrid approaches… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 19 publications
(24 citation statements)
references
References 37 publications
0
24
0
Order By: Relevance
“…The mathematical equivalence of Mazzoni's method (17) and (18) to the SVD-based predicted-covariancematrix-SR-time-propagation (31) to (34) follows immediately from the SR condition (22) and the obvious property of SVD (33) stating that…”
Section: The Time-update Step Of the Sr-acd-eukf Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The mathematical equivalence of Mazzoni's method (17) and (18) to the SVD-based predicted-covariancematrix-SR-time-propagation (31) to (34) follows immediately from the SR condition (22) and the obvious property of SVD (33) stating that…”
Section: The Time-update Step Of the Sr-acd-eukf Methodsmentioning
confidence: 99%
“…The entries σ1,σ2,,σmin{m,n} mentioned in Theorem 2 are referred to as the hyperbolic singular values of the matrix A . The HSVD plays an important role in a number of tasks of practical value, which are outlined by Onn et al So, its efficient implementation is of high importance and discussed extensively . Below, we explain how the HSVD can contribute to square‐rooting the ACD‐EUKF methods with negative UT weights.…”
Section: The Hyperbolic Svd‐based Jsr‐acd‐eukf With Negative Weightsmentioning
confidence: 99%
“…The identity matrix is sent to the MATLAB routines for implementing usual QR/SVD. More precisely, the hyperbolic QR is implemented by hyperbolic Givens rotations [42] while Jacobi‐type SVD implementation strategy [35] is utilised for implementing HSVD [The MATLAB routine ‘jqr’ is freely available at https://www.mathworks.com/matlabcentral/fileexchange/50329-jqr-jrq-jql-jlq-factorizations. The hyperbolic SVD implementation is based on the routine ‘jqr’ and Jacobi‐type ordinary SVD implementation ‘svdsim’ at https://www.mathworks.com/matlabcentral/fileexchange/12674-simple-svd.].…”
Section: Numerical Experimentsmentioning
confidence: 99%
“…Such methods require the hyperbolic SVD (HSVD) factorisation instead of usual SVD in the classical Riccati‐based KF. The existence of the HSVD is proved in [32, 33] and its efficient implementation is discussed in [34, 35] and many other works. It is worth noting here that the newly‐proposed HSVD‐based filter enriches a family of factored‐from (square‐root) Chandrasekhar‐based methods, which so far consists of Cholesky‐based algorithms, only.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, an efficient, parallel and blocked one-sided Jacobi-type algorithm for the “ordinary” and the hyperbolic SVD (Novaković, 2015, 2017) of a single real matrix has been developed for the GPUs, that utilizes the GPUs almost fully, with the CPU serving only the controlling purpose in the single-GPU case.…”
Section: Introductionmentioning
confidence: 99%