2011
DOI: 10.1007/s10543-011-0333-5
|View full text |Cite
|
Sign up to set email alerts
|

A GPU-based hyperbolic SVD algorithm

Abstract: A one-sided Jacobi hyperbolic singular value decomposition (HSVD) algorithm, using a massively parallel graphics processing unit (GPU), is developed. The algorithm also serves as the final stage of solving a symmetric indefinite eigenvalue problem. Numerical testing demonstrates the gains in speed and accuracy over sequential and MPI-parallelized variants of similar Jacobi-type HSVD algorithms. Finally, possibilities of hybrid CPU--GPU parallelism are discussed.Comment: Accepted for publication in BIT Numerica… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
24
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 30 publications
(25 citation statements)
references
References 21 publications
0
24
0
Order By: Relevance
“…The eigenvalue order tends to stabilize eventually, thus no swapping is usually needed in the last few sweeps [28,29]. If the rotation is hyperbolic, to keep J partitioned, set V ′ pq = V pq (there is no sorting, and the new diagonal elements are not needed).…”
Section: A Single-gpu Algorithmmentioning
confidence: 99%
See 3 more Smart Citations
“…The eigenvalue order tends to stabilize eventually, thus no swapping is usually needed in the last few sweeps [28,29]. If the rotation is hyperbolic, to keep J partitioned, set V ′ pq = V pq (there is no sorting, and the new diagonal elements are not needed).…”
Section: A Single-gpu Algorithmmentioning
confidence: 99%
“…The new strategies may be seen as the generalizations of the Mantharam-Eberlein block-recursive strategy [27] to all even matrix orders. These new strategies are combined with the multilevel blocking and parallelization techniques explored in [20,21,37,36,29], to deliver the Jacobi-type (H)SVD algorithms for the graphics processing unit(s), competitive with the leading hybrid (CPU + GPU) alternatives, like MAGMA. The new algorithms are carefully designed to use a CPU primarily as a controlling unit.…”
Section: A Multi-gpu Algorithmmentioning
confidence: 99%
See 2 more Smart Citations
“…It should be noted that an important drawback of hybrid algorithms is the need to copy data from the CPU to the GPU and vice versa, which leads to significant time delays [16], [18]. Communication costs are considerably reduced by means of filtration procedure.…”
Section: The Development Of Hybrid Cpu-gpu Algorithmmentioning
confidence: 99%