2010
DOI: 10.1016/j.laa.2010.06.032
|View full text |Cite
|
Sign up to set email alerts
|

Block-oriented J-Jacobi methods for Hermitian matrices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
31
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
6

Relationship

3
3

Authors

Journals

citations
Cited by 18 publications
(31 citation statements)
references
References 27 publications
0
31
0
Order By: Relevance
“…The most promising way to further enhance these characteristics is to modify them to become BLAS 3 algorithms (see [12,28,31,33,34,52,53]). Such methods are referred to as block diagonalization or block Jacobi-type methods.…”
mentioning
confidence: 99%
“…The most promising way to further enhance these characteristics is to modify them to become BLAS 3 algorithms (see [12,28,31,33,34,52,53]). Such methods are referred to as block diagonalization or block Jacobi-type methods.…”
mentioning
confidence: 99%
“…The new strategies may be seen as the generalizations of the Mantharam-Eberlein block-recursive strategy [27] to all even matrix orders. These new strategies are combined with the multilevel blocking and parallelization techniques explored in [20,21,37,36,29], to deliver the Jacobi-type (H)SVD algorithms for the graphics processing unit(s), competitive with the leading hybrid (CPU + GPU) alternatives, like MAGMA. The new algorithms are carefully designed to use a CPU primarily as a controlling unit.…”
Section: A Multi-gpu Algorithmmentioning
confidence: 99%
“…GPUs instead offer a complex memory hierarchy, with different access speeds and patterns, and both automatically and programmatically managed caches. Even more so than in the CPU world, a (less) careful hardware-adapted blocking of a GPU algorithm is the key technique by which considerable speedups are gained (or lost).After the introductory paper [29], here we present a family of the full block [21] and the block-oriented [20] one-sided Jacobi-type algorithm variants for the ordinary (SVD) and the hyperbolic singular value decomposition (HSVD) of a matrix, targeting both a single and the multiple GPUs. The blocking of our algorithm follows the levels of the GPU memory hierarchy; namely, the innermost level of blocking tries to maximize the amount of computation done inside the fastest (and smallest) memory of the registers and manual caches.…”
mentioning
confidence: 99%
See 2 more Smart Citations