2016
DOI: 10.1007/978-3-319-32149-3_9
|View full text |Cite
|
Sign up to set email alerts
|

Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

3
14
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(17 citation statements)
references
References 23 publications
3
14
0
Order By: Relevance
“…This paper extends our previous proceedings paper [16] presented at the PPAM 2015 conference. Section 2 describes the three algorithms for solving dense symmetric indefinite systems (i.e., the Bunch-Kaufman and Aasen's algorithms, and the RBTs) and their implementations on the hybrid CPU/GPU architecture.…”
Section: Introductionsupporting
confidence: 80%
“…This paper extends our previous proceedings paper [16] presented at the PPAM 2015 conference. Section 2 describes the three algorithms for solving dense symmetric indefinite systems (i.e., the Bunch-Kaufman and Aasen's algorithms, and the RBTs) and their implementations on the hybrid CPU/GPU architecture.…”
Section: Introductionsupporting
confidence: 80%
“…Although our focus of this paper is on the deterministic algorithms with theoretical error bounds, there are growing interests in randomized algorithms . When combined with the iterative refinements, these randomized algorithms may compute the solution of the desired accuracy without pivoting, while obtaining the high performance on modern computers …”
Section: Related Workmentioning
confidence: 99%
“…As a result, it can obtain a great speedup when the matrix is significantly greater than the GPU memory. We note that this implementation is different from the previous CA Aasen's algorithm proposed and studied elsewhere . Although the previous algorithm avoids some of the communication (instead of hiding the communication), it accesses all the previously factorized column of L for each panel factorization; some of which may not fit in the GPU memory.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations