2016
DOI: 10.1007/s00450-016-0326-3
|View full text |Cite
|
Sign up to set email alerts
|

Energy-aware solution of linear systems with many right hand sides

Abstract: The solution of linear systems of equations with many right hand sides is mostly seen as a trivial extension of solving a linear system and the algorithmic developments mostly focus on the efficient computation of the LU decomposition. This is, however, not regarding the case where many right hand sides increase the runtime influence of the forward/backward substitution. In this contribution we present a GPU accelerated Gauss-Jordan-elimination based all-at-once solution scheme which focuses on minimizing the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
15
0

Year Published

2017
2017
2018
2018

Publication Types

Select...
2

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(15 citation statements)
references
References 10 publications
(20 reference statements)
0
15
0
Order By: Relevance
“…Without loss of generality, we neglect the pivoting matrices P i for deriving the level‐3 formulation of the algorithm. The integration of the pivoting is straightforward, as described in the works of Quintana‐Ortí et al and Köhler et al Note that the application of a Gauss‐Transform G i from the left to a matrix A can be reformulated into a rank‐1 update A:=A1aii()a1i,0.1em,afalse(i1false)i,0,afalse(i+1false)i,,amiThTAi,· with the subsequent row scaling Ai,·:=1aiiAi,·. The partitioning of the matrix A into…”
Section: Gauss‐jordan Eliminationmentioning
confidence: 99%
See 4 more Smart Citations
“…Without loss of generality, we neglect the pivoting matrices P i for deriving the level‐3 formulation of the algorithm. The integration of the pivoting is straightforward, as described in the works of Quintana‐Ortí et al and Köhler et al Note that the application of a Gauss‐Transform G i from the left to a matrix A can be reformulated into a rank‐1 update A:=A1aii()a1i,0.1em,afalse(i1false)i,0,afalse(i+1false)i,,amiThTAi,· with the subsequent row scaling Ai,·:=1aiiAi,·. The partitioning of the matrix A into…”
Section: Gauss‐jordan Eliminationmentioning
confidence: 99%
“…Using a classic hybrid CPU‐GPU approach, we compute the matrix H on the CPU and perform the update on the remaining parts of the matrix A on the GPUs. Due to the fact that we only have a small number of GPUs inside one system, we chose the Column Block Cyclic (CBC) distribution of A across the available GPUs . The structure of the rank‐ N B updates then allows an easy parallel execution by only distributing the matrix H to all GPUs in each iteration.…”
Section: Gauss‐jordan Eliminationmentioning
confidence: 99%
See 3 more Smart Citations