2011
DOI: 10.1109/tpds.2010.61
|View full text |Cite
|
Sign up to set email alerts
|

Cyclic Reduction Tridiagonal Solvers on GPUs Applied to Mixed-Precision Multigrid

Abstract: Abstract-We have previously suggested mixed precision iterative solvers specifically tailored to the iterative solution of sparse linear equation systems as they typically arise in the finite element discretization of partial differential equations. These schemes have been evaluated for a number of hardware platforms, in particular single precision GPUs as accelerators to the general purpose CPU. This paper reevaluates the situation with new mixed precision solvers that run entirely on the GPU: We demonstrate … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
49
0
2

Year Published

2013
2013
2015
2015

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 98 publications
(52 citation statements)
references
References 28 publications
1
49
0
2
Order By: Relevance
“…The paper also suggested the possibility of reducing the system size by using the cyclic reduction and the global memory in order to fit the reduced system into the local memory. The cyclic reduction and the local memory were also used in [6]. The paper introduced a clever permutation pattern which reduces the number of bank conflicts.…”
Section: Previous Work On Tridiagonal System Solvers On a Gpumentioning
confidence: 99%
See 4 more Smart Citations
“…The paper also suggested the possibility of reducing the system size by using the cyclic reduction and the global memory in order to fit the reduced system into the local memory. The cyclic reduction and the local memory were also used in [6]. The paper introduced a clever permutation pattern which reduces the number of bank conflicts.…”
Section: Previous Work On Tridiagonal System Solvers On a Gpumentioning
confidence: 99%
“…The idea is that the rows belonging to these four parts are permuted before the beginning of the reduction process in such a way that all odd numbered rows are stored into the first and third section, and all even numbered rows are stored into the second and fourth section. This permutation pattern resembles the one presented in [6]. After the reduction step is performed, the rows are permuted in such a way that all rows, which are going to be odd numbered during the next reduction step, are stored into the second section and all rows, which are going to be even numbered during the next reduction step, are stored into the fourth section.…”
Section: Tridiagonal System Solver Implementationmentioning
confidence: 99%
See 3 more Smart Citations