2021
DOI: 10.1002/cpe.6449
|View full text |Cite
|
Sign up to set email alerts
|

Solving tridiagonal Toeplitz systems of linear equations on GPU‐accelerated computers

Abstract: The aim of this article is to show that solvers for tridiagonal Toeplitz systems of linear equations can be efficiently implemented for a variety of modern GPU-accelerated and multicore architectures using OpenACC. We consider two parallel algorithms for solving such systems with special assumptions about coefficient matrices. As the first algorithm, we propose a new, faster implementation of the divide and conquer method.The next algorithm is a new, vectorizable algorithm based on a recently introduced sequen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…Note that the prefix x in function names indicates the precision TA B L E 4 Relative error of all considered methods (double precision). 3.3e-10 1.2e-11 0.0e-00 0.0e-00 0.0e-00 0.0e-00 0.0e-00 0.0e-00 2 30 2 4 4.2e-10 3.5e-11 0.0e-00 0.0e-00 0.0e-00 0.0e-00 0.0e-00 0.0e-00 TA B L E 5 Relative error of all considered methods (single and mixed precision). used (i.e., D for double, F for single, respectively).…”
Section: Results Of Experimentsmentioning
confidence: 99%
See 1 more Smart Citation
“…Note that the prefix x in function names indicates the precision TA B L E 4 Relative error of all considered methods (double precision). 3.3e-10 1.2e-11 0.0e-00 0.0e-00 0.0e-00 0.0e-00 0.0e-00 0.0e-00 2 30 2 4 4.2e-10 3.5e-11 0.0e-00 0.0e-00 0.0e-00 0.0e-00 0.0e-00 0.0e-00 TA B L E 5 Relative error of all considered methods (single and mixed precision). used (i.e., D for double, F for single, respectively).…”
Section: Results Of Experimentsmentioning
confidence: 99%
“…However, if summation is only a part of implemented problem, for example when summed numerical values are computed during summation using a more complicated procedure, then the use of multiple processors can be profitable even for smaller problem sizes. In the future, we plan to implement several algorithms for solving such problems (numerical integration, solving ordinary differential equations and tridiagonal systems of linear equations 30 ) in order to examine how the use of parallel and vectorized compensated summation algorithms affects accuracy and performance.…”
Section: Discussionmentioning
confidence: 99%
“…Dmitruk et al 3 show how the OpenACC standard can be efficiently used to implement solvers for tridiagonal Toeplitz systems of linear equations for a variety of modern GPU‐accelerated and multicore architectures. Two parallel algorithms are studied concerning particular assumptions about coefficient matrices.…”
Section: Accepted Papers For the Special Issue: Summarymentioning
confidence: 99%