2016
DOI: 10.21914/anziamj.v56i0.9371
|View full text |Cite
|
Sign up to set email alerts
|

Implementation of parallel tridiagonal solvers for a heterogeneous computing environment

Abstract: Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications. The standard Thomas algorithm for solving such systems is inherently serial, forming a bottleneck in computation. Algorithms such as cyclic reduction and spike reduce a single large tridiagonal system into multiple small independent systems which are solved in parallel. We develop portable cyclic reduction and the spike algorithm for Open Computing Language implementations on a range of co-processors in a hete… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
8
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(8 citation statements)
references
References 10 publications
0
8
0
Order By: Relevance
“…With the introduction of High-Level synthesis (HLS) tools, a number of more recent works [14], [15], [16], [29] implemented the Thomas, PCR, and Spike algorithms on FPGA using HLS tools. Many of these works did not demonstrate the solver working on full applications, with the exception of Lászl ó et al in 2015 [14] which compared a one factor Black-Scholes option pricing equation using explicit and implicit methods on different architectures such as multi core CPUs, GPUs, and FPGAs.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…With the introduction of High-Level synthesis (HLS) tools, a number of more recent works [14], [15], [16], [29] implemented the Thomas, PCR, and Spike algorithms on FPGA using HLS tools. Many of these works did not demonstrate the solver working on full applications, with the exception of Lászl ó et al in 2015 [14] which compared a one factor Black-Scholes option pricing equation using explicit and implicit methods on different architectures such as multi core CPUs, GPUs, and FPGAs.…”
Section: Related Workmentioning
confidence: 99%
“…Macintosh, et al in 2014 [16] uses an OpenCL based implementation targeting an Altera Stratix V FPGA using PCR and Spike algorithms. The performance on the FPGA is compared to a GPU implementation on an Nvidia Quadro 4000 GPU.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…They introduced a nonstandard hybrid Thomas/PCR algorithm for solving the tridiagonal systems for the implicit solver. In 2016 Macintosh et al [12] compared the capability of PCR algorithm implemented on GPUs using OpenCL (Open Computing Language) for solving tridiagonal linear systems. They evaluated these designs in the context of the solver performance, resource efficiency and numerical accuracy.…”
Section: Introductionmentioning
confidence: 99%
“…Previous efforts in developing FPGA-based routines to solve tridiagonal systems have been limited to solving small systems with the serial omas algorithm [11][12][13]. We have previously investigated the feasibility of FPGA implementations of parallel algorithms including the parallel cyclic reduction and SPIKE [14] for solving small tridiagonal linear systems.…”
Section: Introductionmentioning
confidence: 99%