2014
DOI: 10.1016/j.procs.2014.05.009
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of High Level FPGA Hardware Design for Solving Tri-diagonal Linear Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
9
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 16 publications
(10 citation statements)
references
References 8 publications
0
9
0
Order By: Relevance
“…As such, our underlying goal is to understand the criteria for a given system solver to be amenable to FPGA implementation and uncover the limitations and profitability of such accelerators. Previous work on tridiagonal system solvers for FPGAs utilized both low-level hardware description languages [18], [28], [31] as well as high-level synthesis tools [3], [14], [15], [16], [29]. They demonstrated implementation of standard tridiagonal system solver algorithms (Thomas, PCR, and Spike), evaluating how to best utilize FPGA resources to maximize performance.…”
Section: Introductionmentioning
confidence: 99%
“…As such, our underlying goal is to understand the criteria for a given system solver to be amenable to FPGA implementation and uncover the limitations and profitability of such accelerators. Previous work on tridiagonal system solvers for FPGAs utilized both low-level hardware description languages [18], [28], [31] as well as high-level synthesis tools [3], [14], [15], [16], [29]. They demonstrated implementation of standard tridiagonal system solver algorithms (Thomas, PCR, and Spike), evaluating how to best utilize FPGA resources to maximize performance.…”
Section: Introductionmentioning
confidence: 99%
“…Using the Altera based Opencl, Warne et al [18,19] demonstrated the ease in which a custom tridiagonal linear system solver can be deployed. In this work, we extend our previous efforts towards a more general highly parallel solution, targeting fpgas in particular, but also other Opencl compliant co-processors that may be present within a heterogeneous computing environment.…”
Section: C448mentioning
confidence: 99%
“…We converted the estimated throughput to system solves per second (rather than more usual flop based measures) to ease comparison with the most relevant studies. In reality, data transfer overheads can impede this throughput [19]; however, there are a number of coding practices which can assist in minimising this impact [2].…”
Section: Compute Performancementioning
confidence: 99%
“…FPGA accelerator cards require an order of magnitude less power compared to HPC grade CPUs and GPUs. Previous efforts in developing FPGA-based routines to solve tridiagonal systems have been limited to solving small systems with the serial omas algorithm [11][12][13]. We have previously investigated the feasibility of FPGA implementations of parallel algorithms including the parallel cyclic reduction and SPIKE [14] for solving small tridiagonal linear systems.…”
Section: Introductionmentioning
confidence: 99%