2014 24th International Conference on Field Programmable Logic and Applications (FPL) 2014
DOI: 10.1109/fpl.2014.6927464
|View full text |Cite
|
Sign up to set email alerts
|

An efficient sparse conjugate gradient solver using a Beneš permutation network

Abstract: Abstract-The conjugate gradient (CG) is one of the most widely used iterative methods for solving systems of linear equations. However, parallelizing CG for large sparse systems is difficult due to the inherent irregularity in memory access pattern. We propose a novel processor architecture for the sparse conjugate gradient method. The architecture consists of multiple processing elements and memory banks, and is able to compute efficiently both sparse matrix-vector multiplication, and other dense vector opera… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
13
0

Year Published

2015
2015
2017
2017

Publication Types

Select...
4
1
1

Relationship

4
2

Authors

Journals

citations
Cited by 12 publications
(13 citation statements)
references
References 15 publications
(17 reference statements)
0
13
0
Order By: Relevance
“…Although early implementations introduced too much overhead [12], recent preprocessing, scheduling and partitioning techniques have been employed successfully [2,7]. Pre-processing has become essential for modern SpMV based applications and it is also used in our approach to enable partitioning and blocking.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Although early implementations introduced too much overhead [12], recent preprocessing, scheduling and partitioning techniques have been employed successfully [2,7]. Pre-processing has become essential for modern SpMV based applications and it is also used in our approach to enable partitioning and blocking.…”
Section: Introductionmentioning
confidence: 99%
“…For the HPC community to use FPGAs effectively, SpMV kernels need to have good performance. However, the dynamic nature of the data flow in SpMV [2,3] requires expensive and complex circuitry and it becomes a challenge to achieve effective use of arithmetic and logic resources and on-chip and off-chip memory bandwidth. Furthermore, performance varies greatly based on the matrix instance [4][5][6].…”
Section: Introductionmentioning
confidence: 99%
“…In this regard, FPGAs may have a considerable advantage compared to general purpose architectures: the fine degree of customisation available can be used to directly and carefully orchestrate data movement on and off-chip resulting in good performance on the SpMV kernel [4]- [6]. Furthermore, when using FPGAs there is great potential for application and domain driven customisation: wordlengths, reduction circuits, memory controller infrastructure can all be optimised to direct resources to the most critical component [7]- [11].…”
Section: Introductionmentioning
confidence: 99%
“…[16] proposes one of the first parametric designs for floating point SpMV and demonstrates how the flexibility of FPGAs can be used to achieve good performance compared to general purpose systems. More recently, the focus has shifted to efficient use of on-chip memory resources and DRAM bandwidth utilisation [5], [7], [9]. Recently, compression techniques have been proposed to improve the performance on memory bound matrices [8], [17] The constant sparsity structure in the context of iterative methods has also been exploited to optimise FPGA architectures for SpMV [18].…”
mentioning
confidence: 99%
See 1 more Smart Citation