2016 26th International Conference on Field Programmable Logic and Applications (FPL) 2016
DOI: 10.1109/fpl.2016.7577352
|View full text |Cite
|
Sign up to set email alerts
|

Optimising Sparse Matrix Vector multiplication for large scale FEM problems on FPGA

Abstract: Abstract-Sparse Matrix Vector multiplication (SpMV) is an important kernel in many scientific applications. In this work we propose an architecture and an automated customisation method to detect and optimise the architecture for block diagonal sparse matrices. We evaluate the proposed approach in the context of the spectral/hp Finite Element Method, using the local matrix assembly approach. This problem leads to a large sparse system of linear equations with block diagonal matrix which is typically solved usi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 21 publications
(9 citation statements)
references
References 23 publications
0
9
0
Order By: Relevance
“…For example, Dziekoński et al (2017) used a conjugate gradient solver and optimized matvec as its important part. Dehnavi et al (2010) and Grigoras et al (2016) presented similar findings and provided optimization strategies. However, in our case the action of the operator is local to the element.…”
Section: Introductionmentioning
confidence: 62%
“…For example, Dziekoński et al (2017) used a conjugate gradient solver and optimized matvec as its important part. Dehnavi et al (2010) and Grigoras et al (2016) presented similar findings and provided optimization strategies. However, in our case the action of the operator is local to the element.…”
Section: Introductionmentioning
confidence: 62%
“…Their implementation demonstrates up to 2× speedup in the best case, but hardly achieves any speedup on most of the matrices due to data format conversion overhead. A similar approach has been implemented by Grigoraş et al (2016) with a better speedup for FPGA architectures.…”
Section: Discussion Of Existing Studiesmentioning
confidence: 99%
“…[50] proposed a new sparse matrix storage method called "BVCSR" to compress the indices of non-zero elements, thus increasing the valid bandwidth of FPGA. [51] and [52] proposed an architecture for large-scale SpMV in the FEM problem. [51] co-designed an FPGA SpMV architecture with a matrix stripping and partitioning algorithms that enable the architecture to process arbitrarily large matrices without changing the PE quantities.…”
Section: Related Workmentioning
confidence: 99%