Proceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays 2016
DOI: 10.1145/2847263.2847338
|View full text |Cite
|
Sign up to set email alerts
|

Cask

Abstract: Sparse matrix vector multiplication (SpMV) is an important kernel in many scientific applications. To improve the performance and applicability of FPGA based SpMV, we propose an approach for exploiting properties of the input matrix to generate optimised custom architectures. The architectures generated by our approach are between 3.8 to 48 times faster than the worst case architectures for each matrix, showing the benefits of instance specific design for SpMV.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
0
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 17 publications
(6 citation statements)
references
References 18 publications
0
0
0
Order By: Relevance
“…However, they assume that the memory access of the vector is sequential. [10] proposes a framework that can generate a specific accelerator for each matrix. However, this framework requires a quite long preprocessing time.…”
Section: Related Workmentioning
confidence: 99%
“…However, they assume that the memory access of the vector is sequential. [10] proposes a framework that can generate a specific accelerator for each matrix. However, this framework requires a quite long preprocessing time.…”
Section: Related Workmentioning
confidence: 99%
“…However, each processing element holds a copy of the result vector which has a similar overhead as holding multiple copies of the input vector. [20] proposes a framework that can generate a specific accelerator for each matrix. The framework automatically determines the off-chip data layout and on-chip hardware designs.…”
Section: Related Workmentioning
confidence: 99%
“…Select as many elements as possible from C S ; end if 25: end while column if the longest column has remaining elements (line 10). If the buffered vector value is available, we try to form a group with elements from the buffered column and the shortest column (line [15][16][17][18][19][20][21][22][23]. If the group has not been fulfilled, we use elements from the current shortest column to fill it (line [21][22].…”
Section: Format Conversionmentioning
confidence: 99%
See 2 more Smart Citations