2007 International Conference on Field Programmable Logic and Applications 2007
DOI: 10.1109/fpl.2007.4380769
|View full text |Cite
|
Sign up to set email alerts
|

FPGA Based Sparse Matrix Vector Multiplication using Commodity DRAM Memory

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
11
0

Year Published

2010
2010
2016
2016

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(11 citation statements)
references
References 10 publications
0
11
0
Order By: Relevance
“…Depending on the implementation, the meta-data for CSR is either pre-loaded into the bitstream or dynamically accessed from external memory. While earlier designs were restricted to on-die memory capacities (e.g., [18]), more recent designs incorporate memory hierarchies that can handle large data sets exceeding the available onchip memories [24,25,26,11,10,27,9,28,29,30,14,23].…”
Section: Related Workmentioning
confidence: 99%
“…Depending on the implementation, the meta-data for CSR is either pre-loaded into the bitstream or dynamically accessed from external memory. While earlier designs were restricted to on-die memory capacities (e.g., [18]), more recent designs incorporate memory hierarchies that can handle large data sets exceeding the available onchip memories [24,25,26,11,10,27,9,28,29,30,14,23].…”
Section: Related Workmentioning
confidence: 99%
“…Gregg et al [3] used a variant of CSC, dubbed the sparse matrix architecture and representation (SPAR) format [2], where the row ind and col ptr are combined into a single vector with zero padding introduced at the start of each column of data vector. David DuBois and Andrew DuBois [4] made no assumption about the structure of the sparse matrix with the exception that the implementation was designed to process up to 7 elements per row, and if any row contains fewer than 7 elements it must be padded with zeros to the full 7 elements.…”
Section: Introductionmentioning
confidence: 99%
“…For sparse-matrix vector multiplication kernel, a data reordering scheme was presented in [7]. This data reordering scheme improved the cache-misses when accessing the vector elements from the external memory.…”
Section: A Related Workmentioning
confidence: 99%
“…There has been work in using external memory for other BLAS kernels on FPGA [7], [5]. For sparse-matrix vector multiplication kernel, a data reordering scheme was presented in [7].…”
Section: A Related Workmentioning
confidence: 99%