16th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2008) 2008
DOI: 10.1109/pdp.2008.41
|View full text |Cite
|
Sign up to set email alerts
|

Understanding the Performance of Sparse Matrix-Vector Multiplication

Abstract: In this paper we revisit the performance issues of the widely used sparse matrix-vector multiplication (SpMxV) kernel on modern microarchitectures. Previous scientific work reports a number of different factors that may significantly reduce performance. However, the interaction of these factors with the underlying architectural characteristics is not clearly understood, a fact that may lead to misguided and thus unsuccessful attempts for optimization. In order to gain an insight on the details of SpMxV perform… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
51
0
2

Year Published

2008
2008
2023
2023

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 53 publications
(56 citation statements)
references
References 19 publications
2
51
0
2
Order By: Relevance
“…Depending on the structure of the input matrix, memory accesses are more or less regular [23]. We conclude that, due to the two-level cache hierarchy, the smaller last-level cache and the lower memory bandwidth, this has a higher impact on performance variation than in the high-performance architecture.…”
Section: B Periodic Samplingmentioning
confidence: 89%
“…Depending on the structure of the input matrix, memory accesses are more or less regular [23]. We conclude that, due to the two-level cache hierarchy, the smaller last-level cache and the lower memory bandwidth, this has a higher impact on performance variation than in the high-performance architecture.…”
Section: B Periodic Samplingmentioning
confidence: 89%
“…The matrix suite for the experiments comprised of 59 matrices for double-precision and 40 matrices for singleprecision arithmetic selected from Tim Davis' collection of sparse matrices [2]. For more information about the matrices used, the reader is referred to [4]. The reason for selecting different sets for double-and single-precision is mainly technical.…”
Section: Experimental Evaluationmentioning
confidence: 99%
“…The SpMV kernel poses a variety of performance issues both in single and multicore configurations [4], [12], [16], which are mainly due to the memory-intensive nature of the SpMV algorithm. To this end, a number of optimization techniques have been proposed, such as register and cache blocking [6], [7] compression [9], [10], [15], column or row reordering [11], and others.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In our recent previous work [5], as well as in related literature [1], the memory subsystem and more specifically the memory bandwidth is identified as the main performance bottleneck of the SpMxV kernel. The above statement can be supported by the fact that SpMxV performs O(nnz) operations on O(nnz) amount of data, which means that most of the data are accessed in a streaming manner and there is little temporal locality.…”
Section: Introductionmentioning
confidence: 99%