2012
DOI: 10.1016/j.micpro.2011.05.005
|View full text |Cite
|
Sign up to set email alerts
|

Optimization of sparse matrix–vector multiplication using reordering techniques on GPUs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
26
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 46 publications
(26 citation statements)
references
References 17 publications
0
26
0
Order By: Relevance
“…Nishtala et al [48] designed a high-level data partitioning method for SpMV to achieve better cache locality on multicore CPUs. Pichel et al [49] evaluated how reordering techniques influence performance of SpMV on GPUs. Baskaran and Bordawekar [50] improved off-chip and on-chip memory access patterns of SpMV on GPUs.…”
Section: Comparison To Related Methodsmentioning
confidence: 99%
“…Nishtala et al [48] designed a high-level data partitioning method for SpMV to achieve better cache locality on multicore CPUs. Pichel et al [49] evaluated how reordering techniques influence performance of SpMV on GPUs. Baskaran and Bordawekar [50] improved off-chip and on-chip memory access patterns of SpMV on GPUs.…”
Section: Comparison To Related Methodsmentioning
confidence: 99%
“…Sparse matrices are matrices with a considerable number of zero value elements and the storage methods mainly developed to reduce storage space [8] by storing information only related to nonzero elements. Most of such methods used in optimizing data storage size, perform data processing, increasing computation performance etc in computer science [3][4] [5].…”
Section: Introductionmentioning
confidence: 99%
“…few applications need time efficient storage methods and few will need space efficient methods, storage methods are selected according to the purpose and applications [5][2] [6].…”
Section: Introductionmentioning
confidence: 99%
“…Enhancements are implemented on graphics processing units (GPUs) to speed up the computation. Sparse matrix vector multiplication through reordering techniques has been explored with Tesla C1060 and Tesla M2050 [1]. The computation of isogeometric analysis stiffness matrix exhibits increased speed when implemented on GeForce GTX680 [2].…”
Section: Introductionmentioning
confidence: 99%