2019
DOI: 10.3390/app9050947
|View full text |Cite
|
Sign up to set email alerts
|

SURAA: A Novel Method and Tool for Loadbalanced and Coalesced SpMV Computations on GPUs

Abstract: Sparse matrix-vector (SpMV) multiplication is a vital building block for numerous scientific and engineering applications. This paper proposes SURAA (translates to speed in arabic), a novel method for SpMV computations on graphics processing units (GPUs). The novelty lies in the way we group matrix rows into different segments, and adaptively schedule various segments to different types of kernels. The sparse matrix data structure is created by sorting the rows of the matrix on the basis of the nonzero element… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
15
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1
1

Relationship

3
5

Authors

Journals

citations
Cited by 22 publications
(17 citation statements)
references
References 75 publications
0
15
0
Order By: Relevance
“…A range of technologies is contributing to the development of these smart systems. These include the Internet of Things (IoT) [32][33][34][35][36], social media [21][22][23]37,38], big data [39][40][41][42][43][44], high performance computing (HPC) [45][46][47][48], cloud, fog, and edge computing [34,[49][50][51][52], and machine learning [36,53]. The applications include healthcare [34,39,[54][55][56], transportation [57,58], and others [59,60].…”
Section: Literature Reviewmentioning
confidence: 99%
“…A range of technologies is contributing to the development of these smart systems. These include the Internet of Things (IoT) [32][33][34][35][36], social media [21][22][23]37,38], big data [39][40][41][42][43][44], high performance computing (HPC) [45][46][47][48], cloud, fog, and edge computing [34,[49][50][51][52], and machine learning [36,53]. The applications include healthcare [34,39,[54][55][56], transportation [57,58], and others [59,60].…”
Section: Literature Reviewmentioning
confidence: 99%
“…The sparsity of a matrix and a suitable data structure determine the computational cost as well as the transmission time [31]. Adaptive partitioning here employs the Compressed Sparse Row (CSR) storage scheme [32]. The size of a matrix in CSR format is 12 •ẑ + 4 • (m + 1) bytes, whereẑ is the total number of non-zero elements and m the number of rows; in contrast, a dense matrix requires 8 • m • n bytes.…”
Section: A Dnn Layer Representationmentioning
confidence: 99%
“…High performance computing (HPC) typically exploits parallel computing features of the underlying software and hardware infrastructure to solve large problems faster. HPC has been applied to SpMV/linear algebra [39]- [42], and other problems for several decades. Big data and data-driven approaches [35], [36], [43], [44] have been used relatively recently in scientific computing to address HPC related challenges, and this has given rise to the convergence of HPC and big data [45], [46].…”
mentioning
confidence: 99%
“…The characteristics of the hardware platforms that could affect the SpMV performance include the DRAM bandwidth, cache hierarchy, the available parallelism in the hardware, and others. A range of hardware architecture are being used for SpMV implementations including CPUs [55]- [57], MIC [40], GPUs [39], [58], and other architectures [59], [60].…”
mentioning
confidence: 99%
See 1 more Smart Citation