Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming 2020
DOI: 10.1145/3332466.3374521

spECK

Cited by 33 publications (5 citation statements)
References: 15 publications
“…When an element $A_{ik} \times B_{kj}$ is computed, one needs to know whether it results in a new non-zero element in column $j$ of $C$ or it needs to be accumulated with already computed values $A_{il} \times B_{lj}$ for any $l \neq k$. This operation can be carried out by a hash table [6,20], by sorting and merging keys [11,15], or by the use of a dense vector, that is, a dense data structure that stores all intermediate accumulated values [21,39]. Section 3 proposes two new algorithms based on the dense vector and the hash table accumulators, respectively, to efficiently run SpGEMM on vector processors.…”
Section: Gustavson Methods (mentioning)
confidence: 99%
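
The accumulation step described in the statement above can be sketched as follows. This is a minimal illustration, not the code of spECK or of the citing paper: it assumes CSR inputs and a row-wise Gustavson formulation (the quote speaks of columns of C; the idea is the same), and the function names `spgemm_hash` and `spgemm_dense` are hypothetical. The first variant uses a hash table (a Python `dict`) as accumulator, the second a dense vector with one slot per column of C.

```python
def spgemm_hash(a_ptr, a_idx, a_val, b_ptr, b_idx, b_val):
    """Row-wise Gustavson SpGEMM (C = A * B) on CSR arrays, hash accumulator."""
    c_ptr, c_idx, c_val = [0], [], []
    for i in range(len(a_ptr) - 1):
        acc = {}  # column index j -> partial sum for C[i, j]
        for p in range(a_ptr[i], a_ptr[i + 1]):
            k, a_ik = a_idx[p], a_val[p]
            for q in range(b_ptr[k], b_ptr[k + 1]):
                j = b_idx[q]
                # New non-zero if j is absent, otherwise accumulate.
                acc[j] = acc.get(j, 0.0) + a_ik * b_val[q]
        for j in sorted(acc):
            c_idx.append(j)
            c_val.append(acc[j])
        c_ptr.append(len(c_idx))
    return c_ptr, c_idx, c_val


def spgemm_dense(a_ptr, a_idx, a_val, b_ptr, b_idx, b_val, n_cols):
    """Same product, using a dense vector of length n_cols as accumulator."""
    c_ptr, c_idx, c_val = [0], [], []
    acc = [0.0] * n_cols     # one slot per column of C
    seen = [False] * n_cols  # marks slots used in the current row
    for i in range(len(a_ptr) - 1):
        touched = []
        for p in range(a_ptr[i], a_ptr[i + 1]):
            k, a_ik = a_idx[p], a_val[p]
            for q in range(b_ptr[k], b_ptr[k + 1]):
                j = b_idx[q]
                if not seen[j]:
                    seen[j] = True
                    touched.append(j)
                acc[j] += a_ik * b_val[q]
        for j in sorted(touched):
            c_idx.append(j)
            c_val.append(acc[j])
            acc[j] = 0.0      # reset only the touched slots
            seen[j] = False
        c_ptr.append(len(c_idx))
    return c_ptr, c_idx, c_val
```

The dense variant avoids hashing but needs memory proportional to the number of columns of C, while the hash variant needs memory proportional to the non-zeros of the current row.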
“…Load imbalance. Load imbalance is a frequently mentioned problem of the Gustavson method, especially on GPU. Parger et al. [39] use a low complexity pre-processing analysis of the matrices, linear in the number of non-zeros. Depending on the result, a binning method can be used to reduce load imbalance of the algorithm on a GPU.…”
Section: Related Work (mentioning)
confidence: 99%
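
One way to picture an analysis pass that is linear in the number of non-zeros, followed by binning, is sketched below. This is an assumption-laden illustration and not spECK's actual analysis: it assumes CSR inputs, estimates the work of each output row as its number of intermediate products, and groups rows into power-of-two work classes. The names `row_work` and `bin_rows` and the parameter `n_bins` are hypothetical.

```python
def row_work(a_ptr, a_idx, b_ptr):
    """Intermediate products per row of C: for each non-zero A[i, k],
    add the number of non-zeros in row k of B. One pass over nnz(A)."""
    work = []
    for i in range(len(a_ptr) - 1):
        total = 0
        for p in range(a_ptr[i], a_ptr[i + 1]):
            k = a_idx[p]
            total += b_ptr[k + 1] - b_ptr[k]
        work.append(total)
    return work


def bin_rows(work, n_bins=4):
    """Group row indices by power-of-two work class.
    Bin 0 holds empty rows, bin b holds rows with work in [2**(b-1), 2**b),
    and the last bin collects everything larger."""
    bins = [[] for _ in range(n_bins)]
    for i, w in enumerate(work):
        bins[min(w.bit_length(), n_bins - 1)].append(i)
    return bins
```

On a GPU, each bin could then be processed with a launch configuration suited to its work class, so that short and long rows do not share the same thread block.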
“…Nagasaka et al (2017) proposed NSparse for NVIDIA Pascal GPU, and Deveci et al (2017) proposed Kokkos for many-core GPU architectures. Parger et al (2020) designed spECK for SpGEMM. Niu et al (2022) proposed a tiled algorithm for SpGEMM on GPUs called TileSpGEMM.…”
Section: SpGEMM Algorithms (mentioning)
confidence: 99%
“…In addition to utilizing PLUB and PGO to evaluate our method's performance, we also compared the performance with six existing implementations, including cuSPARSE (Demouth, 2012), NSparse (Nagasaka et al, 2017), spECK (Parger et al, 2020), bhSPARSE (Liu and Vinter, 2015), Kokkos (Deveci et al, 2017), and TileSpGEMM (Niu et al, 2022). The evaluation is based on the GFLOPS performance, which is twice the number of the intermediate products divided by the execution time.…”
Section: Comparison With Existing Algorithms (mentioning)
confidence: 99%
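
The GFLOPS metric defined in the statement above (twice the number of intermediate products divided by the execution time) can be computed as in the following sketch. It assumes CSR inputs and a time measured in seconds; the function names `intermediate_products` and `gflops` are hypothetical and do not come from any of the cited implementations.

```python
def intermediate_products(a_ptr, a_idx, b_ptr):
    """Count the scalar multiplications A[i, k] * B[k, j] performed by SpGEMM."""
    count = 0
    for i in range(len(a_ptr) - 1):
        for p in range(a_ptr[i], a_ptr[i + 1]):
            k = a_idx[p]
            count += b_ptr[k + 1] - b_ptr[k]
    return count


def gflops(n_products, seconds):
    """Each intermediate product counts as one multiply and one add (2 flops)."""
    return 2.0 * n_products / seconds / 1e9
```

For example, a run with 500 million intermediate products that takes 0.5 seconds is reported as 2 GFLOPS under this definition.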