2018
DOI: 10.1007/978-3-319-78024-5_5

Multilayer Approach for Joint Direct and Transposed Sparse Matrix Vector Multiplication for Multithreaded CPUs

citations
Cited by 1 publication
(33 citation statements)
references
References 17 publications
“…Sparse blocks in the second level of the HSF can either use the COO format or the CSR format. The Šimeček et al.¹ format stores sparse block $B_i$ in the CSR format when the relation $\operatorname{nec}(B_i) \ge \operatorname{rows}(B_i)$ holds; otherwise it stores $B_i$ in the COO format. We use an improved choice heuristic in the proposed format to ensure that sparse blocks use the more space‐efficient format and prefer the CSR format if both formats are equally space‐efficient.…”
Section: Proposed Sparse Matrix Format
Citation type: mentioning
confidence: 99%
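
The two block-format rules described in the statement above can be compared with a minimal sketch. The quoted text does not give the byte-cost model or the index widths, so the cost functions and the default widths below (`idx_bytes`, `ptr_bytes`, `val_bytes`) are assumptions made for illustration only, as are the function names.

```python
# Hypothetical cost model: all byte widths are illustrative assumptions,
# not values taken from the cited paper.

def coo_bytes(nnz, idx_bytes=2, val_bytes=8):
    """COO stores a row index, a column index and a value per nonzero."""
    return nnz * (2 * idx_bytes + val_bytes)


def csr_bytes(nnz, rows, idx_bytes=2, ptr_bytes=4, val_bytes=8):
    """CSR stores rows + 1 row pointers plus a column index and a value per nonzero."""
    return (rows + 1) * ptr_bytes + nnz * (idx_bytes + val_bytes)


def simecek_choice(nnz, rows):
    """Rule quoted above: CSR when nec(B_i) >= rows(B_i), otherwise COO."""
    return "CSR" if nnz >= rows else "COO"


def space_efficient_choice(nnz, rows):
    """Pick the smaller format under the assumed cost model; ties go to CSR."""
    return "CSR" if csr_bytes(nnz, rows) <= coo_bytes(nnz) else "COO"


if __name__ == "__main__":
    for nnz, rows in [(4, 4), (10, 4), (2, 100)]:
        print(nnz, rows, simecek_choice(nnz, rows), space_efficient_choice(nnz, rows))
```

Under these assumed widths the two rules can disagree for small blocks (for example 4 nonzeros in 4 rows), which is the kind of case a space-based heuristic is meant to catch. With other widths the crossover point moves.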
“…Because of how the presented parallel SpMM$^T$V algorithm works (Section 6), it is desirable to keep the nonzero element counts of the HSF's sparse blocks as balanced as possible. The Šimeček et al.¹ format balances the nonzero element counts of sparse blocks using a process called sparse block normalization. Sparse block normalization uses a threshold $\theta$ given by $\theta = \max\left(\alpha \, \frac{\operatorname{nec}(A)}{n_\tau},\ 2^{16}\right)$, where $\alpha \in (0, 1]$ is the format's parameter and $n_\tau$ is the thread count used for parallel SpMM$^T$V. Parameter $\alpha$ is used to adjust the resulting threshold.…”
Section: Proposed Sparse Matrix Format
Citation type: mentioning
confidence: 99%
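
As a worked illustration of the normalization threshold, the sketch below assumes the formula reads θ = max(α · nec(A) / n_τ, 2^16), that is, the per-thread share of nonzeros scaled by α, with a floor of 2^16. The function name and argument names are hypothetical, and how the threshold is then applied to individual blocks is not covered by the quoted statement, so it is not modeled here.

```python
def normalization_threshold(nnz_total, n_threads, alpha=1.0, floor=2 ** 16):
    """Assumed reading of the threshold: max(alpha * nec(A) / n_tau, 2**16).

    nnz_total -- nec(A), the number of nonzero elements in the matrix
    n_threads -- n_tau, the thread count used for the parallel multiplication
    alpha     -- format parameter in the half-open interval (0, 1]
    """
    if not 0 < alpha <= 1:
        raise ValueError("alpha must lie in (0, 1]")
    if n_threads < 1:
        raise ValueError("n_threads must be at least 1")
    return max(alpha * nnz_total / n_threads, floor)


if __name__ == "__main__":
    # Large matrix: the per-thread share dominates the floor.
    print(normalization_threshold(10_000_000, 8, alpha=0.5))
    # Small matrix: the floor of 2**16 applies.
    print(normalization_threshold(100_000, 8))
```

Lowering `alpha` lowers the threshold only until the floor takes over, so for small matrices the parameter has no effect under this reading.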