2014
DOI: 10.1137/13092589x
|View full text |Cite
|
Sign up to set email alerts
|

Simultaneous Input and Output Matrix Partitioning for Outer-Product--Parallel Sparse Matrix-Matrix Multiplication

Abstract: Abstract. For outer-product-parallel sparse matrix-matrix multiplication (SpGEMM) of the form C = A×B, we propose three hypergraph models that achieve simultaneous partitioning of input and output matrices without any replication of input data. All three hypergraph models perform conformable one-dimensional (1D) columnwise and 1D rowwise partitioning of the input matrices A and B, respectively. The first hypergraph model performs two-dimensional (2D) nonzero-based partitioning of the output matrix, whereas the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
35
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
4
1
1

Relationship

3
3

Authors

Journals

citations
Cited by 31 publications
(35 citation statements)
references
References 21 publications
0
35
0
Order By: Relevance
“…The model proposed in Definition 1 is also distinct from the ones proposed in [1]. The approach in [1] considers a restricted class of parallelizations, known as outer-product algorithms, which leads to hypergraphs with fewer vertices and nets, while our model encompasses 1D (which include outer-product), 2D, and 3D parallelizations as defined in [2].…”
Section: Theoretical Modelmentioning
confidence: 99%
See 4 more Smart Citations
“…The model proposed in Definition 1 is also distinct from the ones proposed in [1]. The approach in [1] considers a restricted class of parallelizations, known as outer-product algorithms, which leads to hypergraphs with fewer vertices and nets, while our model encompasses 1D (which include outer-product), 2D, and 3D parallelizations as defined in [2].…”
Section: Theoretical Modelmentioning
confidence: 99%
“…The approach in [1] considers a restricted class of parallelizations, known as outer-product algorithms, which leads to hypergraphs with fewer vertices and nets, while our model encompasses 1D (which include outer-product), 2D, and 3D parallelizations as defined in [2]. Another difference in the models is that the approach in [1] simultaneously partitions scalar multiplications and output matrix data, while we partition only the multiplications, as described above. Proof.…”
Section: Theoretical Modelmentioning
confidence: 99%
See 3 more Smart Citations