2010
DOI: 10.1007/978-3-642-12200-2_14
|View full text |Cite
|
Sign up to set email alerts
|

The I/O Complexity of Sparse Matrix Dense Matrix Multiplication

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(11 citation statements)
references
References 5 publications
0
11
0
Order By: Relevance
“…Greiner and Jacob have proven theoretically [26] that as the number of nonzeroes per row exceeds some hardware threshold, namely m M where m is the number [20]. cuBLAS sgemm is a dense-dense matrix multiplication function from a vendor-shipped library.…”
Section: Discussionmentioning
confidence: 99%
“…Greiner and Jacob have proven theoretically [26] that as the number of nonzeroes per row exceeds some hardware threshold, namely m M where m is the number [20]. cuBLAS sgemm is a dense-dense matrix multiplication function from a vendor-shipped library.…”
Section: Discussionmentioning
confidence: 99%
“…The algorithm also takes sparse matrices as input, and never explicitly computes a multiplication of two n × n matrices. Therefore, for input feature dimension and hidden dimension d n, time and space complexity of DIMPA (and implicitly of DIGRAC) is O(|E|dh + 2ndK) and O(2|E| + 4nd + nK), respectively [20,18]. For large-scale networks, DIMPA is amenable to a minibatch version using neighborhood sampling, similar to the minibatch forward propagation algorithm in [19,31].…”
Section: B8 Complexity Analysismentioning
confidence: 99%
“…Figure 1 gives an overview. [19]. For large networks, SIMPA is amenable to a more scalable version following [18].…”
Section: Supervised Lossmentioning
confidence: 99%
“…The algorithm also takes sparse matrices as input, and sparsity is maintained throughout. Therefore, for input feature dimension d in and hidden dimension d, if d = max(d in , d) n, time and space complexity of SIMPA, and implicitly SSSNET, is O(|E|d h 2 + 4nd K) and O(4|E| + 10nd + nK), respectively [23,19]. When the network is large, SIMPA is amendable to a minibatch version using neighborhood sampling, similar to the minibatch forward propagation algorithm in [21,36].…”
Section: Implementation Detailsmentioning
confidence: 99%