2022
DOI: 10.1145/3524298
|View full text |Cite
|
Sign up to set email alerts
|

Rateless codes for near-perfect load balancing in distributed matrix-vector multiplication

Abstract: Large-scale machine learning and data mining applications require computer systems to perform massive matrix-vector and matrix-matrix multiplication operations that need to be parallelized across multiple nodes. The presence of straggling nodes---computing nodes that unpredictably slow down or fail---is a major bottleneck in such distributed computations. Ideal load balancing strategies that dynamically allocate more tasks to faster nodes require knowledge or monitoring of node speeds as well as the ability to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 10 publications
1
4
0
Order By: Relevance
“…Among the first four schemes, LT + IR achieved the best performance for all four scenarios. Note that IR eliminates the packet-loss issue, and this result has also been demonstrated in [ 14 ], where only the straggler issue was considered. This is because LT codes can achieve near-perfect load balance among the worker nodes in the presence of stragglers.…”
Section: Performance Evaluationsupporting
confidence: 60%
See 3 more Smart Citations
“…Among the first four schemes, LT + IR achieved the best performance for all four scenarios. Note that IR eliminates the packet-loss issue, and this result has also been demonstrated in [ 14 ], where only the straggler issue was considered. This is because LT codes can achieve near-perfect load balance among the worker nodes in the presence of stragglers.…”
Section: Performance Evaluationsupporting
confidence: 60%
“…As in [ 14 ], we consider a computation delay model as follows. The computation delay at each worker node consists of two parts.…”
Section: System Modelmentioning
confidence: 99%
See 2 more Smart Citations
“…this shows the adjacency matrix of the model, the initial input matrix, abstracted from the relationships between the elements. According to Table 9, the Adjacency matrix multiplication of the model, which is an intermediate computational procedure to obtain the reachable matrix by successively multiplying the adjacency multiplication matrix until the matrix does not change 22 .…”
Section: Analysis Of In Uencing Factorsmentioning
confidence: 99%