Proceedings of the 48th International Conference on Parallel Processing 2019
DOI: 10.1145/3337821.3337878
|View full text |Cite
|
Sign up to set email alerts
|

Accelerated Work Stealing

Abstract: Realizing scalable performance with irregular parallel applications is challenging on large-scale distributed memory clusters. These applications typically require continuous, dynamic load balancing to maintain efficiency. Work stealing is a common approach to dynamic distributed load balancing. However its use in conjunction with advanced network offload capabilities is not well understood. We present a distributed work-stealing system that is amenable to acceleration using the Portals 4 network programming i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
1
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 37 publications
(43 reference statements)
0
3
0
Order By: Relevance
“…The proposed solution uses work stealing [9,10] to mitigate workload on congested servers to the benefit of other servers. Work stealing involves an idle processor stealing one or more tasks from its neighbor's waiting list [9,11,12]. This method already proved its ability to achieve lower latencies in single parallel job execution on multiple processors [9,10,13].…”
Section: Data Planementioning
confidence: 99%
“…The proposed solution uses work stealing [9,10] to mitigate workload on congested servers to the benefit of other servers. Work stealing involves an idle processor stealing one or more tasks from its neighbor's waiting list [9,11,12]. This method already proved its ability to achieve lower latencies in single parallel job execution on multiple processors [9,10,13].…”
Section: Data Planementioning
confidence: 99%
“…33 Larkins et al have introduced an alternative of one-sided RDMA communication called Portals interface 34 to accelerate work stealing in distributed memory. 35 Regarding the approach for reducing migration overhead, Lifflander et al introduced a hierarchical technique that applies the persistence principle to distribute the load of task-based applications. 36 Menon et al proposed using partial information about the global system state to improve stealing decisions as well as balance the load by randomized work-stealing.…”
Section: Related Workmentioning
confidence: 99%
“…Dinan et al have designed a scalable model for work‐stealing using PGAS by the Aggregate Remote Memory Copy Interface (ARMCI), 32 focusing on techniques to reduce locking on the critical path and contention of splitting work 33 . Larkins et al have introduced an alternative of one‐sided RDMA communication called Portals interface 34 to accelerate work stealing in distributed memory 35 . Regarding the approach for reducing migration overhead, Lifflander et al introduced a hierarchical technique that applies the persistence principle to distribute the load of task‐based applications 36 .…”
Section: Related Workmentioning
confidence: 99%