GPU Computing Gems Jade Edition 2012
DOI: 10.1016/b978-0-12-385963-1.00035-6

Dynamic Load Balancing Using Work-Stealing

Cited by 15 publications (13 citation statements)
References 1 publication
“…Our implementation of work-stealing follows Cederman and Tsigas [5], who give an implementation for GPUs of an algorithm due to Arora et al [3]. The implementation is based around a collection of double-ended queues (deques), one per OpenCL work-item in this work.…”
Section: Adding Work-stealing (mentioning)
confidence: 99%
“…execution unit ('work-item') maintains its own task queue, but can steal from another's queue should its own become empty. We present an implementation of work-stealing that builds on an implementation for GPUs, by Cederman and Tsigas [5], of an algorithm due to Arora et al [3]. It is written in OpenCL (a multi-threaded extension of C for programming heterogeneous systems of CPUs, GPUs, and FPGAs [13]) and automatically compiled to hardware using Altera's software development kit for OpenCL (AOCL) [2].…”
Section: Introduction (mentioning)
confidence: 99%
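
The policy described in the statement above (each execution unit keeps its own task queue and steals from another's queue once its own is empty) can be illustrated with a small sketch. This is a single-threaded C++ simulation written for this report, not the OpenCL code of the citing paper and not the implementation of Cederman and Tsigas. The names `Worker`, `pop_bottom`, `steal_top`, and `run_round_robin` are hypothetical, and the round-robin victim selection is an assumption chosen to keep the example deterministic.

```cpp
#include <cstddef>
#include <deque>
#include <optional>
#include <vector>

// Hypothetical worker: owns one double-ended queue of integer task ids.
struct Worker {
    std::deque<int> tasks;
    int executed = 0;  // tasks this worker ran (own or stolen)
    int stolen = 0;    // how many of those were taken from another worker
};

// The owner takes work from the bottom (back) of its own deque.
std::optional<int> pop_bottom(Worker& w) {
    if (w.tasks.empty()) return std::nullopt;
    int task = w.tasks.back();
    w.tasks.pop_back();
    return task;
}

// A thief takes work from the top (front) of a victim's deque.
std::optional<int> steal_top(Worker& victim) {
    if (victim.tasks.empty()) return std::nullopt;
    int task = victim.tasks.front();
    victim.tasks.pop_front();
    return task;
}

// Sequential simulation: in each round every worker runs at most one task.
// A worker whose own deque is empty scans the other workers in order and
// steals from the first non-empty one. Returns the number of tasks executed.
int run_round_robin(std::vector<Worker>& workers) {
    int total = 0;
    bool progressed = true;
    while (progressed) {
        progressed = false;
        for (std::size_t i = 0; i < workers.size(); ++i) {
            std::optional<int> task = pop_bottom(workers[i]);
            for (std::size_t k = 1; !task && k < workers.size(); ++k) {
                task = steal_top(workers[(i + k) % workers.size()]);
                if (task) ++workers[i].stolen;
            }
            if (task) {
                ++workers[i].executed;  // "running" a task is just counting it here
                ++total;
                progressed = true;
            }
        }
    }
    return total;
}
```

If all eight tasks start on worker 0 of four workers, the other three workers obtain their work entirely by stealing, and the load ends up evenly spread. A real GPU or FPGA implementation would run the workers concurrently and would need a synchronized deque rather than `std::deque`.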
“…We developed an efficient LBM algorithm which uses MPI+GPU-based cluster structure model, and combining the advantages of both static and dynamic load balance in the LBM (Willebeek-LeMair and Reeves 1993;Hui and Chanson 1999;Arora et al 2001;Tzeng et al 2010;Cederman and Tsigas 2012). Therefore, we refer to (Colajanni et al 1998;Pai et al 1998;Srisuresh and Gan 1998;Bunt et al 1999;Cardellini et al 2002;Padhy and Rao 2011) research papers, and design LBM operating mechanism based on Queuing Theory, which is divided into MPI strategy mechanism and GPU strategy mechanism.…”
Section: Hybrid MPI/CUDA Program With LBM (mentioning)
confidence: 99%
“…Consider the tricky lock-free code in Figure 6 for stealing from a queue. This function is a part of an intricate work-stealing algorithm originally proposed by Arora et al [16] and presented in the context of GPUs by Cederman and Tsigas [17]. We have augmented the code with remote synchronization highlighted in bold italics.…”
Section: Motivating Scope Promotion (mentioning)
confidence: 99%
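
The figure referred to in the statement above is not reproduced on this page. As a rough illustration of what lock-free stealing from a queue involves, the following is a minimal C++ sketch in the style of the deque of Arora et al. It is not the code from the citing paper or from Cederman and Tsigas, and it contains none of the remote synchronization that statement mentions. The class name `StealDeque`, the fixed capacity, the `int` task type, and the use of default sequentially consistent `std::atomic` operations are all assumptions made for this example.

```cpp
#include <atomic>
#include <cstdint>
#include <optional>

// Bounded work-stealing deque: one owner pushes and pops at the bottom,
// any number of thieves steal from the top using compare-and-swap.
class StealDeque {
public:
    static constexpr std::uint32_t kCapacity = 1024;  // assumed fixed size

    // Owner only. Returns false when the array is full.
    bool push_bottom(int task) {
        std::uint32_t b = bottom_.load();
        if (b == kCapacity) return false;
        slots_[b].store(task);
        bottom_.store(b + 1);
        return true;
    }

    // Owner only. Returns nothing if the deque is empty or a thief won the race.
    std::optional<int> pop_bottom() {
        std::uint32_t b = bottom_.load();
        if (b == 0) return std::nullopt;
        --b;
        bottom_.store(b);
        int task = slots_[b].load();
        std::uint64_t old_age = age_.load();
        if (b > top_of(old_age)) return task;  // thieves cannot reach this slot
        // At most one task was left: reset the deque and bump the tag so that
        // thieves holding a stale (tag, top) pair fail their CAS.
        bottom_.store(0);
        std::uint64_t new_age = pack(tag_of(old_age) + 1, 0);
        if (b == top_of(old_age) &&
            age_.compare_exchange_strong(old_age, new_age)) {
            return task;  // owner won the last task
        }
        age_.store(new_age);  // a thief took it; finish the reset
        return std::nullopt;
    }

    // Any thread. Returns nothing if the deque looks empty or the CAS is lost.
    std::optional<int> steal() {
        std::uint64_t old_age = age_.load();
        std::uint32_t b = bottom_.load();
        if (b <= top_of(old_age)) return std::nullopt;
        int task = slots_[top_of(old_age)].load();
        std::uint64_t new_age = pack(tag_of(old_age), top_of(old_age) + 1);
        if (age_.compare_exchange_strong(old_age, new_age)) return task;
        return std::nullopt;  // another thread changed age_ first
    }

private:
    // age_ packs a 32-bit tag (high half) and the 32-bit top index (low half).
    static std::uint64_t pack(std::uint32_t tag, std::uint32_t top) {
        return (static_cast<std::uint64_t>(tag) << 32) | top;
    }
    static std::uint32_t top_of(std::uint64_t age) {
        return static_cast<std::uint32_t>(age);
    }
    static std::uint32_t tag_of(std::uint64_t age) {
        return static_cast<std::uint32_t>(age >> 32);
    }

    std::atomic<int> slots_[kCapacity];
    std::atomic<std::uint32_t> bottom_{0};
    std::atomic<std::uint64_t> age_{0};
};
```

In this sketch the owner and the thieves only contend when a single task remains, which is where the compare-and-swap on the packed tag and top index decides who gets it. Indices reset only when the deque drains, so `push_bottom` can report a full array even after tasks have been stolen; a production version would need to handle that, and a GPU version would have to choose memory orderings and synchronization scopes deliberately.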