2009 IEEE International Conference on Cluster Computing and Workshops 2009
DOI: 10.1109/clustr.2009.5289193
|View full text |Cite
|
Sign up to set email alerts
|

Coordinating the use of GPU and CPU for improving performance of compute intensive applications

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
37
0
4

Year Published

2010
2010
2018
2018

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 70 publications
(42 citation statements)
references
References 14 publications
1
37
0
4
Order By: Relevance
“…In order to exploit this intra-filter task heterogeneity, we then proposed and implemented a task assignment policy, called demand-driven dynamic weighted round-robin [39] in the Anthill Event Scheduler module, previously shown in Sect. 3.…”
Section: Intra-filter Task Assignmentmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to exploit this intra-filter task heterogeneity, we then proposed and implemented a task assignment policy, called demand-driven dynamic weighted round-robin [39] in the Anthill Event Scheduler module, previously shown in Sect. 3.…”
Section: Intra-filter Task Assignmentmentioning
confidence: 99%
“…Our approach assigns tasks to devices based on the relative performance of that device, with the aim of optimizing the overall execution time. While in previous work [39], we considered intranode parallelism, in the original version of this paper [37] and here we consider both intra-node and inter-node parallelism. We also present new techniques for on-line performance estimation and handling data transfers between the CPU and the GPU.…”
Section: Introductionmentioning
confidence: 98%
“…We further embrace a coarser-grained data parallelism than StreamIt, which results in performance beyond prior work [28]. A number of other filter-based frameworks have been designed [29], [30], [31], [32], [33]. Similarly, they encapsulate computations into filters, a central concept to express algorithms.…”
Section: Related Workmentioning
confidence: 99%
“…Experiments revealed that pure algorithm optimizations may lead to impressive improvements over the single CPU-core version and, even more, the improvement factor rises along with a better coordination between GPGPU and CPU [18]. Unfortunately, this is not an universal truth, mobile GPUs, for example, being designed paying much more attention to power consumption rather than focusing on performance, thus lowering the bus traffic between GPU and other devices, (e.g.…”
Section: State Of the Artmentioning
confidence: 99%