2011
DOI: 10.1145/1993316.1993516
|View full text |Cite
|
Sign up to set email alerts
|

Automatic CPU-GPU communication management and optimization

Abstract: The performance benefits of GPU parallelism can be enormous, but unlocking this performance potential is challenging. The applicability and performance of GPU parallelizations is limited by the complexities of CPU-GPU communication. To address these communications problems, this paper presents the first fully automatic system for managing and optimizing CPU-GPU communcation. This system, called the CPU-GPU Communication Manager (CGCM), consists of a run-time library and a set of compiler transformations that w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 52 publications
(5 citation statements)
references
References 17 publications
0
5
0
Order By: Relevance
“…For example, in GPU, the communication between CPU and GPU, or the data movement, is an important part of the calculation. So, many researches evaluated the performance of CPU-GPU communication [16], [17]. In this paper, the impact of the runtime environment was evaluated by comparing two parallel implementations of the heat equation executed on two different runtime environments.…”
Section: Our Contributionmentioning
confidence: 99%
“…For example, in GPU, the communication between CPU and GPU, or the data movement, is an important part of the calculation. So, many researches evaluated the performance of CPU-GPU communication [16], [17]. In this paper, the impact of the runtime environment was evaluated by comparing two parallel implementations of the heat equation executed on two different runtime environments.…”
Section: Our Contributionmentioning
confidence: 99%
“…CPU-GPU data transfer optimizations Data transfers between CPU and GPU have been studied extensively as an important bottleneck for parallelization efforts. Previous work [25,30] established systems for automatic management of CPU-GPU communication. The authors of [29] implemented a system to move OpenMP code to GPUs, optimizing data transfers using data flow analysis.…”
Section: Domain-specific Languagesmentioning
confidence: 99%
“…CGCM [15] uses a technique for automatic management of GPU/CPU memory communication. This technique is similar to our analysis for determining when lazy or eager copy-outs are used.…”
Section: Related Workmentioning
confidence: 99%