Proceedings of ACM SIGPLAN International Workshop on Libraries, Languages, and Compilers for Array Programming 2014
DOI: 10.1145/2627373.2627379
|View full text |Cite
|
Sign up to set email alerts
|

GPGPU Composition with OCaml

Abstract: GPGPU programming promises high performance. However, to achieve it, developers must overcome several challenges. The main ones are : write and use hyper-parallel kernels on GPU, manage memory transfers between CPU and GPU, and compose kernels, keeping individual performance of components while optimizing the global performance. In this article, we propose to study the composition by distinguishing the location where it is done : kernel composition on the GPU, kernel generation by the CPU, and overall composit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 17 publications
(13 reference statements)
0
2
0
Order By: Relevance
“…Several extensions have been developed on top of them. Parallel skeletons and compositions have been proposed in [39], [40] to make the design of high performance GPGPU applications easier. High-level data structures that can automatically be transferred between host (CPU) and GPU memory have been introduced to Sarek in [41].…”
Section: Map and Reduce With Spoc And Sarekmentioning
confidence: 99%
“…Several extensions have been developed on top of them. Parallel skeletons and compositions have been proposed in [39], [40] to make the design of high performance GPGPU applications easier. High-level data structures that can automatically be transferred between host (CPU) and GPU memory have been introduced to Sarek in [41].…”
Section: Map and Reduce With Spoc And Sarekmentioning
confidence: 99%
“…To improve composition of the overall program, increase reusability and automatically improve performance, we proposed algorithmic constructs built upon SPOC and Sarek [3]. First, we proposed transformation functions that deeply modify kernel internal representations to provide specific computations.…”
Section: High-level Composition and Transformationsmentioning
confidence: 99%