2012
DOI: 10.1007/978-3-642-29737-3_31
|View full text |Cite
|
Sign up to set email alerts
|

Generating GPU Code from a High-Level Representation for Image Processing Kernels

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2012
2012
2018
2018

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 3 publications
0
4
0
Order By: Relevance
“…It has been used successfully in several works (e.g., Membarth et al [2011]). The specification contains a description of the iteration space I, a precedence relationship R to set order of execution, a partition P to indicate sets of iterations preferably executed on a single processing element, and a set of memory locations that may be read (M r ) or written (M w ) for a given iteration.…”
Section: Mathematical Code Representationsmentioning
confidence: 99%
“…It has been used successfully in several works (e.g., Membarth et al [2011]). The specification contains a description of the iteration space I, a precedence relationship R to set order of execution, a partition P to indicate sets of iterations preferably executed on a single processing element, and a set of memory locations that may be read (M r ) or written (M w ) for a given iteration.…”
Section: Mathematical Code Representationsmentioning
confidence: 99%
“…As is common in GPU compute programming, we assume that code has already been written or generated [23,24,25] and that parameters like size of thread blocks, size of thread groups and other GPU-specific parameters are optimized based on the specific architecture of the GPU.…”
Section: Related Work and Our Contri-butionsmentioning
confidence: 99%
“…In this paper, we use and extend the HIPA cc framework to design image processing kernels [2], [3]. The framework uses a source-to-source compiler based on Clang [7] in order to generate low-level, optimized CUDA and OpenCL code for execution on GPU accelerators.…”
Section: Image Processing Frameworkmentioning
confidence: 99%
“…While we previously presented the description and mapping of point and local operators [2], [3], the notation for global reduction operators is introduced in this paper. Global operators are memory bound and, thus, are first class candidates to investigate the proposed techniques.…”
Section: Introductionmentioning
confidence: 99%