2017
DOI: 10.1002/cpe.3973

Autotuning CUDA compiler parameters for heterogeneous applications using the OpenTuner framework

Abstract: A Graphics Processing Unit (GPU) is a parallel computing coprocessor specialized in accelerating vector operations. The enormous heterogeneity of parallel computing platforms justifies and motivates the development of automated optimization tools and techniques. The Algorithm Selection Problem consists in finding a combination of algorithms, or a configuration of an algorithm, that optimizes the solution of a set of problem instances. An autotuner solves the Algorithm Selection Problem using search and optimiz…

Cited by 11 publications (11 citation statements)
References 37 publications
“…We employ the CLTune program that CLBlast uses for tuning XgemmDirect, and we implement the OpenTuner program for this kernel according to the work of Bruel et al, where we use the unconstrained search space; we report a penalty value in case of a configuration for which XgemmDirect's constraints are not satisfied.…”
Section: Results (mentioning)
confidence: 99%
“…Due to these dependencies, OpenTuner is not capable of auto‐tuning GEMM. For this restriction, the OpenTuner community has offered workarounds, eg, re‐designing the user program so that its tuning parameters become independent or setting a penalty value for configurations where the constraints are not met. However, the first workaround usually requires a significant effort from the user, while the second may cause a poor tuning result as we demonstrate for GEMM in Section 7.…”
Section: Motivation and Related Work (mentioning)
confidence: 99%
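
The penalty-value workaround mentioned in the citation statements above can be illustrated with a minimal sketch. It does not use OpenTuner's API: it is a plain random search over an unconstrained space, and the parameter names (`tile`, `wg`), the divisibility constraint, the cost model, and the penalty constant are all hypothetical stand-ins chosen for illustration.

```python
import random

# Hypothetical penalty reported for configurations that violate a constraint.
PENALTY = 1e9


def satisfies_constraints(cfg):
    """Hypothetical interdependency: the tile size must be a multiple of
    the work-group size."""
    return cfg["tile"] % cfg["wg"] == 0


def measure(cfg):
    """Stand-in cost model; a real tuner would time the compiled kernel."""
    return abs(cfg["tile"] - 64) + abs(cfg["wg"] - 16)


def objective(cfg):
    """Return the measured cost, or the penalty if the constraints fail."""
    if not satisfies_constraints(cfg):
        return PENALTY
    return measure(cfg)


def random_search(space, budget, seed=0):
    """Sample the unconstrained space uniformly and keep the best valid
    configuration seen. Also count how many samples were valid."""
    rng = random.Random(seed)
    best_cfg, best_cost, valid = None, PENALTY, 0
    for _ in range(budget):
        cfg = {name: rng.choice(values) for name, values in space.items()}
        cost = objective(cfg)
        if cost < PENALTY:
            valid += 1
        if cost < best_cost:
            best_cfg, best_cost = cfg, cost
    return best_cfg, best_cost, valid


if __name__ == "__main__":
    space = {"tile": [8, 16, 32, 64, 128], "wg": [3, 5, 7, 16, 24]}
    best_cfg, best_cost, valid = random_search(space, budget=50)
    print(best_cfg, best_cost, f"{valid}/50 samples were valid")
```

Every sample that lands on an invalid configuration consumes tuning budget without yielding a measurement, which is one way a penalty-based search over an unconstrained space can end with a poor result when valid configurations are rare.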