2022
DOI: 10.1016/j.jpdc.2021.10.003
|View full text |Cite
|
Sign up to set email alerts
|

Using hardware performance counters to speed up autotuning convergence on GPUs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 10 publications
(12 citation statements)
references
References 41 publications
0
12
0
Order By: Relevance
“…We used the simulated tuning to analyze the convergence speed of the profile-based searcher proposed in [1] and of the random search. During the analysis, the autotuning is performed in the defined number of iterations.…”
Section: Experimental Design Materials and Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…We used the simulated tuning to analyze the convergence speed of the profile-based searcher proposed in [1] and of the random search. During the analysis, the autotuning is performed in the defined number of iterations.…”
Section: Experimental Design Materials and Methodsmentioning
confidence: 99%
“… The random search is used by default. To test profile-based searcher [1] , the macro USE_PROFILE_SEARCHER has to be set to 1 in the code of the benchmark (in cpp file). The time for autotuning is restricted to a certain value set by macro TUNE_SEC.…”
Section: Experimental Design Materials and Methodsmentioning
confidence: 99%
See 3 more Smart Citations