2015 44th International Conference on Parallel Processing 2015
DOI: 10.1109/icpp.2015.41
|View full text |Cite
|
Sign up to set email alerts
|

GPGPU Benchmark Suites: How Well Do They Sample the Performance Spectrum?

Abstract: Recently, GPGPUs have positioned themselves in the mainstream processor arena with their potential to perform a massive number of jobs in parallel. At the same time, many GPGPU benchmark suites have been proposed to evaluate the performance of GPGPUs. Both academia and industry have been introducing new sets of benchmarks each year while some already published benchmarks have been updated periodically. However, some benchmark suites contain benchmarks that are duplicates of each other or use the same underlyin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
2
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 32 publications
0
2
0
Order By: Relevance
“…We operated this initial filtering by following the naming convention explained in Section III-B; specifically, we focused on three macro categories: a first category that refers to GPU memory accesses (L2 and DRAM, starts with lts); a second category that refers to metrics related to compute instructions (starts with gr, gpu, smsp inst or cycles) and a final category that summarizes branch divergence (starts with smsp warp). The reason for choosing these categories is because it has been shown to fit into what is known as principal components for GPGPU performance prediction and/or optimization [30], [31].…”
Section: A Ncu and Kernels' Performance Metricsmentioning
confidence: 99%
See 1 more Smart Citation
“…We operated this initial filtering by following the naming convention explained in Section III-B; specifically, we focused on three macro categories: a first category that refers to GPU memory accesses (L2 and DRAM, starts with lts); a second category that refers to metrics related to compute instructions (starts with gr, gpu, smsp inst or cycles) and a final category that summarizes branch divergence (starts with smsp warp). The reason for choosing these categories is because it has been shown to fit into what is known as principal components for GPGPU performance prediction and/or optimization [30], [31].…”
Section: A Ncu and Kernels' Performance Metricsmentioning
confidence: 99%
“…all the possible slowdown values, Y ), we plot both how the slowdown changes as we vary the two parameters, keeping fixed at 7 the number of CPU interferents (Figure 4), and the total slowdown distribution among all the generated kernels on all the possible number of interferents (Figure 5). We set p 0 ∈ [1, 10, 20, 30, 40, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, 400, 500, 1000] and p 1 ∈ [1, 3,7,9,10,12,14,17,23,30]. Considering these steps and ranges and the number of interferents, we generate 180 different versions of PARKERNEL and 1440 individual experiments calculated by varying the number of interferents.…”
Section: B a Parameterized Kernel: Parkernelmentioning
confidence: 99%
“…Therefore, we used the Rodinia benchmark. 29 It correctly represents a variety of workload 45 and is a proven workload to test computing and profiling techniques.…”
Section: Overhead Analysismentioning
confidence: 99%
“…With regards to GPU simulation acceleration, there exist some research that either choose a portion [51] or perform a pre-characterization [52] of target workloads and then derive the execution time from the simulation results. There are also studies that focus on the generation of GPU benchmarks [53] to reveal GPU's performance spectrum, and modeling of GPU memory systems [54]. These studies are supplementary for GPU performance estimation techniques.…”
Section: Related Workmentioning
confidence: 99%