2014
DOI: 10.1007/s11227-014-1212-z
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing an APSP implementation for NVIDIA GPUs using kernel characterization criteria

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0
1

Year Published

2015
2015
2018
2018

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(14 citation statements)
references
References 10 publications
0
13
0
1
Order By: Relevance
“…In the current prototype, the CPU threads granularity is determined by a simple regular blocking policy, that does not require a specific kernel characterization. For GPU kernels, the library integrates the model presented in [15,21]. This model allows the determination of configuration parameters (grid, threadblock, and L1 cache memory sizes), for NVIDIA's GPUs.…”
Section: Controllers Librarymentioning
confidence: 99%
“…In the current prototype, the CPU threads granularity is determined by a simple regular blocking policy, that does not require a specific kernel characterization. For GPU kernels, the library integrates the model presented in [15,21]. This model allows the determination of configuration parameters (grid, threadblock, and L1 cache memory sizes), for NVIDIA's GPUs.…”
Section: Controllers Librarymentioning
confidence: 99%
“…Our implementation improves the performance of the previous state-of-the-art due to Martín et al [23]. Following the guidelines proposed in [28], we have proposed a refined method to systematically obtain good GPU configuration parameters in terms of GPU code characteristics. The application of this methodology has led to performance improvements of our shortest paths program solutions compared with the use of configuration parameter values suggested by CUDA programming guidelines [21].…”
Section: Answer To the Research Questionmentioning
confidence: 94%
“…Esta implementación mejora el rendimiento de la anterior solución del estado-del-arte propuesta por Martín et al [23]. Siguiendo las pautas propuestas en [28], hemos refinado el método de caracterización de kernels para obtener valores más adecuados para los parámetros de ejecución de las GPUs que conlleven a ejecuciones óptimas o cercanas al óptimo. La aplicación de esta metodología a nuestra implementación ha hecho que pudiéramos obtener valores más apropiados, que implicaron mejoras muy significativas en comparación con los valores recomendados por las guías de programación de CUDA [21].…”
Section: R4 Conclusionesunclassified
“…Our model considers the characterization of the kernel code to automatically optimize launching parameters, such as the thread-block geometry. We propose to integrate the model of qualitative characteristics presented in [105,131] in our Controller. To use this model, the programmers should examine the kernel code, and they should conceptually characterize it in classes according to three main criteria.…”
Section: Characterisation Of Kernels For Executionmentioning
confidence: 99%
“…For GPU kernels, our current prototype library integrates the model presented in [104,105,131]. This model allows to determine configuration parameters (grid, thread-block and L1 cache memory sizes), for NVIDIA's GPUs.…”
Section: Kernel Characterizationmentioning
confidence: 99%