2008
DOI: 10.1016/j.jpdc.2008.05.011
|View full text |Cite
|
Sign up to set email alerts
|

Program optimization carving for GPU computing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
47
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 98 publications
(47 citation statements)
references
References 17 publications
0
47
0
Order By: Relevance
“…Compared to a set of recent studies on performance autotuning by empirical search [11,12,13,14,15], we provide an alternative optimization solution. Certainly search-based approaches are a powerful tool for optimization, but we note two disadvantages of such an approach.…”
Section: Introductionmentioning
confidence: 99%
“…Compared to a set of recent studies on performance autotuning by empirical search [11,12,13,14,15], we provide an alternative optimization solution. Certainly search-based approaches are a powerful tool for optimization, but we note two disadvantages of such an approach.…”
Section: Introductionmentioning
confidence: 99%
“…This empirical optimization with varying configurations is a typical approach in GPU programming because a general performance prediction model for a GPU architecture is not available due to the complexity of its parallel programming model [2,28,29]. Our experiments showed that the thread block with size 16 Â 26 yields the best performance in the block-level facet processing implementation.…”
Section: Thread-block Configurationmentioning
confidence: 91%
“…Later, Ryoo et al developed their ideas further on efficient GPU program optimization, and revealed that optimizing an application for maximum performance goes beyond simply application of a set of optimization techniques to code. The difficulty comes from the fact that the interactions among the underlying architectural and programming model constraints affect performance in a non-linear fashion [29]. They modeled GPU programming for maximum performance as a multivariable optimization problem.…”
Section: Related Workmentioning
confidence: 99%
“…The limited research in kernel code optimization has resulted in the underutilization of these devices. Similar programming paradigms such as CUDA have been studied for many years to enable their use on these computing devices [8,9,16]. However, optimization techniques on CUDA are specified on heterogeneous multicore architectures with nVidia GPUs, which may be inappropriate for various OpenCL-compatible devices.…”
Section: Figmentioning
confidence: 99%