2011
DOI: 10.1007/978-3-642-21487-5_12
|View full text |Cite
|
Sign up to set email alerts
|

Performance Analysis and Tuning of Automatically Parallelized OpenMP Applications

Abstract: Abstract. Automatic parallelization combined with tuning techniques is an alternative to manual parallelization of sequential programs to exploit the increased computational power that current multi-core systems offer. Automatic parallelization concentrates on finding any possible parallelism in the program, whereas tuning systems help identifying efficient parallel code segments and serializing inefficient ones using runtime performance metrics. In this work we study the performance gap between automatic and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 13 publications
(15 reference statements)
0
9
0
Order By: Relevance
“…Recall a key challenge in doing so: Because of the interactions of compiler optimizations within and across program sections, the sheer number of possible optimization combinations would make a brute-force approach infeasible -leading to extremely long tuning times. This paper pursues an approach that exploits locality of optimization interactions [20] -distant program sections do not influence each others significantly, allowing substantial pruning of the number of optimization variants that need to be explored.…”
Section: Portable Tuning Systemmentioning
confidence: 99%
“…Recall a key challenge in doing so: Because of the interactions of compiler optimizations within and across program sections, the sheer number of possible optimization combinations would make a brute-force approach infeasible -leading to extremely long tuning times. This paper pursues an approach that exploits locality of optimization interactions [20] -distant program sections do not influence each others significantly, allowing substantial pruning of the number of optimization variants that need to be explored.…”
Section: Portable Tuning Systemmentioning
confidence: 99%
“…Hyper-parameters Tuning: The concept of auto-tuning OpenMP code is well-established [21][22][23][24], and as one can assume, the choice of each environment variable can greatly affect the performance of the code [25]. Consider for example the dynamic scheduling option: If the chosen chunk_size is too small, the resulting numerous work segments cause high overhead.…”
Section: Related Workmentioning
confidence: 99%
“…The significance of loop transformations is described in earlier works . Loop transformation enhances coarse‐grain parallelism.…”
Section: Related Workmentioning
confidence: 99%
“…Imai claims that without any loop transformation, a program does not have much coarse‐grain parallelism between different loops. Recent research has therefore proposed automatic tuning techniques to be essential parts of the auto‐parallelization tools with the aim to overcome performance degradation. The studies mentioned above do not evaluate the effect of loop transformation techniques, such as tiling, fission, fusion, unrolling, and peeling on parallelization.…”
Section: Introductionmentioning
confidence: 99%