2010
DOI: 10.1007/s10766-010-0140-7

Comparison of OpenMP 3.0 and Other Task Parallel Frameworks on Unbalanced Task Graphs

Abstract: The UTS benchmark is used to evaluate the expression and performance of task parallelism in OpenMP 3.0 as implemented in a number of recently released compilers and run-time systems. UTS performs parallel search of an irregular and unpredictable search space, as arises, e.g., in combinatorial optimization problems. As such UTS presents a highly unbalanced task graph that challenges scheduling, load balancing, termination detection, and task coarsening strategies. Expressiveness and scalability are compared for…

citations
Cited by 33 publications
(22 citation statements)
references
References 19 publications
(28 reference statements)
“…There are a lot of comparative studies between OpenMP and other CPU specific programming models [11], [12], and also some relevant work on the comparison between CUDA and OpenCL [13], [14]. However, detailed studies on OpenMP and OpenCL are rarely seen.…”
Section: Related Work (mentioning)
confidence: 99%
“…The manuscript precedes the inclusion of the depend clause in OpenMP 4.0, and thus it does not cover the implicit dependencies it enables, critical for our work. This latter observation also pertains to [25], which compares task parallelism under several parallel frameworks based on explicit synchronizations, including OpenMP 3.0 and TBB, which is the backend for the two libraries we have tested.…”
Section: Related Work (mentioning)
confidence: 81%
“…We used the UTS [2], [3], [5] benchmark as the irregular and imbalance workloads in the experiments. Olivier and Prins developed the UTS benchmark using OpenMP [6] for shared memory computers and using UPC [7] for both shared memory and distributed memory computers.…”
Section: Related Work (mentioning)
confidence: 99%
“…Because this region of a shared stack may be accessed concurrently by local and remote threads, the threads require locking so that we must introduce an additional overhead. Work aggregation [3], [5] and multiple work stealing strategies share the idea of paying off overheads. Work aggregation uses a task-chunking technique designed to increase granularity when creating tasks.…”
Section: Related Work (mentioning)
confidence: 99%