2015
DOI: 10.1007/s10766-015-0398-x
|View full text |Cite
|
Sign up to set email alerts
|

A Loosely Coordinated Model for Heap-Based Priority Queues in Multicore Environments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 30 publications
0
10
0
Order By: Relevance
“…Such results confirm our expectation of acceptable overhead of the redistribution procedure on the efficiency of the algorithm, because there are not global synchronization among the threads. Only families F (3) , F (5) , and F (6) show values for the efficiency E N < 0.7 with N = 16 threads, because the presence of features (the peak in the corner or the discontinuity) needing a more frequent redistribution of sub-domains. A last set of experiment is aimed to measure the effectiveness of the GPU as floating point accelerator in the hybrid Algorithm 2 as described in the previous section.…”
Section: Test Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…Such results confirm our expectation of acceptable overhead of the redistribution procedure on the efficiency of the algorithm, because there are not global synchronization among the threads. Only families F (3) , F (5) , and F (6) show values for the efficiency E N < 0.7 with N = 16 threads, because the presence of features (the peak in the corner or the discontinuity) needing a more frequent redistribution of sub-domains. A last set of experiment is aimed to measure the effectiveness of the GPU as floating point accelerator in the hybrid Algorithm 2 as described in the previous section.…”
Section: Test Resultsmentioning
confidence: 99%
“…More precisely, the families F (2) and F (3) show a performance gain of about 5× because their analytic expressions are based only on floating point operations, without the use of trigonometric or exponential functions as for the families F (1) , F (4) , and F (5) . The worst case is represented by the family F (6) with a performance gain of about 1.8× because the thread divergence due to the presence of the selection structure in its analytic expression, that greatly limits the GPU performance when threads follow different paths in the control flow.…”
Section: Test Resultsmentioning
confidence: 99%
See 3 more Smart Citations