2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PHD Forum 2011
DOI: 10.1109/ipdps.2011.271
|View full text |Cite
|
Sign up to set email alerts
|

An Optimized Reduction Design to Minimize Atomic Operations in Shared Memory Multiprocessors

Abstract: Reduction operations play a key role in modern massively data parallel computation. However, current implementations in shared memory programming APIs such as OpenMP are often cause of computation bottlenecks due to the high number of atomic operations involved. We propose a reduction design that takes advantage of the coupling with a barrier synchronization to optimize the execution of the reduction. Experimental results show how the number of atomic operations involved is dramatically reduced, which can lead… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2011
2011
2022
2022

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 15 publications
0
5
0
Order By: Relevance
“…In the literature, only Speziale et al 16 propose this method: transporting reduction values on the same cache line as the synchronization flag.…”
Section: False Sharingmentioning
confidence: 99%
See 4 more Smart Citations
“…In the literature, only Speziale et al 16 propose this method: transporting reduction values on the same cache line as the synchronization flag.…”
Section: False Sharingmentioning
confidence: 99%
“…Figure 7 illustrates this idea. In the literature, only [18] proposes this method: transporting reduction values on the same cache line as the synchronization flag. Their method is to use a "container" in each node to store both the flag and the variable to be reduced.…”
Section: A False Sharingmentioning
confidence: 99%
See 3 more Smart Citations