2008
DOI: 10.1109/ipdps.2008.4536357

Architecture optimizations for synchronization and communication on chip multiprocessors

Abstract: Chip multiprocessors (CMPs) enable concurrent execution of multiple threads using several cores on a die. Current CMPs behave much like symmetric multiprocessors and do not take advantage of the proximity between cores to improve synchronization and communication between concurrent threads. Thread synchronization and communication instead use memory/cache interactions. We propose two architectural enhancements to support fine grain synchronization and communication between threads that reduce overhead and memo…

Citations: cited by 5 publications (2 citation statements)
References: 21 publications (18 reference statements)
“…They have been using customized full/empty-bit memory system to support full-system fine-grained synchronizations [5][6][7][8]. Also, some work is dedicated to accelerating on-chip fine-grained synchronization [9][10][11]. They modified the ISA of processors and used special instructions to support the fine-grained synchronization.…”
Section: Related Work (citation type: mentioning)
confidence: 99%
“…The benefits of accelerating fine-grain synchronization have previously been discussed in detail [9][10][11][12][13][14][15] and some implementations are tested in software simulators or some real machines [4,9,16]. In these implementations, they either require a separate tag space for each value in the real memory, which requires a customized storage system and customized processors, or only support fine-grained synchronization in a single processor.…”
Section: Synchronization Engine (citation type: mentioning)
confidence: 99%