2010 IEEE International Conference on Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS) 2010
DOI: 10.1109/clusterwksp.2010.5613092
|View full text |Cite
|
Sign up to set email alerts
|

Low-latency explicit communication and synchronization in scalable multi-core clusters

Abstract: Abstract-One of the main challenges in the multi-core area is the communication and synchronization of the cores and the design of an efficient interconnection network that is scalable to multiple cores. In this paper we present an efficient implementation of a scalable system that is targeting multicore systems. Each cluster node consists of 4 processors that support both explicit and implicit communication. Processor's cache is augmented with scratchpad and is merged with the network interface (NI) for reduc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2013
2013
2015
2015

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…In order to make fast matching decision between the Input and Output Queues, several iterative matching algorithms have been proposed for CIOQ switches, such as PIM [23], iSLIP [25] and Dual Round Robin arbitration scheme (DRR) [24]. In this system we designed a more efficient matching algorithm combining the efficiency of PIM2 and the performance of DRR, called special Sequential Iterative Matching (SIM) [22]. In each iteration of the PIM matching algorithm two steps are performed; request and grant.…”
Section: Hardware Platformmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to make fast matching decision between the Input and Output Queues, several iterative matching algorithms have been proposed for CIOQ switches, such as PIM [23], iSLIP [25] and Dual Round Robin arbitration scheme (DRR) [24]. In this system we designed a more efficient matching algorithm combining the efficiency of PIM2 and the performance of DRR, called special Sequential Iterative Matching (SIM) [22]. In each iteration of the PIM matching algorithm two steps are performed; request and grant.…”
Section: Hardware Platformmentioning
confidence: 99%
“…Six of these boards are interconnected through a Virtex2 board that serves as the central switch [22]. The measurement of the execution time has been performed in real-time on the FPGAs using hardware counter augmented to the Microblaze processors.…”
Section: Evaluation Setupmentioning
confidence: 99%