2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2010
DOI: 10.1109/sc.2010.53
|View full text |Cite
|
Sign up to set email alerts
|

The 48-core SCC Processor: the Programmer's View

Abstract: The number of cores integrated onto a single die is expected to climb steadily in the foreseeable future. This move to many-core chips is driven by a need to optimize performance per watt. How best to connect these cores and how to program the resulting many-core processor, however, is an open research question. Designs vary from GPUs to cache-coherent shared memory multiprocessors to pure distributed memory chips. The 48-core SCC processor reported in this paper is an intermediate case, sharing traits of mess… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
100
0

Year Published

2011
2011
2014
2014

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 178 publications
(104 citation statements)
references
References 9 publications
0
100
0
Order By: Relevance
“…The dataflow based Ambric Massively Parallel Processor Array [4] implements a similar methodology although with a hierarchical interconnect structure. The Intel SCC [7] on the other hand performs all required reservations at runtime rather than statically. Message passing is implemented through a global shared address space accessible through each PEs Message Passing Buffer (MPB).…”
Section: Related Work and Motivationmentioning
confidence: 99%
“…The dataflow based Ambric Massively Parallel Processor Array [4] implements a similar methodology although with a hierarchical interconnect structure. The Intel SCC [7] on the other hand performs all required reservations at runtime rather than statically. Message passing is implemented through a global shared address space accessible through each PEs Message Passing Buffer (MPB).…”
Section: Related Work and Motivationmentioning
confidence: 99%
“…Architecture: The target architecture is assumed to be hierarchical considering the recent design trend [14], [16], [18]. We maintain an abstract representation of this architecture, in a tree form, as depicted in Fig 4. For illustration, we restrict ourselves to only two levels of communication: (1) set of processors that form clusters communicating via a first level network, e.g., cluster (denoted cl x in Fig 4(b)) and (2) clusters connected via a second level network, e.g., on-chip network (g in Fig 4(b)).…”
Section: A Problem Descriptionmentioning
confidence: 99%
“…That is, given the the task-to-cluster mapping, sort all the tasks that are mapped on a cluster cl (line 4-7). From the sorted list, the heaviest task is chosen (line 8) and assigned to the idlest processor in cl (line [13][14]. If the heaviest one does not fit to the idlest processor, this solution should be marked as invalid.…”
Section: Speculationmentioning
confidence: 99%
See 2 more Smart Citations