2005
DOI: 10.1145/1105734.1105742
|View full text |Cite
|
Sign up to set email alerts
|

Hardware-modulated parallelism in chip multiprocessors

Abstract: Chip multi-processors (CMPs) already have widespread commercial availability, and technology roadmaps project enough on-chip transistors to replicate tens or hundreds of current processor cores. How will we express parallelism, partition applications, and schedule/place/migrate threads on these highly-parallel CMPs?This paper presents and evaluates a new approach to highly-parallel CMPs, advocating a new hardware-software contract. The software layer is encouraged to expose large amounts of multi-granular, het… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2006
2006
2011
2011

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 21 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…Quicksort presented in [25] outperforms the well-known parallel sample sort [4], and consumes less memory for uni and multiprocessors [24]. Chen et al [2] proposed a hardware-software module for managing threads on a 16-core simulation. They achieved speedups from 4 to 11 times for some benchmarks including qsort (the standard implementation from quicksort) compared to singlecore.…”
Section: Quicksort Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Quicksort presented in [25] outperforms the well-known parallel sample sort [4], and consumes less memory for uni and multiprocessors [24]. Chen et al [2] proposed a hardware-software module for managing threads on a 16-core simulation. They achieved speedups from 4 to 11 times for some benchmarks including qsort (the standard implementation from quicksort) compared to singlecore.…”
Section: Quicksort Related Workmentioning
confidence: 99%
“…(2) Comparison sorts which depend on running comparison operations across the keys to find the correct relative order. In this work, we target both types.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Thread migration in distributed memory parallel computers is studied in, e.g., [10][11][12] to maximize data access locality, and thus minimize communication between the processors. In the context of chip multiprocessors, thread migration and load-balancing in general are discussed, e.g., in [13,14]. In the context of grid computing, the thread migration is studied by Chaudhary, Jiang, and Walters in [15,16].…”
Section: Moving Threads System and Related Workmentioning
confidence: 99%
“…The Network-Driven Processor proposed by Chen et al [10] also shares several features with CAPSULE, and its initial version called Agent Programming+SOMT [21]. They similarly propose to delegate thread creation decisions at run-time to the hardware, and show an example application on a CMP; there is also a significant development on streaming applications and support.…”
Section: Related Workmentioning
confidence: 99%