International Symposium on Code Generation and Optimization, 2004. CGO 2004.
DOI: 10.1109/cgo.2004.1281670
|View full text |Cite
|
Sign up to set email alerts
|

Specialized dynamic optimizations for high-performance energy-efficient microarchitecture

Abstract: We study several

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(13 citation statements)
references
References 26 publications
0
13
0
Order By: Relevance
“…As such, we designed Federation without further plans for horizontally aggregating more than two cores into a single very wide core. For higher single-thread performance, the combination of Federation with techniques that can effectively shorten the critical path-such as runahead execution [Mutlu et al 2003], sophisticated prefetchers [Ganusov and Burtscher 2006], or dynamic optimization [Almog et al 2004]-seems to be the most fruitful path to pursue. An advantage of many such techniques is their toleration of infrequent or long latency communication with the main core, which makes it much easier to implement them using multiple cores of a manycore processor.…”
Section: Discussionmentioning
confidence: 99%
“…As such, we designed Federation without further plans for horizontally aggregating more than two cores into a single very wide core. For higher single-thread performance, the combination of Federation with techniques that can effectively shorten the critical path-such as runahead execution [Mutlu et al 2003], sophisticated prefetchers [Ganusov and Burtscher 2006], or dynamic optimization [Almog et al 2004]-seems to be the most fruitful path to pursue. An advantage of many such techniques is their toleration of infrequent or long latency communication with the main core, which makes it much easier to implement them using multiple cores of a manycore processor.…”
Section: Discussionmentioning
confidence: 99%
“…The recent rePlay [12], [13], [14] and PARROT [15], [16] frameworks enable very aggressive hardware optimizations, by using a dynamically configurable optimization engine running in parallel with a high performance execution core. The key idea in these frameworks is the atomic execution of traces.…”
Section: A Trace Cache Optimizationsmentioning
confidence: 99%
“…Architecture research has produced a wide variety microarchitectural, predictor-based optimizations, including value prediction [16,17,24] instruction reuse [26], hardware prefetching [4,7,13,14,18,25,31,27,32], dynamic program optimization [1,19,23,35], pointer caching [8], and cache replacement policies, e.g., [20]. These techniques collect metadata information at runtime about the application's behavior and store it in on-chip buffers or lookup tables.…”
Section: Introductionmentioning
confidence: 99%