Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture 2011
DOI: 10.1145/2155620.2155623
|View full text |Cite
|
Sign up to set email alerts
|

Bundled execution of recurring traces for energy-efficient general purpose processing

Abstract: Technology scaling has delivered on its promises of increasing device density on a single chip. However, the voltage scaling trend has failed to keep up, introducing tight power constraints on manufactured parts. In such a scenario, there is a need to incorporate energy-efficient processing resources that can enable more computation within the same power budget. Energy efficiency solutions in the past have typically relied on application specific hardware and accelerators. Unfortunately, these approaches do no… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
59
0

Year Published

2013
2013
2017
2017

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 95 publications
(59 citation statements)
references
References 33 publications
0
59
0
Order By: Relevance
“…Hardware acceleration exists in many forms, such as analog accelerators [6,50], static [13,19,28,38,43,52,55] and dynamic datapath accelerators [14,25,27], and programmable accelerators, such as GPUs and DSPs. In this work, we focus on static datapath accelerators.…”
Section: Background and Motivationmentioning
confidence: 99%
“…Hardware acceleration exists in many forms, such as analog accelerators [6,50], static [13,19,28,38,43,52,55] and dynamic datapath accelerators [14,25,27], and programmable accelerators, such as GPUs and DSPs. In this work, we focus on static datapath accelerators.…”
Section: Background and Motivationmentioning
confidence: 99%
“…5.7.8. Core Area Like previous approaches that propose heterogeneous designs (e.g., [57,23,29]), HBA optimizes for a heavily energy/power-constrained future, targeting designs in which energy and power are performance limiters [16]. Also like other heterogeneous designs that attempt to reduce power/energy through increased specialization, the power benefits of HBA come at the cost of additional chip area.…”
Section: Block Info Cachementioning
confidence: 99%
“…Several prior works combine a "cold pipeline" and "hot pipeline" that execute infrequent (cold) and frequent (hot) code traces respectively [46,8,23]. PARROT [46] captures hot code traces and performs optimizations on them to produce code for an optimized VLIW engine.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Recent work has proposed on-chip accelerators for energy efficiency (dark silicon) in the context of general-purpose (i.e., desktop and mobile) workloads [12,13,14,28,32]. While these proposals try to improve the efficiency of the memory hierarchy, the applicability of the proposed techniques to big data workloads is limited due to the deep software stacks and vast datasets in today's server applications.…”
Section: Related Workmentioning
confidence: 99%