ACM/IEEE SC 2006 Conference (SC'06) 2006
DOI: 10.1109/sc.2006.62
|View full text |Cite
|
Sign up to set email alerts
|

The Potential Energy Efficiency of Vector Acceleration

Abstract: Energy efficiency of computation is quickly becoming a key problem from the chip through the data center. This paper presents the first quantitative study of the potential energy efficiency of vector accelerators. We propose and study a vector accelerator architecture suitable for implementation in a 70nm technology. The vector architecture has a highbandwidth on-chip cache system coupled to 16 independent memory channels. We show that such an accelerator can achieve speedups of 10X or more on loop kernels in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2008
2008
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(13 citation statements)
references
References 15 publications
(14 reference statements)
0
13
0
Order By: Relevance
“…Since vector processors are by default energy-e cient (Lemuet et al 2006), we added support for vector instructions. We did not consider adding multiple lanes due to the following reasons: (1) we wanted to do minimal changes to the existing scalar processor in order to keep the area/power envelope and reuse the existing processor resources in the most e cient way; (2) adding one additional lane increases area of scalar baseline by 44%; adding four lanes will more than double the area of the processor and it is not acceptable in highly constrained low-end devices.…”
Section: Related Workmentioning
confidence: 99%
“…Since vector processors are by default energy-e cient (Lemuet et al 2006), we added support for vector instructions. We did not consider adding multiple lanes due to the following reasons: (1) we wanted to do minimal changes to the existing scalar processor in order to keep the area/power envelope and reuse the existing processor resources in the most e cient way; (2) adding one additional lane increases area of scalar baseline by 44%; adding four lanes will more than double the area of the processor and it is not acceptable in highly constrained low-end devices.…”
Section: Related Workmentioning
confidence: 99%
“…The vector architectures are compared with conventional superscalar and VLIW architectures for multimedia benchmarks in [74]. Energy-efficiency potentials of vector accelerators for high performance computing systems are discussed in [86]. The efficiency of an architecture depends on the organization of the SIMD units and how they are employed with regard to instruction pipeline and memory hierarchy.…”
Section: Simd Alus and Vector Processorsmentioning
confidence: 99%
“…Energyefficiency potentials of vector accelerators for high performance computing systems are discussed in [28]. Three main limitations of conventional 1D vector architectures are known to be complexity of the central register file, implementation difficulties of precise exception handling, and expensive onchip memory [20].…”
Section: B Related Workmentioning
confidence: 99%