2010 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation 2010
DOI: 10.1109/icsamos.2010.5642059
|View full text |Cite
|
Sign up to set email alerts
|

A Polymorphic Register File for matrix operations

Abstract: Abstract-Previous vector architectures divided the available register file space in a fixed number of registers of equal sizes and shapes. We propose a register file organization which allows dynamic creation of a variable number of multidimensional registers of arbitrary sizes referred to as a Polymorphic Register File. Our objective is to evaluate the performance benefits of the proposed organization. Simulation results using real applications (Floyd and CG) suggest speedups of up to 3 times compared to the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
25
0

Year Published

2011
2011
2017
2017

Publication Types

Select...
2
2
1

Relationship

3
2

Authors

Journals

citations
Cited by 10 publications
(25 citation statements)
references
References 15 publications
0
25
0
Order By: Relevance
“…Currently, only 1D and 2D operands are supported, but the PRF can be extended for any number of dimensions. Potential performance gains are due to multiaxis vectorization, efficient register storage utilization, higher Previous works indicate possible reductions of the number of executed instructions by three orders of magnitude due to PRF [4]. Furthermore, PRFs allow performance benefits when compared to the Cell processor for Floyd and the main kernel of the CG Method -sparse matrix vector multiplication [4].…”
Section: Introductionmentioning
confidence: 99%
See 4 more Smart Citations
“…Currently, only 1D and 2D operands are supported, but the PRF can be extended for any number of dimensions. Potential performance gains are due to multiaxis vectorization, efficient register storage utilization, higher Previous works indicate possible reductions of the number of executed instructions by three orders of magnitude due to PRF [4]. Furthermore, PRFs allow performance benefits when compared to the Cell processor for Floyd and the main kernel of the CG Method -sparse matrix vector multiplication [4].…”
Section: Introductionmentioning
confidence: 99%
“…Potential performance gains are due to multiaxis vectorization, efficient register storage utilization, higher Previous works indicate possible reductions of the number of executed instructions by three orders of magnitude due to PRF [4]. Furthermore, PRFs allow performance benefits when compared to the Cell processor for Floyd and the main kernel of the CG Method -sparse matrix vector multiplication [4]. The PRF programming interface allows high performance dense matrix multiplication with at least 35 times less instructions than a hand-crafted version for the Cell BE [14].…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations