Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
Proceedings 1998 Fourth International Symposium on High-Performance Computer Architecture
DOI: 10.1109/hpca.1998.650564
|View full text |Cite
|
Sign up to set email alerts
|

Performance evaluation of tiling for the register level

Abstract: Tiling is a well-known loop transformation, which is basically used to expose coarse-grain parallelism and to exploit data reuse at the cache level. However, it can also be used to exploit data reuse at the register level and to improve programs's ILP. Previous work on tiling and also commercial compilers are able to perform tiling for the register level in more than one dimension when the iteration space is rectangular. Non-rectangular iteration spaces are commonly found in linear algebra algorithms or can ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Publication Types

Select...
2
2

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(10 citation statements)
references
References 26 publications
0
10
0
Order By: Relevance
“…For problem sizes that are very small and/or not multiple of the tile sizes, the execution time wasted on boundary tiles is significant and in these tiles less ILP and less data reuse than in non-boundary tiles is achieved [11].…”
Section: Performance Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…For problem sizes that are very small and/or not multiple of the tile sizes, the execution time wasted on boundary tiles is significant and in these tiles less ILP and less data reuse than in non-boundary tiles is achieved [11].…”
Section: Performance Resultsmentioning
confidence: 99%
“…Tiling the register level outperforms other compiler optimizations due to two reasons: 1) it always achieves ILP in the loop body [11] and 2) the number of load/store instructions is significantly reduced (register reuse). Transformations such as inner unrolling can also achieve good levels of ILP in most of the cases.…”
Section: Performance Resultsmentioning
confidence: 99%
See 3 more Smart Citations