12th Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2004. Proceedings. 2004
DOI: 10.1109/empdp.2004.1271460
|View full text |Cite
|
Sign up to set email alerts
|

Improving cache locality with blocked array layouts

Abstract: Minimizing cache misses is one of the most important factors to reduce average latency for memory accesses. Tiled codes modify the instruction stream to exploit cache locality for array accesses. In this paper, we further reduce cache misses, restructuring the memory layout of multidimensional arrays, that are accessed by tiled instruction code. In our method, array elements are stored in a blocked way, exactly as they are swept by the tiled instruction stream. We present a straightforward way to easily transl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…(2) The approach cannot support simultaneous parallel small tile access. Although other studies() have used the Z‐Morton layout to exploit 2‐D data locality and reduce conflict misses, those approaches have increased the cycle time or the access latency due to the versatility of the Morton‐index translations. In contrast, Lim and Thottethodi proposed a hardware‐based bit‐permuting unit to translate the raster scan order address to a Z‐Morton address.…”
Section: Introductionmentioning
confidence: 99%
“…(2) The approach cannot support simultaneous parallel small tile access. Although other studies() have used the Z‐Morton layout to exploit 2‐D data locality and reduce conflict misses, those approaches have increased the cycle time or the access latency due to the versatility of the Morton‐index translations. In contrast, Lim and Thottethodi proposed a hardware‐based bit‐permuting unit to translate the raster scan order address to a Z‐Morton address.…”
Section: Introductionmentioning
confidence: 99%