Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques 2008
DOI: 10.1145/1454115.1454156
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid access-specific software cache techniques for the cell BE architecture

Abstract: Ease of programming is one of the main impediments for the broad acceptance of multi-core systems with no hardware support\ud for transparent data transfer between local and global memories. Software cache is a robust approach to provide the user with a\ud transparent view of the memory architecture; but this software approach can suffer from poor performance. In this paper, we propose a hierarchical, hybrid software-cache architecture that classifies at compile time memory accesses in two classes, highlocalit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
30
0

Year Published

2010
2010
2015
2015

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 41 publications
(41 citation statements)
references
References 14 publications
0
30
0
Order By: Relevance
“…Instead, the compiler has to wrap the potentially incoherent accesses with a piece of code that does a software lookup to check if some SPM has a copy of the data, triggers a fine-grained DMA transfer to bring the data to the SPM of the local core, accesses it, and triggers a DMA transfer back if the data is modified. This solution adds huge overheads, as observed in previous works in the area of software caches [22]. This paper avoids these overheads by allowing all the cores to access all the SPMs and by proposing a hierarchy of directories and filters that efficiently tracks the contents of all SPMs and diverts the potentially incoherent accesses to any SPM of the chip.…”
Section: Spm Management In Hybrid Memory Systemsmentioning
confidence: 97%
See 3 more Smart Citations
“…Instead, the compiler has to wrap the potentially incoherent accesses with a piece of code that does a software lookup to check if some SPM has a copy of the data, triggers a fine-grained DMA transfer to bring the data to the SPM of the local core, accesses it, and triggers a DMA transfer back if the data is modified. This solution adds huge overheads, as observed in previous works in the area of software caches [22]. This paper avoids these overheads by allowing all the cores to access all the SPMs and by proposing a hierarchy of directories and filters that efficiently tracks the contents of all SPMs and diverts the potentially incoherent accesses to any SPM of the chip.…”
Section: Spm Management In Hybrid Memory Systemsmentioning
confidence: 97%
“…For a computational loop the code is transformed into a two-nested loop that uses tiling to do the computation [19,20,22,41], as shown in Figure 3. Each iteration of the outermost loop executes three phases: (1) a control phase that maps chunks of the array sections to the SPMs, (2) a synchronization phase that waits for the completion of the DMA transfers, and (3) a work phase that performs the computation for the currently mapped chunks of data.…”
Section: Compiler and Runtime Supportmentioning
confidence: 99%
See 2 more Smart Citations
“…Compilers succeed in generating code for LMs when the computation is based on predictable memory access patterns [7] but, when non-predictable memory access patterns are found, compilers need to ensure correctness by applying complex analyses such as memory aliasing [8], [9], [10]. When compilers cannot ensure that there is no aliasing between two memory references that may target copies of the same data in the LM and in the cache hierarchy, they must conservatively avoid using the LM.…”
Section: Introductionmentioning
confidence: 99%