Proceedings of the 2011 ACM Symposium on Applied Computing 2011
DOI: 10.1145/1982185.1982425
|View full text |Cite
|
Sign up to set email alerts
|

Applying genetic algorithms to optimize the power in tiled SNUCA chip multicore architectures

Abstract: We propose a novel technique for reducing the power consumed by the on-chip cache on SNUCA chip multicore platform. This is achieved by what we call a "remap table", which maps accesses to the cache banks that are as close as possible to the cores, on which the processes are scheduled. With this technique, instead of using all the available cache, we use a portion of the cache and allocate lesser cache to the application. We formulate the problem as an energy-delay(ED) minimization problem and solve it offline… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2018
2018
2018
2018

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 15 publications
0
3
0
Order By: Relevance
“…© Elsevier collecting the overall accesses count to each LLC bank by each core. The trace strategy requires instrumenting and running the application under representative conditions or to simulate its execution in a cycle-accurate, maybe a fullsystem, simulator implementing all the details of the system memory hierarchy [41], [39], [55]. The bank_count approach delivers coarser information than trace-based strategies but can be adopted more easily by adding only specific hardware counters to measure the number of accesses to the LLC banks from each core [44].…”
Section: Program Memory Layout Optimizationsmentioning
confidence: 99%
See 2 more Smart Citations
“…© Elsevier collecting the overall accesses count to each LLC bank by each core. The trace strategy requires instrumenting and running the application under representative conditions or to simulate its execution in a cycle-accurate, maybe a fullsystem, simulator implementing all the details of the system memory hierarchy [41], [39], [55]. The bank_count approach delivers coarser information than trace-based strategies but can be adopted more easily by adding only specific hardware counters to measure the number of accesses to the LLC banks from each core [44].…”
Section: Program Memory Layout Optimizationsmentioning
confidence: 99%
“…taking them from the executable binary, or calculating them at runtime after a phase where the running application is profiled. To activate the optimized memory layout, the OS fills a small hardware remap table in each core, similarly to some existing approaches [41], [42], [43]. Specifically, the remap table is accessed from a core before the LLC access (e.g.…”
Section: Bank Level Data Layout Optimizations In Snucamentioning
confidence: 99%
See 1 more Smart Citation