Modified Hotspot Cache Architecture: A Low Energy Fast Cache for Embedded Processors

Ali, Kashif; Aboelaze, Mokhtar; Datta, Samir

doi:10.1109/icsamos.2006.300806

Cited by 8 publications

(7 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Besides these designs, other designs also exploit a small L0 cache [24,3,11,4,7,9]. In the HotSpot cache [24] and its variant [3], the L0 cache is used in the I-cache to store "hot" loops which have high execution fraction, to reduce accesses to the L1 cache.…”

Section: Related Workmentioning

confidence: 99%

“…6. 3 In ARM Cortex-M3 [2], while the AGU does not take a whole stage, its clock speed is low, around 100MHz [20]). In this figure, the target architecture is the ARM11 processor [1].…”

Section: Processor-cache Interfacementioning

confidence: 99%

“…In the HotSpot cache [24] and its variant [3], the L0 cache is used in the I-cache to store "hot" loops which have high execution fraction, to reduce accesses to the L1 cache. It does so by using the BTB to identify fetch addresses which belong to the hot loops.…”

Section: Related Workmentioning

confidence: 99%

See 2 more Smart Citations

Revisiting level-0 caches in embedded processors

Duong

Kim²,

Zhao

et al. 2012

Proceedings of the 2012 International Conference on Compilers, Architectures and Synthesis for Embedded Systems

View full text Add to dashboard Cite

Level-0 (L0) caches have been proposed in the past as an inexpensive way to improve performance and reduce energy consumption in resource-constrained embedded processors. This paper proposes new L0 data cache organizations using the assumption that an L0 hit/miss determination can be completed prior to the L1 access. This is a realistic assumption for very small L0 caches that can nevertheless deliver significant miss rate and/or energy reduction. The key issue for such caches is how and when to move data between the L0 and L1 caches. The first new cache, a flow cache, targets a conflict miss reduction in a direct-mapped L1 cache. It offers a simpler hardware design and uses on average 10% less dynamic energy than the victim cache with nearly identical performance. The second new cache, a hit cache, reduces the dynamic energy consumption in a set-associative L1 cache by 30% without impacting performance. A variant of this policy reduces the dynamic energy consumption by up to 50%, with 5% performance degradation.

show abstract

Section: Related Workmentioning

confidence: 99%

“…6. 3 In ARM Cortex-M3 [2], while the AGU does not take a whole stage, its clock speed is low, around 100MHz [20]). In this figure, the target architecture is the ARM11 processor [1].…”

Section: Processor-cache Interfacementioning

confidence: 99%

See 1 more Smart Citation

Revisiting level-0 caches in embedded processors

Duong

Kim²,

Zhao

et al. 2012

Proceedings of the 2012 International Conference on Compilers, Architectures and Synthesis for Embedded Systems

View full text Add to dashboard Cite

show abstract

“…A research [14] point out that some frequently used data are not taken into hotspots cache, and hence they present a new cache framework that introduce hotspots cache and filter cache between L1 cache and memory at the same time. Their experiment shows the cache efficiency is improved.…”

Section: Related Technologies From Hardware Levelmentioning

confidence: 99%

A Survey of the Heterogeneous Computing Platform and Related Technologies

Liu¹,

Liang²,

Wang³

2017

dtetr

View full text Add to dashboard Cite

Abstract.A heterogeneous computing platform is an computing system that composed of different types of computing units. By fully using the computing ability of different types of computing units, the heterogeneous computing platform can achieve better performance and power efficiency than traditional homogeneous computing platform. In this paper, we firstly summarize and analyze the key factors that affect the performance of a heterogeneous computing platform. Next, we conduct a specific survey about these key factors from both software and hardware aspects and introduce some research results and key technologies at present. And then, we introduce the heterogeneous computing framework and make a comparison between OpenCL and HSA which are now both promising. At last, we analyze the future directions of the heterogeneous computing platform.

show abstract

“…Cache [28] on and cache off are used to activate and deactivate caching. The system has three states namely normal state, SOS state and rescue state.…”

Section: Related Workmentioning

confidence: 99%