2010 IEEE International Symposium on Performance Analysis of Systems &Amp; Software (ISPASS) 2010
DOI: 10.1109/ispass.2010.5452021
|View full text |Cite
|
Sign up to set email alerts
|

Runahead execution vs. conventional data prefetching in the IBM POWER6 microprocessor

Abstract: After many years of prefetching research, most commercially available systems support only two types of prefetching: software-directed prefetching and hardware-based prefetchers using simple sequential or stride-based prefetching algorithms. More sophisticated prefetching proposals, despite promises of improved performance, have not been adopted by industry. In this paper, we explore the efficacy of both hardware and software prefetching in the context of an IBM POWER6 commercial server. Using a variety of app… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2012
2012
2019
2019

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 40 publications
0
5
0
Order By: Relevance
“…Cain et al studied prefetchers on IBM Power architectures [5]. New designs are proposed to address the over-aggressiveness of current prefetchers [10,11,15].…”
Section: Performance Resultsmentioning
confidence: 99%
“…Cain et al studied prefetchers on IBM Power architectures [5]. New designs are proposed to address the over-aggressiveness of current prefetchers [10,11,15].…”
Section: Performance Resultsmentioning
confidence: 99%
“…And sometimes the technique is intended for certain targeted applications such as excessively memory-bound programs. A fully automated mechanism to make these decisions may have difficulty achieving effectiveness reported in literature [10], especially when targeting a broad spectrum of applications. ii) A contributing factor to the manual approach is the notion of "delinquent instructions": a few culprits created most of the performance problems.…”
Section: Background and Related Workmentioning
confidence: 99%
“…MLP), in the successor blocks that we can hoist (a function of ALPBB, PDIH and PHI). • tendency to stall at branch resolution (ASPCB) • modest L1 D$ miss rate as we do not employ Runahead [10,4] or iCFP [19] • good I$ performance…”
Section: Experimental Evaluationmentioning
confidence: 99%