Proceedings of the ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation 2002
DOI: 10.1145/512529.512554
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic hot data stream prefetching for general-purpose programs

Abstract: Prefetching data ahead of use has the potential to tolerate the growing processor-memory performance gap by overlapping long latency memory accesses with useful computation. While sophisticated prefetching techniques have been automated for limited domains, such as scientific codes that access dense arrays in loop nests, a similar level of success has eluded general-purpose programs, especially pointer-chasing codes written in languages such as C and C++.We address this problem by describing, implementing and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
62
0

Year Published

2004
2004
2014
2014

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 107 publications
(63 citation statements)
references
References 22 publications
1
62
0
Order By: Relevance
“…Examples of adaptive optimization systems include HotSpot virtual machine [64] from Sun (now Oracle), Jikes RVM [65] from IBM, and Open Runtime Platform (ORP) [66] from Intel Corporation. To identify hot-spots, researchers have proposed to use online hardware profiling mechanisms such as counters and samplings [67][68][69][70][71], or to use program instrumentation [72][73][74][75][76][77][78], combined instrumentation and sampling [79][80][81], or coupled offline and online profiling [82]. To further improve adaptive optimization, a number of techniques have been developed; for example, recompilation [83], deferred and partial compilation [84][85][86], and dynamic deoptimization [87].…”
Section: B Jit Compilationmentioning
confidence: 99%
“…Examples of adaptive optimization systems include HotSpot virtual machine [64] from Sun (now Oracle), Jikes RVM [65] from IBM, and Open Runtime Platform (ORP) [66] from Intel Corporation. To identify hot-spots, researchers have proposed to use online hardware profiling mechanisms such as counters and samplings [67][68][69][70][71], or to use program instrumentation [72][73][74][75][76][77][78], combined instrumentation and sampling [79][80][81], or coupled offline and online profiling [82]. To further improve adaptive optimization, a number of techniques have been developed; for example, recompilation [83], deferred and partial compilation [84][85][86], and dynamic deoptimization [87].…”
Section: B Jit Compilationmentioning
confidence: 99%
“…The adaptive scheme uses data sampling to estimate the distribution of the workload. Other groups have used code-based sampling to identify data streams and strides [4,22]. While most past techniques exploit dynamic locality for sequential execution, we focus on parallel execution.…”
Section: Related Workmentioning
confidence: 99%
“…Another proposed class of prefetchers utilizes address correlation [3,4,10,11,15,20], which promises wider applicability across a diverse spectrum of workloads because they target generalized memory access patterns. Rather than detecting patterns in data layout, these prefetchers correlate data addresses to predict future misses.…”
Section: Introductionmentioning
confidence: 99%