Proceedings of the 5th European Conference on Computer Systems 2010
DOI: 10.1145/1755913.1755947
|View full text |Cite
|
Sign up to set email alerts
|

Locating cache performance bottlenecks using data profiling

Abstract: Effective use of CPU data caches is critical to good performance, but poor cache use patterns are often hard to spot using existing execution profiling tools. Typical profilers attribute costs to specific code locations. The costs due to frequent cache misses on a given piece of data, however, may be spread over instructions throughout the application. The resulting individually small costs at a large number of instructions can easily appear insignificant in a code profiler's output.DProf helps programmers und… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
40
0

Year Published

2010
2010
2019
2019

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 61 publications
(43 citation statements)
references
References 18 publications
(15 reference statements)
0
40
0
Order By: Relevance
“…Although widely used for performance tuning and architectural-design verification [7,30], they have only had limited use in automated softwarereliability tools for several reasons:…”
Section: Contributionmentioning
confidence: 99%
“…Although widely used for performance tuning and architectural-design verification [7,30], they have only had limited use in automated softwarereliability tools for several reasons:…”
Section: Contributionmentioning
confidence: 99%
“…Load Imbalance and Serialization can be measured by counting the cycles when a core is idle or busy-waiting for work (and not waiting on synchronization primitives) while another core (or cores) are busy doing work 1 . Distinguishing between Load Imbalance and Serialization is tricky, as they both show up as processor idle cycles while a thread is either 1 We assume that the program is not I/O-bound.…”
Section: A Measuring Software-induced Overheadmentioning
confidence: 99%
“…Distinguishing between Load Imbalance and Serialization is tricky, as they both show up as processor idle cycles while a thread is either 1 We assume that the program is not I/O-bound.…”
Section: A Measuring Software-induced Overheadmentioning
confidence: 99%
See 1 more Smart Citation
“…Software profiling relies on code instrumentation to associate certain hardware performance events with specific code segments (e.g. oprofile, gprof, Intel VTune, Sun Studio collect/analyze) or data structures [2]. Software profiling is conventionally regarded as an off-line method, because it slows down application execution.…”
Section: A Traditional Approaches To Understanding Workload Behaviourmentioning
confidence: 99%