2009
DOI: 10.1145/1543135.1542526
|View full text |Cite
|
Sign up to set email alerts
|

Binary analysis for measurement and attribution of program performance

Abstract: Modern programs frequently employ sophisticated modular designs. As a result, performance problems cannot be identified from costs attributed to routines in isolation; understanding code performance requires information about a routine's calling context. Existing performance tools fall short in this respect. Prior strategies for attributing context-sensitive performance at the source level either compromise measurement accuracy, remain too close to the binary, or require custom compilers. To understand the per… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0

Year Published

2009
2009
2022
2022

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 24 publications
(39 citation statements)
references
References 20 publications
0
39
0
Order By: Relevance
“…Several tools that analyze serial code and provide useful information for performance tuning and parallelization have been introduced in the recent past [6]- [8], [12]. However, their loop detection mechanisms are neither precise nor inter-procedural.…”
Section: Loop Nesting In Source Code Machine Code and The Executionmentioning
confidence: 99%
See 3 more Smart Citations
“…Several tools that analyze serial code and provide useful information for performance tuning and parallelization have been introduced in the recent past [6]- [8], [12]. However, their loop detection mechanisms are neither precise nor inter-procedural.…”
Section: Loop Nesting In Source Code Machine Code and The Executionmentioning
confidence: 99%
“…However, there might be situations in which loop structures in source code do not correspond exactly to these in binary code due to aggressive loop transformations and function inlining. Even in those cases, we can map the structures without inconsistency of semantics as discussed in [8]. Also, we note that the loop structures in the machine code is one of the important factors in determining the data access locality and performance.…”
Section: Static Analysismentioning
confidence: 99%
See 2 more Smart Citations
“…Instead of relying on compiler-generated debug information to map an instruction to its associated source code line (what AerialVision currently does in source code view), HPCToolkit analyzes the application binaries directly to recover the program structure [40]. It then correlates the statistically sampled performance metrics with the source code structure and presents the metrics with the associated source code.…”
Section: E Identifying Performance Bottlenecks With a Pc-histogrammentioning
confidence: 99%