Proceedings of the 8th Annual IEEE/ACM International Symposium on Code Generation and Optimization 2010
DOI: 10.1145/1772954.1772963
|View full text |Cite
|
Sign up to set email alerts
|

Taming hardware event samples for FDO compilation

Abstract: Feedback-directed optimization (FDO) is effective in improving application runtime performance, but has not been widely adopted due to the tedious dual-compilation model, the difficulties in generating representative training data sets, and the high runtime overhead of profile collection. The use of hardware-event sampling to generate estimated edge profiles overcomes these drawbacks. Yet, hardware event samples are typically not precise at the instruction or basic-block granularity. These inaccuracies lead to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
32
0

Year Published

2012
2012
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 59 publications
(32 citation statements)
references
References 19 publications
0
32
0
Order By: Relevance
“…For example, while Feedback Directed Optimization (FDO) can provide substantial performance gains, the extra step involved in the programmers workflow has stopped this promising technique from being widely adopted [13]. By eliminating any extra steps, we believe that SiblingRivalry can bring autotuning to the mainstream program optimization.…”
Section: Discussionmentioning
confidence: 99%
“…For example, while Feedback Directed Optimization (FDO) can provide substantial performance gains, the extra step involved in the programmers workflow has stopped this promising technique from being widely adopted [13]. By eliminating any extra steps, we believe that SiblingRivalry can bring autotuning to the mainstream program optimization.…”
Section: Discussionmentioning
confidence: 99%
“…This speedup is due to a series of components in the framework. Initially, using traditional PMU-based sampling to collect frequency profile, as described in [9], approximately 4.35% speedup can be achieved. If we use LBR to collect the profile, the speedup increases to 7.38%.…”
Section: Over/under Sampling Due To Optimizationmentioning
confidence: 99%
“…Although Chen [10] presented how hardware event samples can be used for FDO of general programs and GWP [23] profiles have provided performance insights for datacenter applications, not much has been studied about other applications of FDO on datacenter applications. As instant profiling enables various types of profiles to be collected continuously from datacenters, we expect this information to contribute to improving the performance of cloud computing.…”
Section: Future Workmentioning
confidence: 99%