2013 IEEE International Symposium on Workload Characterization (IISWC) 2013
DOI: 10.1109/iiswc.2013.6704685

Platform-independent analysis of function-level communication in workloads

Abstract: The emergence of many-core and heterogeneous multicore processors has meant that data communication patterns increasingly determine application performance. Microprocessor designers need tools that can extract and represent these producer-consumer relationships for a workload to aid them in a wide range of tasks including hardware-software co-design, software partitioning, and application performance optimization. This paper presents Sigil, a profiling tool that can extract communication patterns within a wor…

Citations: cited by 8 publications (3 citation statements)
References: 26 publications (29 reference statements)
“…A common method for HW/SW partitioning is to profile the application to find the performance hotspot region. These regions are candidates for FPGA acceleration, as long as the overhead of communication with CPU is not significant [15]. To perform hotspot analysis on big data applications, we use Intel Vtune to select the common hotspot modules of the applications running on big and little cores.…”
Section: Performance Hotspot and Post-acceleration Cpu Code Character (citation type: mentioning)
confidence: 99%
“…The capture tool of SynchroTrace is based on the Sigil workload analysis framework [27], which is currently built on top of the Valgrind dynamic binary instrumentation framework, but other instrumentation front-ends are possible. Although Sigil [27] was originally designed to capture communication between functions, for this work it has been adapted to capture local computation of threads and communication between threads. Another important addition of this work is capturing the synchronization behaviors of threads by wrapping prevailing Pthread and OpenMP API calls.…”
Section: Synchrotrace Capture Framework (citation type: mentioning)
confidence: 99%
“…The binary instrumentation tool is called Sigil [6], a publicly available utility built on top of the Valgrind/Callgrind framework. Through running multi-threaded Splash2 benchmarks, virtual address read and writes are captured, abstracting away any architecture specific behavior.…”
Section: Monte Carlo Exploration Of Thread Mapping (citation type: mentioning)
confidence: 99%