2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) 2019
DOI: 10.1109/ispass.2019.00015
|View full text |Cite
|
Sign up to set email alerts
|

Full-System Simulation of Mobile CPU/GPU Platforms

Abstract: Graphics Processing Units (GPUs) critically rely on a complex system software stack comprising kernel-and userspace drivers and Just-in-time (JIT) compilers. Yet, existing GPU simulators typically abstract away details of the software stack and GPU instruction set. Partly, this is because GPU vendors rarely release sufficient information about their latest GPU products. However, this is also due to the lack of an integrated CPU/GPU simulation framework, which is complete and powerful enough to drive the comple… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 33 publications
0
2
0
Order By: Relevance
“…Prune=63 Prune=127 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.1x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x for different sizes of a convolutional layer, as well as lowerlevel details about the execution in hardware, we executed the workloads in a Full-System Mali GPU simulator [22].…”
Section: B Channel Pruning Observed Through Gpu Simulationmentioning
confidence: 99%
See 1 more Smart Citation
“…Prune=63 Prune=127 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.1x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x 1.0x for different sizes of a convolutional layer, as well as lowerlevel details about the execution in hardware, we executed the workloads in a Full-System Mali GPU simulator [22].…”
Section: B Channel Pruning Observed Through Gpu Simulationmentioning
confidence: 99%
“…Through the use of higher level libraries, like the ACL, we lose observability that we would normally have when working directly with OpenCL. To understand all the calls and kernel management performed by the Arm Compute Library for different sizes of a convolutional layer, as well as lowerlevel details about the execution in hardware, we executed the workloads in a Full-System Mali GPU simulator [22].…”
Section: B Channel Pruning Observed Through Gpu Simulationmentioning
confidence: 99%