2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) 2019
DOI: 10.1109/ispass.2019.00028
|View full text |Cite
|
Sign up to set email alerts
|

Analyzing Machine Learning Workloads Using a Detailed GPU Simulator

Abstract: Most deep neural networks deployed today are trained using GPUs via high-level frameworks such as Tensor-Flow [1] and PyTorch [2]. This paper describes changes we made to the GPGPU-Sim simulator [3], [4] to enable it to run PyTorch by running PTX kernels included in NVIDIA's cuDNN [5] library. We use the resulting modified simulator, which has been made available publicly with this paper 1 , to study some simple deep learning workloads. With our changes to GPGPU-Sim's functional simulation model we find GPGPU-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 51 publications
(14 citation statements)
references
References 25 publications
0
13
0
Order By: Relevance
“…Both ML and DL come under the big umbrella of artificial intelligence (AI) and aim at learning useful information from the big data 7 . These techniques have gained enormous popularity in the field of network security, over the last decade due to the invention of very powerful graphics processor units (GPUs) 8 . Both ML and DL are powerful tools in learning useful features from the network traffic and predicting the normal and abnormal activities based on the learned patterns.…”
Section: Introductionmentioning
confidence: 99%
“…Both ML and DL come under the big umbrella of artificial intelligence (AI) and aim at learning useful information from the big data 7 . These techniques have gained enormous popularity in the field of network security, over the last decade due to the invention of very powerful graphics processor units (GPUs) 8 . Both ML and DL are powerful tools in learning useful features from the network traffic and predicting the normal and abnormal activities based on the learned patterns.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, specific microarchitectures should be used for a particular study. In this study, we target Maxwell and Pascal microarchitectures for two rea sons: (1) because they share the same ISA [24]; and (2) they have been intensively used to accelerate deep learning models [3], [12], [25].…”
Section: Tested Gpu Architecturesmentioning
confidence: 99%
“…It would also be good to observe how the proposed technique translates to embedded GPUs for the for inference procedure. Lew et al [33] modify GPGPU-Sim in order to study ML workloads and analyze their behavior (e.g. run applications that use cuDNN and PyTorch).…”
Section: Related Workmentioning
confidence: 99%