2016
DOI: 10.48550/arxiv.1612.00521
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Performance Modeling of Distributed Deep Neural Networks

Abstract: During the past decade, machine learning has become extremely popular and can be found in many aspects of our every day life. Nowayadays with explosion of data while rapid growth of computation capacity, Distributed Deep Neural Networks (DDNNs) which can improve their performance linearly with more computation resources, have become hot and trending. However, there has not been an in depth study of the performance of these systems, and how well they scale.In this paper we analyze CNTK, one of the most commonly… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 3 publications
0
2
0
Order By: Relevance
“…However, the best performance and alleviation overhead can come from tuning distributed algorithm and distributed system framework properties. A recent work [36] focused on analyzing DNNs performance by using CNTK framework. The performance model was to capture the scalability of the system while increasing the computation nodes in small and large clusters.…”
Section: B Experimental Evaluationmentioning
confidence: 99%
“…However, the best performance and alleviation overhead can come from tuning distributed algorithm and distributed system framework properties. A recent work [36] focused on analyzing DNNs performance by using CNTK framework. The performance model was to capture the scalability of the system while increasing the computation nodes in small and large clusters.…”
Section: B Experimental Evaluationmentioning
confidence: 99%
“…CNN is more powerful and has a better performance compared to other traditional deep learning algorithms due to automatic feature extraction capability. The main drawback of CNN is the long training time and the complex neural architecture ( Hashemi et al, 2016 ; Kim et al, 2017 ). The complex neural networks structures take weeks and months to complete the training process in the case of big datasets.…”
Section: Introductionmentioning
confidence: 99%