2019
DOI: 10.48550/arxiv.1911.10504
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Stage-based Hyper-parameter Optimization for Deep Learning

Abstract: As deep learning techniques advance more than ever, hyper-parameter optimization is the new major workload in deep learning clusters. Although hyper-parameter optimization is crucial in training deep learning models for high model performance, effectively executing such a computation-heavy workload still remains a challenge. We observe that numerous trials issued from existing hyper-parameter optimization algorithms share common hyper-parameter sequence prefixes, which implies that there are redundant computat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…The DNN model Shin et al, [2] will constitute a nested architecture of layers in which the number of parameters is in millions. The deep model's high degrees of freedom will enable it to be approximate non-linear as well as linear functions; despite that, this model is constantly at the risk of overfitting to training data.…”
Section: Introductionmentioning
confidence: 99%
“…The DNN model Shin et al, [2] will constitute a nested architecture of layers in which the number of parameters is in millions. The deep model's high degrees of freedom will enable it to be approximate non-linear as well as linear functions; despite that, this model is constantly at the risk of overfitting to training data.…”
Section: Introductionmentioning
confidence: 99%