2021
DOI: 10.7717/peerj-cs.338

Multi-objective simulated annealing for hyper-parameter optimization in convolutional neural networks

Abstract: In this study, we model CNN hyper-parameter optimization as a bi-criteria optimization problem, where the first objective is the classification accuracy and the second is the computational complexity, measured as the number of floating-point operations. For this bi-criteria problem, we develop a Multi-Objective Simulated Annealing (MOSA) algorithm that obtains high-quality solutions with respect to both objectives. CIFAR-10 is selected as the benchmark datas…
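The bi-criteria search the abstract describes can be sketched as a generic archive-based MOSA loop. This is a minimal illustration only, not the authors' exact algorithm: the `neighbor` and `evaluate` callbacks, the geometric cooling schedule, and the scalarized acceptance rule are all assumptions, with both objectives cast as minimization (e.g., error rate and FLOPs):

```python
import math
import random

def dominates(a, b):
    """True if objective vector a Pareto-dominates b (minimization)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def mosa(init, neighbor, evaluate, t0=1.0, cooling=0.95, steps=200):
    """Archive-based multi-objective simulated annealing (minimization sketch)."""
    current = init
    f_cur = evaluate(current)
    archive = [(current, f_cur)]           # non-dominated solutions found so far
    t = t0
    for _ in range(steps):
        cand = neighbor(current)
        f_cand = evaluate(cand)
        if dominates(f_cand, f_cur):
            accept = True                  # strictly better: always accept
        else:
            # scalarize the total worsening across objectives for the Metropolis test
            delta = sum(max(0.0, c - p) for c, p in zip(f_cand, f_cur))
            accept = random.random() < math.exp(-delta / t)
        if accept:
            current, f_cur = cand, f_cand
            if not any(dominates(f, f_cand) for _, f in archive):
                # drop archive entries the new point dominates, then add it
                archive = [(s, f) for s, f in archive if not dominates(f_cand, f)]
                archive.append((cand, f_cand))
        t *= cooling                       # geometric cooling schedule
    return archive
```

In an HPO setting, `neighbor` would perturb one hyper-parameter (e.g., filter count or layer depth) and `evaluate` would return (validation error, FLOPs); the archive approximates the Pareto front.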

Cited by 19 publications (10 citation statements)
References 23 publications
“…However, the proposed model is not built and scaled for real-world problems with larger datasets such as network data. The Multi-Objective Simulated Annealing (MOSA) algorithm [21] efficiently searches the objective space, outperforming the simulated annealing (SA) algorithm, with the caveat that computational complexity is treated as being as important as test accuracy. Hoopes et al. [22] proposed HyperMorph, a learning-based strategy that eliminates the need to tune hyperparameters during training, reducing computational time but limiting the ability to find optimal values.…”
Section: Related Work
confidence: 99%
“…A first set of quality metrics is related to the resulting Pareto front. Here, hypervolume is the most widely used; see Garrido and Hernández (2019) and Chatelain et al. (2007). Other metrics include the average distance (or generational distance) of the front to a reference set (such as the approximated true Pareto front obtained by exhaustive search, see Smithson et al. (2016), or an aggregated front, see Gülcü and Kuş (2021)); a coverage measure computed as the percentage of the solutions of an algorithm A dominated by the solutions of another algorithm B (Juang & Hsu, 2014; H. Li, Zhang, Tsang, & Ford, 2004); and metrics based on the shape of the Pareto front (Abdolshah et al., 2019) or its diversity (Juang & Hsu, 2014; H. Li et al., 2004).…”
Section: Quality Metrics for Comparing Multi-objective HPO Algorithms
confidence: 99%
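Two of the front-level metrics named above are simple to compute in two dimensions. The sketch below is a minimal illustration under the usual minimization convention; the function names and the rectangle-slicing scheme are my own, not taken from the cited works, and `hypervolume_2d` assumes the front is non-dominated:

```python
def hypervolume_2d(front, ref):
    """Hypervolume of a non-dominated 2-D front w.r.t. a reference point
    (minimization): sum of rectangle slices between the front and ref."""
    pts = sorted(front)          # ascending in f1, hence descending in f2
    hv, prev_y = 0.0, ref[1]
    for x, y in pts:
        hv += (ref[0] - x) * (prev_y - y)
        prev_y = y
    return hv

def coverage(a, b):
    """C-metric: fraction of points in front b dominated by some point in a."""
    def dom(p, q):
        return all(x <= y for x, y in zip(p, q)) and any(x < y for x, y in zip(p, q))
    return sum(any(dom(p, q) for p in a) for q in b) / len(b)
```

A larger hypervolume indicates a front that is both closer to the ideal point and wider; `coverage(a, b)` close to 1 means algorithm A's front largely dominates B's.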
“…Li et al., 2004). The latter can be computed using the spacing and the spread of the solutions: spacing evaluates the diversity of the Pareto points along a given front (Gülcü & Kuş, 2021), whereas spread evaluates the range of the objective-function values (see Zitzler, Deb, and Thiele (2000)). Some authors use performance measures that do not relate to the quality of the front obtained, e.g., execution time (Horn et al., 2017; Parsa et al., 2019; Richter et al., 2016), the number of performance evaluations (Parsa et al., 2019), CPU utilization on parallel computer architectures (Richter et al., 2016), measures that were not considered as an objective but are evaluated at the Pareto solutions (usually confusion-matrix-based measures for classification problems; see Salt et al. (2019)), or measures specific to the HPO algorithm used (e.g., the number of new points suggested per batch is used by Gupta, Shilton, Rana, and Venkatesh (2018) to evaluate the search executed during batch Bayesian optimization).…”
Section: Quality Metrics for Comparing Multi-objective HPO Algorithms
confidence: 99%
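The spacing and spread measures this snippet distinguishes can also be sketched briefly. This is an illustrative implementation of Schott's spacing and a simple per-objective range, not the exact formulas used in the cited papers; the L1 nearest-neighbor distance is an assumption:

```python
import math

def spacing(front):
    """Schott's spacing: standard deviation of nearest-neighbor distances
    along a front (0 means perfectly evenly spaced points)."""
    dists = []
    for i, p in enumerate(front):
        nearest = min(
            sum(abs(a - b) for a, b in zip(p, q))   # L1 distance between points
            for j, q in enumerate(front) if j != i
        )
        dists.append(nearest)
    mean = sum(dists) / len(dists)
    return math.sqrt(sum((mean - d) ** 2 for d in dists) / (len(dists) - 1))

def spread(front):
    """Per-objective extent (range) of the front's objective values."""
    return [max(f) - min(f) for f in zip(*front)]
```

Low spacing with large spread is the desirable combination: evenly distributed points covering a wide range of trade-offs.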
“…The main advantage of 1D-CNN is automatic feature extraction performed through its initial convolutional layers [10,13,34]. However, CNN has a high computational cost, and its architecture design is a difficult task [35].…”
Section: Introduction
confidence: 99%