2017 13th International Wireless Communications and Mobile Computing Conference (IWCMC)
DOI: 10.1109/iwcmc.2017.7986470
Parameters optimization of deep learning models using Particle swarm optimization

Abstract: Deep learning has been successfully applied in several fields such as machine translation, manufacturing, and pattern recognition. However, successful application of deep learning depends upon appropriately setting its parameters to achieve high-quality results. The number of hidden layers and the number of neurons in each layer of a deep machine learning network are two key parameters, which have main influence on the performance of the algorithm. Manual parameter setting and grid search approaches somewhat e…
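The abstract's core idea, using particle swarm optimization (PSO) to choose the number of hidden layers and the number of neurons per layer, can be illustrated with a minimal self-contained sketch. This is not the authors' code: the search ranges, PSO coefficients, and the placeholder objective are assumptions; in a real run the objective would train the network and return its validation error.

```python
# Minimal PSO sketch over two integer hyperparameters of a feed-forward network:
# the number of hidden layers and the number of neurons per layer.
# NOTE: the objective below is a hypothetical surrogate, not the paper's method.
import numpy as np

rng = np.random.default_rng(0)

# Assumed search space: hidden layers in [1, 5], neurons per layer in [8, 256].
LOW = np.array([1.0, 8.0])
HIGH = np.array([5.0, 256.0])

def objective(position):
    """Placeholder fitness; replace with real training + validation error."""
    layers, neurons = np.rint(position).astype(int)
    # Toy surrogate that pretends the optimum is 3 layers with 128 neurons.
    return (layers - 3) ** 2 + ((neurons - 128) / 64.0) ** 2

def pso(n_particles=10, iters=30, w=0.7, c1=1.5, c2=1.5):
    pos = rng.uniform(LOW, HIGH, size=(n_particles, 2))
    vel = np.zeros_like(pos)
    pbest_pos = pos.copy()
    pbest_val = np.array([objective(p) for p in pos])
    gbest_idx = pbest_val.argmin()
    gbest_pos, gbest_val = pbest_pos[gbest_idx].copy(), pbest_val[gbest_idx]

    for _ in range(iters):
        r1, r2 = rng.random((2, n_particles, 2))
        # Standard PSO velocity update: inertia + cognitive + social terms.
        vel = w * vel + c1 * r1 * (pbest_pos - pos) + c2 * r2 * (gbest_pos - pos)
        pos = np.clip(pos + vel, LOW, HIGH)
        vals = np.array([objective(p) for p in pos])
        improved = vals < pbest_val
        pbest_pos[improved], pbest_val[improved] = pos[improved], vals[improved]
        if pbest_val.min() < gbest_val:
            gbest_idx = pbest_val.argmin()
            gbest_pos, gbest_val = pbest_pos[gbest_idx].copy(), pbest_val[gbest_idx]
    return np.rint(gbest_pos).astype(int), gbest_val

best, err = pso()
print(f"best: {best[0]} hidden layers, {best[1]} neurons/layer (error {err:.3f})")
```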

Cited by 92 publications (57 citation statements)
References 15 publications
“…Former literature solely discusses how to determine the number of hidden neurons (assuming a single hidden layer), but rarely discusses how to determine the optimal number of hidden layers. This is due to the assumption that a network with only one hidden layer is sufficient to universally approximate almost all functions [3]-[7]. However, several studies have shown that using two hidden layers provides better performance than one hidden layer in some cases [3]-[7].…”
Section: Introduction
confidence: 99%
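The one- versus two-hidden-layer comparison discussed in the excerpt above can be reproduced informally with a small experiment. The sketch below is not from the cited papers; the toy dataset, layer widths, and training settings are arbitrary assumptions.

```python
# Compare a one- vs. two-hidden-layer MLP on a toy dataset (scikit-learn).
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

X, y = make_moons(n_samples=2000, noise=0.25, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for hidden in [(32,), (32, 32)]:  # one hidden layer vs. two hidden layers
    clf = MLPClassifier(hidden_layer_sizes=hidden, max_iter=2000, random_state=0)
    clf.fit(X_tr, y_tr)
    print(hidden, "test accuracy:", round(clf.score(X_te, y_te), 3))
```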
“…With the increasing computational power of modern machines, the application of neural networks with more than one hidden layer has become attractive to researchers, especially through the use of deep neural networks (DNNs) to solve problems in various fields. A DNN is defined as a neural network with numerous hidden layers between the input and output layers [3], [10]; in other words, it is machine learning with deep neural networks. One of the challenges in the successful implementation of deep neural networks lies in determining the network architecture, which is closely related to the number of hidden layers and hidden neurons.…”
Section: Introduction
confidence: 99%
“…Please refer to Table 2. [16,2048] With the objectives of reducing the number of parameters (weights) and the computational burden, and of controlling overfitting, we used a max-pooling (downsampling) layer of a fixed 2 × 2 size after each convolutional layer. Further, we used the ReLU (rectified linear unit) activation function and the Adam optimizer with a fixed learning rate of θ = 0.0001 and the mean squared error (MSE) loss function.…”
Section: Parameter Setup
confidence: 99%
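The setup quoted above (2 × 2 max pooling after each convolutional layer, ReLU activations, Adam with a 0.0001 learning rate, and an MSE loss) maps onto a Keras model roughly as follows. This is an illustrative sketch, not the cited work's code: the input shape, filter counts, and single-output regression head are assumptions.

```python
# Sketch of the quoted training setup in Keras (input shape and filter counts assumed).
import tensorflow as tf
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(64, 64, 1)),
    layers.Conv2D(32, 3, activation="relu", padding="same"),
    layers.MaxPooling2D(pool_size=(2, 2)),   # fixed 2x2 max pooling per conv layer
    layers.Conv2D(64, 3, activation="relu", padding="same"),
    layers.MaxPooling2D(pool_size=(2, 2)),
    layers.Flatten(),
    layers.Dense(1),                          # assumed regression output for the MSE loss
])

model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),  # fixed rate from the excerpt
    loss="mse",
)
```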
“…GAs have many successful applications in the domains of deep learning and CNNs 17,18. Better results can be obtained by applying meta-heuristic methods such as the genetic algorithm (GA) 17 and swarm intelligence 27 to the process of CNN hyperparameter optimization.…”
confidence: 99%
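As a hedged illustration of the meta-heuristic hyperparameter search mentioned in this excerpt, the sketch below runs a small genetic algorithm over a few CNN hyperparameters. It is not from the cited works: the candidate value lists and the surrogate fitness function are placeholders; a real fitness function would train the CNN and return validation accuracy.

```python
# Minimal GA sketch over CNN hyperparameters (filters, kernel size, learning rate).
import random

random.seed(0)
FILTERS = [16, 32, 64, 128]
KERNELS = [3, 5]
LEARNING_RATES = [1e-2, 1e-3, 1e-4]

def random_individual():
    return [random.choice(FILTERS), random.choice(KERNELS), random.choice(LEARNING_RATES)]

def fitness(ind):
    # Placeholder surrogate; a real run would train the CNN and return validation accuracy.
    filters, kernel, lr = ind
    return -abs(filters - 64) / 64 - abs(kernel - 3) - abs(lr - 1e-3) * 100

def evolve(pop_size=12, generations=15, mutation_rate=0.2):
    population = [random_individual() for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]                       # truncation selection
        children = []
        while len(children) < pop_size - len(parents):
            a, b = random.sample(parents, 2)
            child = [random.choice(pair) for pair in zip(a, b)]     # uniform crossover
            if random.random() < mutation_rate:                     # point mutation
                i = random.randrange(3)
                child[i] = random_individual()[i]
            children.append(child)
        population = parents + children
    return max(population, key=fitness)

print("best hyperparameters:", evolve())
```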