Performance Comparison of CNN Models Using Gradient Flow Analysis

Noh, Seol-hyun

doi:10.3390/informatics8030053

Cited by 12 publications

(8 citation statements)

References 21 publications

“…Natural gradient descent ([11], [13]) with momentum satisfying the Nesterov condition can be written as follows: (k) ) (τ is the damping parameter), F is the Fisher matrix, which accounts for the curvature of the surface f in order to bypass local minima and which distinguishes natural gradient descent (2) from stochastic gradient descent (1). The definition of the Fisher matrix traces back to the definition of gradient flow on smooth Riemannian manifolds in [8], where the properties of derivatives (gradients) and curvature are already treated in general cases. Attempts to use this approach in optimization methods have already been made in [14].…”

Section: Fast extremum search method based on NGDM and distribu... (unclassified)
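
The update formula in the excerpt above did not survive extraction, so the following is only a minimal sketch of the general idea it describes: precondition the gradient with a damped Fisher matrix, F + τI, and add momentum. It assumes a logistic-regression model (chosen because its Fisher matrix has a closed form) and plain heavy-ball momentum rather than the Nesterov-type condition mentioned in the quote. All function names, the synthetic data, and the hyperparameter values are hypothetical and are not taken from the cited paper.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss_and_grad(w, X, y):
    """Mean logistic loss and its gradient with respect to w."""
    p = sigmoid(X @ w)
    eps = 1e-12  # guards the logarithms against log(0)
    loss = -np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))
    grad = X.T @ (p - y) / len(y)
    return loss, grad

def fisher(w, X):
    """Fisher matrix of the Bernoulli model: mean of p(1-p) x x^T."""
    p = sigmoid(X @ w)
    s = p * (1 - p)
    return (X * s[:, None]).T @ X / X.shape[0]

def ngd_momentum(X, y, lr=0.5, beta=0.5, tau=1e-3, steps=50):
    """Damped natural gradient descent with heavy-ball momentum (assumed form)."""
    w = np.zeros(X.shape[1])
    v = np.zeros_like(w)
    history = []
    for _ in range(steps):
        loss, g = loss_and_grad(w, X, y)
        history.append(loss)
        F = fisher(w, X) + tau * np.eye(len(w))  # tau is the damping parameter
        nat_g = np.linalg.solve(F, g)            # natural gradient: (F + tau I)^-1 g
        v = beta * v - lr * nat_g                # momentum accumulation
        w = w + v
    return w, history

# Synthetic, noisy binary classification data (illustrative only).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
w_true = np.array([1.5, -2.0, 0.5])
y = (rng.random(200) < sigmoid(X @ w_true)).astype(float)

w, history = ngd_momentum(X, y)
print(f"initial loss {history[0]:.4f}, final loss {history[-1]:.4f}")
```

Setting `F` to the identity matrix in this sketch would reduce it to ordinary gradient descent with momentum, which is the distinction the quote draws between its updates (1) and (2).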

“…The problem of finding a minimum is especially acute in machine learning, where the optimization of the loss function affects the final accuracy. To solve this problem, the gradient flow from [8] was proposed, which is the product of the metric tensor on a smooth manifold and the gradient of the function being optimized. This approach sped up the minimization of the loss function in neural networks, but this article uses manifolds of probability distributions instead of smooth ones.…”

Section: Introduction (unclassified)

A new approach to training neural networks using natural gradient descent with momentum based on Dirichlet distributions

Abdulkadirov, Lyakhov

2023

Computer Optics

In this paper, we propose a natural gradient descent algorithm with momentum based on Dirichlet distributions to speed up the training of neural networks. This approach takes into account not only the direction of the gradients, but also the convexity of the minimized function, which significantly accelerates the process of searching for the extremes. Calculations of natural gradients based on Dirichlet distributions are presented, with the proposed approach introduced into an error backpropagation scheme. The results of image recognition and time series forecasting during the experiments show that the proposed approach gives higher accuracy and does not require a large number of iterations to minimize loss functions compared to the methods of stochastic gradient descent, adaptive moment estimation and adaptive parameter-wise diagonal quasi-Newton method for nonconvex stochastic optimization.

“…For a model such as the fully connected feedforward DNN illustrated in Fig. 5, the rectified linear unit (ReLU)-style activation function is commonly used for the neurons in the hidden layers, rather than an S-shaped function such as the sigmoid function, in order to overcome gradient vanishing [35], [36]; in the context of DNNs, the problem of vanishing gradients can often arise for a model configured with S-shaped activation functions for the hidden neurons to be trained by a stochastic gradient-based weight optimizer such as Adam, which was first proposed in [37], for their most appropriate weight coefficients including biases.…”

Section: 2) Building DNNs for Time-Series Load Modeling and Forecasting (mentioning)

confidence: 99%
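
The vanishing-gradient effect described in the statement above can be illustrated with a small numerical sketch. It assumes a randomly initialized, bias-free fully connected network and compares how much gradient reaches the input through sigmoid versus ReLU hidden layers. The depth, width, initialization scale, and the function name `input_gradient_norm` are hypothetical choices for illustration and are not taken from the citing paper.

```python
import numpy as np

def input_gradient_norm(activation, depth=20, width=64, seed=0):
    """Norm of d(sum of final activations)/d(input) for a random MLP."""
    rng = np.random.default_rng(seed)
    h = rng.normal(size=width)
    layers = []
    # Forward pass, storing each weight matrix and local activation derivative.
    for _ in range(depth):
        W = rng.normal(size=(width, width)) * np.sqrt(2.0 / width)
        z = W @ h
        if activation == "sigmoid":
            h = 1.0 / (1.0 + np.exp(-z))
            local = h * (1.0 - h)          # sigmoid derivative, at most 0.25
        elif activation == "relu":
            h = np.maximum(z, 0.0)
            local = (z > 0).astype(float)  # ReLU derivative, 0 or 1
        else:
            raise ValueError(f"unknown activation: {activation}")
        layers.append((W, local))
    # Backward pass: apply the chain rule layer by layer, last to first.
    g = np.ones(width)
    for W, local in reversed(layers):
        g = W.T @ (g * local)
    return float(np.linalg.norm(g))

sigmoid_norm = input_gradient_norm("sigmoid")
relu_norm = input_gradient_norm("relu")
print(f"sigmoid: {sigmoid_norm:.3e}, relu: {relu_norm:.3e}")
```

Because the sigmoid derivative never exceeds 0.25, each additional layer tends to shrink the backpropagated signal, whereas active ReLU units pass it through unscaled.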

A Smart Home Energy Management System Utilizing Neurocomputing-Based Time-Series Load Modeling and Forecasting Facilitated by Energy Decomposition for Smart Home Automation

et al. 2022

The key advantage of using power-utility-owned smart meters is the ability to transmit electrical energy consumption data to power utilities' remote data centers for various purposes, such as billing. Several useful consumer-centric use cases can also be identified for the collection and further analysis of consumers' electrical energy consumption data from smart meters. One of the use cases is home automation. Recent related solutions for home automation involving home security and healthcare depend on the installation of sensors and/or other devices such as video cameras, which have high costs for installation and annual maintenance. Because the electrical energy consumption patterns mined from smart meter data are indicative of residents' daily life, it is possible to develop a new home automation approach based on energy decomposition for smart home automation. Accordingly, in this work, a smart home energy management system (SHEMS) utilizing a parallel-processing-implemented, GPU-accelerated neurocomputing-based time-series load modeling and forecasting mechanism is proposed for smart home automation. Energy decomposition is used to facilitate the time-series load modeling and forecasting mechanism, which tracks appliance-level electrical energy consumption to be quantitatively modeled from circuit-level consumption, with no intrusive deployment of networked plug-level power meters for individual electrical home appliances. For the neurocomputing approach applied in this mechanism, an autoregressive multilayer perceptron methodology is compared against a stacked long short-term memory methodology. The presented neurocomputing-based time-series load modeling and forecasting mechanism facilitated by energy decomposition is capable of predicting residents' daily behavioral patterns by nonintrusively analyzing and modeling relevant electrical home appliances based on their past trends for smart home automation.

“…DenseNet-201 models are built with several parallel layer skips that aid in the training of deeper network architectures to identify corn leaf diseases. DenseNet-201 encompasses a concatenate convolutional network that extricated it from other identifier algorithms, which upsurges variation in the input of subsequent layers and enriches efficiency (Noh, 2021). The layers between two adjacent blocks are implied to as transition layers or concatenate convolutional neural network layers.…”

Section: 60 (mentioning)

confidence: 99%

Identification of Corn Leaf Diseases Comprising of Blight, Grey Spot and Rust Using DenseNet-201

ENTUNI

Zulcaffle

2022

BJRST

Corn is a vital commodity in Malaysia because it is a key component of animal feed. The retention of the wholesome corn yield is essential to satisfy the rising demand. Like other plants, corn is susceptible to pathogens infection during the growing period. Manual observation of the diseases nevertheless takes time and requires a lot of work. The aim of this study was to propose an automatic approach to identify corn leaf diseases. The dataset used comprises of the images of diseased corn leaf comprising of blight, grey spot and rust as well as healthy corn leaf in YCbCr colour space representation. The DenseNet-201 algorithm was utilised in the proposed method of identifying corn leaf diseases. The training and validation analysis of distinctive epoch values of DenseNet-201 were also used to validate the proposed method, which resulted in significantly higher identification accuracy. DenseNet-201 succeeded 95.11% identification accuracy and it outperformed the prior identification methods such as ResNet-50, ResNet-101 and Bag of Features. The DenseNet-201 also has been validated to function as anticipated in identifying corn leaf diseases based on the algorithm validation assessment.
