Sangmin Ji scite author profile

Sangmin Ji

6 Publications

65 Citation Statements Received

97 Citation Statements Given


Affiliations

Beijing Institute of Technology, Chungnam National University

Publications


An Effective Optimization Method for Machine Learning Based on ADAM

Ahn

2020

Applied Sciences


A machine is taught by finding the minimum value of the cost function induced by the learning data. Unfortunately, as the amount of learning increases, the non-linear activation functions in the artificial neural network (ANN), the complexity of the artificial intelligence structures, and the non-convexity of the cost function all increase. A non-convex function has local minima, and the first derivative of the cost function is zero at a local minimum. Therefore, methods based on gradient descent stop updating once they fall into a local minimum, because they rely on the first derivative of the cost function. This paper introduces a novel optimization method to make machine learning more efficient. In other words, we construct an effective optimization method for non-convex cost functions. The proposed method addresses the problem of falling into a local minimum by adding the cost function to the parameter update rule of the ADAM method. We prove the convergence of the sequences generated by the proposed method and show its superiority through numerical comparison with gradient descent methods (GD, ADAM, and AdaMax).
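The local-minimum problem described in this abstract can be illustrated with a minimal sketch. It does not reproduce the paper's method; the one-dimensional cost function, learning rate, and starting points below are assumptions chosen only to show plain gradient descent stopping at whichever minimum is nearest.

```python
# Illustrative only: a hypothetical non-convex cost with two minima.
def cost(x):
    return x**4 - 3 * x**2 + x


def grad(x):
    # First derivative of the cost above.
    return 4 * x**3 - 6 * x + 1


def gradient_descent(x0, lr=0.01, steps=2000):
    """Plain gradient descent; it stops moving wherever grad(x) is zero."""
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)
    return x


x_right = gradient_descent(2.0)   # starts on the right, ends in the shallower minimum
x_left = gradient_descent(-2.0)   # starts on the left, ends in the deeper minimum

print("from  2.0:", x_right, "cost", cost(x_right))
print("from -2.0:", x_left, "cost", cost(x_left))
```

Both runs end at a point where the gradient vanishes, but only the run started on the left reaches the lower cost. The update rule has no information that would move the other run out of its basin.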


Analysis of Recurrent Neural Network and Predictions

Park

2020

Symmetry


This paper analyzes the operating principle and predicted values of the recurrent neural network (RNN) structure, the most basic neural-network structure suited to time-dependent data across various types of artificial intelligence (AI). In particular, an RNN in which all connections are symmetric is guaranteed to converge. The operating principle of an RNN is based on linear combinations of data composed with nonlinear activation functions. The linearly combined data are similar to the autoregressive moving average (ARMA) method of statistical processing. However, distortion due to the nonlinear activation function in RNNs causes the predicted value to differ from the ARMA prediction. From this, we obtain the limit of the predicted value of an RNN and the range of prediction, which changes according to the learning data. In addition to mathematical proofs, numerical experiments confirmed our claims.
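The contrast between a linear recursion and the same recursion passed through a nonlinear activation can be sketched as follows. This is a simplified scalar illustration, not the paper's model: the `tanh` activation, the weights `w` and `u`, and the input sequences are all assumptions.

```python
import math


def linear_recursion(inputs, w=0.5, u=1.0):
    """AR-style linear recursion: y_t = w * y_{t-1} + u * x_t."""
    y, out = 0.0, []
    for x in inputs:
        y = w * y + u * x
        out.append(y)
    return out


def rnn_recursion(inputs, w=0.5, u=1.0):
    """Same linear combination, then a tanh activation (assumed here)."""
    h, out = 0.0, []
    for x in inputs:
        h = math.tanh(w * h + u * x)
        out.append(h)
    return out


large_inputs = [0.5, 1.0, 2.0, 3.0]
small_inputs = [0.01, -0.02]

lin_large = linear_recursion(large_inputs)
rnn_large = rnn_recursion(large_inputs)
lin_small = linear_recursion(small_inputs)
rnn_small = rnn_recursion(small_inputs)

print("linear:", lin_large)
print("rnn   :", rnn_large)
```

For small inputs the two recursions nearly coincide because `tanh(z)` is close to `z` near zero. For large inputs the linear values keep growing while the activated values stay inside the range of `tanh`, which is one simple way a bound on the predicted value can arise.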


A Novel Learning Rate Schedule in Optimization for Neural Networks and It’s Convergence

Park

2020

Symmetry


The process of machine learning is to find parameters that minimize the cost function constructed from the learning data. This is called optimization, and the resulting parameters are called the optimal parameters of the neural network. In the search for the optimum, there have been attempts to solve a symmetric optimization problem or to initialize the parameters symmetrically. Furthermore, to obtain the optimal parameters, existing methods decrease the learning rate over the iterations or change it according to a fixed ratio. These schedules decrease monotonically at a constant rate with the iteration count. Our idea is to let the learning rate vary, unlike a monotonically decreasing schedule. We introduce a method for finding the optimal parameters that adaptively changes the learning rate according to the value of the cost function. Therefore, when the cost function is optimized, learning is complete and the optimal parameters are obtained. This paper proves that the method ensures convergence to the optimal parameters. This means that our method achieves a minimum of the cost function (that is, effective learning). Numerical experiments demonstrate that learning is effective when using the proposed learning rate schedule in various situations.
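The general idea of a learning rate that depends on the current cost value, rather than on the iteration count, can be sketched as below. The abstract does not state the paper's schedule, so the form `base_lr * cost / (1 + cost)`, the quadratic cost, and all constants are hypothetical choices made only for illustration.

```python
def cost(x):
    # Hypothetical cost with minimum value 0 at x = 3.
    return (x - 3.0) ** 2


def grad(x):
    return 2.0 * (x - 3.0)


def cost_based_lr(cost_value, base_lr=0.1):
    """Hypothetical schedule (an assumption, not the paper's formula):
    close to base_lr when the cost is large, approaching 0 as the cost does."""
    return base_lr * cost_value / (1.0 + cost_value)


x = 0.0
lr_history = []
for _ in range(5000):
    lr = cost_based_lr(cost(x))
    lr_history.append(lr)
    x -= lr * grad(x)

print("final x:", x)
print("first lr:", lr_history[0], "last lr:", lr_history[-1])
```

In this sketch the step size is tied to how far the cost is from zero, so the updates shrink on their own as the cost approaches its minimum. If the cost rose again, the same rule would raise the learning rate, which a fixed decay schedule cannot do.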


An Enhanced Optimization Scheme Based on Gradient Descent Methods for Machine Learning

2019

Symmetry


The learning process of machine learning consists of finding the values of unknown weights in a cost function by minimizing the cost function based on learning data. However, since the cost function is not convex, finding its minimum value is difficult. Existing methods for finding the minimum usually use the first derivative of the cost function. When a local minimum that is not a global minimum is reached, the first derivative of the cost function becomes zero, so these methods return the local minimum and the desired global minimum cannot be found. To overcome this problem, in this paper we modified one of the existing schemes, the adaptive momentum estimation scheme, by adding a new term that prevents the new optimizer from staying at a local minimum. The convergence condition for the proposed scheme and the convergence value are also analyzed, and further explained through several numerical experiments whose cost functions are non-convex.


An Adaptive Optimization Method Based on Learning Rate Schedule for Neural Networks

Park

2021

Applied Sciences


Artificial intelligence (AI) is achieved by optimizing the cost function constructed from learning data. Changing the parameters in the cost function is the AI learning process (or AI learning, for convenience). If AI learning is performed well, the value of the cost function is the global minimum. For learning to succeed, the parameters should stop changing once the cost function reaches the global minimum. One useful optimization method is the momentum method; however, the momentum method has difficulty stopping the parameters when the cost function reaches the global minimum (the non-stop problem). The proposed method is based on the momentum method. To solve the non-stop problem of the momentum method, we incorporate the value of the cost function into our method. As learning proceeds, this mechanism reduces the amount of change in the parameters according to the value of the cost function. We verified the method through a proof of convergence and numerical experiments comparing it with existing methods to ensure that the learning works well.
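The non-stop problem mentioned in this abstract can be seen in a minimal sketch of the classical (heavy-ball) momentum update. This shows only the problem, not the proposed remedy; the quadratic cost, `beta`, `lr`, and the starting point are assumptions.

```python
def grad(x):
    # Gradient of the hypothetical cost 0.5 * x**2, whose minimum is at x = 0.
    return x


def plain_gd(x0, lr=0.1, steps=30):
    x, history = x0, []
    for _ in range(steps):
        x -= lr * grad(x)
        history.append(x)
    return history


def momentum_gd(x0, lr=0.1, beta=0.9, steps=30):
    """Heavy-ball momentum: the velocity accumulates past gradients."""
    x, v, history = x0, 0.0, []
    for _ in range(steps):
        v = beta * v - lr * grad(x)
        x += v
        history.append(x)
    return history


gd_hist = plain_gd(1.0)
mom_hist = momentum_gd(1.0)

print("plain GD min x :", min(gd_hist))
print("momentum min x :", min(mom_hist))
```

With these settings plain gradient descent approaches the minimum from one side, while the momentum iterate crosses it: the accumulated velocity is still non-zero when the gradient vanishes, so the parameter keeps moving.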


A deep learning-based approach to time-coordination entry guidance for multiple hypersonic vehicles

Guo

Tang

et al. 2023

Aeronaut. J.


A multiple-vehicle time-coordination guidance technique based on deep learning is proposed to address the cooperative guidance problem in the entry phase of hypersonic gliding vehicles. A dual-parameter bank angle profile is used in longitudinal guidance to meet the requirements of time coordination. A vehicle trajectory database is constructed along with a deep neural network (DNN) structure devised to fulfill the error criteria, and the trained network is used to replace the conventional prediction approach. Moreover, an extended Kalman filter is constructed to detect changes in aerodynamic parameters in real time, and the aerodynamic parameters are fed into the DNN. The lateral guidance employs a logic for reversing the sign of the bank angle, which is based on a segmented heading angle error corridor. The final simulation results demonstrate that the DNN is capable of meeting the cooperative guidance requirements. The algorithm guides with high accuracy, has a fast response time, and does not need inter-munition communication, and it is capable of solving for guidance commands that satisfy flight requirements even when aerodynamic parameter perturbations occur.
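A bank-angle sign-reversal logic driven by a heading error corridor, as mentioned for the lateral channel, might look like the sketch below. The abstract gives no details, so the two velocity segments, the corridor half-widths, and the sign convention (positive heading error commands a negative bank sign) are all hypothetical.

```python
def corridor_half_width_deg(velocity_mps):
    """Hypothetical two-segment corridor: wider at high speed, narrower later."""
    if velocity_mps > 4000.0:
        return 15.0
    return 8.0


def bank_sign(prev_sign, heading_error_deg, velocity_mps):
    """Return +1 or -1 for the commanded bank angle sign.

    The sign is reversed only when the heading error leaves the corridor;
    inside the corridor the previous sign is kept.
    """
    half_width = corridor_half_width_deg(velocity_mps)
    if heading_error_deg > half_width:
        return -1  # assumed convention: steer back toward the target
    if heading_error_deg < -half_width:
        return +1
    return prev_sign


sign = +1
for error_deg, velocity in [(5.0, 5000.0), (16.0, 5000.0), (-3.0, 3500.0), (-9.0, 3500.0)]:
    sign = bank_sign(sign, error_deg, velocity)
    print(error_deg, velocity, sign)
```

Keeping the previous sign inside the corridor avoids chattering between left and right banks when the heading error hovers near zero.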

