Consider a discrete-time nonlinear system with random disturbances appearing in the real plant and the output channel where the randomly perturbed output is measurable. An iterative procedure based on the linear quadratic Gaussian optimal control model is developed for solving the optimal control of this stochastic system. The optimal state estimate provided by Kalman filtering theory and the optimal control law obtained from the linear quadratic regulator problem are then integrated into the dynamic integrated system optimisation and parameter estimation algorithm. The iterative solutions of the optimal control problem for the model obtained converge to the solution of the original optimal control problem of the discrete-time nonlinear system, despite model-reality differences, when the convergence is achieved. An illustrative example is solved using the method proposed. The results obtained show the effectiveness of the algorithm proposed.
A computational approach is proposed for solving the discrete time nonlinear stochastic optimal control problem. Our aim is to obtain the optimal output solution of the original optimal control problem through solving the simplified model-based optimal control problem iteratively. In our approach, the adjusted parameters are introduced into the model used such that the differences between the real system and the model used can be computed. Particularly, system optimization and parameter estimation are integrated interactively. On the other hand, the output is measured from the real plant and is fed back into the parameter estimation problem to establish a matching scheme. During the calculation procedure, the iterative solution is updated in order to approximate the true optimal solution of the original optimal control problem despite model-reality differences. For illustration, a wastewater treatment problem is studied and the results show the efficiency of the approach proposed.
In this paper, we propose an efficient algorithm for solving a nonlinear stochastic optimal control problem in discrete-time, where the true filtered solution of the original optimal control problem is obtained through solving a linear model-based optimal control problem with adjustable parameters iteratively. The adjustments of these parameters are based on the differences between the real plant and the linear model that are measured. The main feature of the algorithm proposed is the integration of system optimization and parameter estimation in an interactive way so that the correct filtered solution of the original optimal control problem is obtained when the convergence is achieved. For illustration, a nonlinear continuous stirred reactor tank problem is studied. The simulation results obtained demonstrate the efficiency of the algorithm proposed.
An iterative algorithm, which is called the integrated optimal control and parameter estimation algorithm, is developed for solving a discrete time nonlinear stochastic control problem. It is based on the integration of the principle of model-reality differences and Kalman filtering theory, where the dynamic integrated system optimization and parameter estimation algorithm are used interactively. In this approach, the weighted least-square output residual is included in the cost function by appropriately monitoring the weighted matrix. An improved linear quadratic Gaussian optimal control model, rather than the original optimal control problem, is solved. Subsequently, the model optimum is updated using the adjusted parameters induced by the differences between the real plant and the model used. These updated solutions converge to the true optimum, despite model-reality differences. For illustration, the optimal control of a nonlinear continuous stirred tank reactor problem is considered and solved by using the method proposed.
In this paper, the current variant technique of the stochastic gradient descent (SGD) approach, namely, the adaptive moment estimation (Adam) approach, is improved by adding the standard error in the updating rule. The aim is to fasten the convergence rate of the Adam algorithm. This improvement is termed as Adam with standard error (AdamSE) algorithm. On the other hand, the mean-variance portfolio optimization model is formulated from the historical data of the rate of return of the S&P 500 stock, 10-year Treasury bond, and money market. The application of SGD, Adam, adaptive moment estimation with maximum (AdaMax), Nesterov-accelerated adaptive moment estimation (Nadam), AMSGrad, and AdamSE algorithms to solve the mean-variance portfolio optimization problem is further investigated. During the calculation procedure, the iterative solution converges to the optimal portfolio solution. It is noticed that the AdamSE algorithm has the smallest iteration number. The results show that the rate of convergence of the Adam algorithm is significantly enhanced by using the AdamSE algorithm. In conclusion, the efficiency of the improved Adam algorithm using the standard error has been expressed. Furthermore, the applicability of SGD, Adam, AdaMax, Nadam, AMSGrad, and AdamSE algorithms in solving the mean-variance portfolio optimization problem is validated.
Output measurement for nonlinear optimal control problems is an interesting issue. Because the structure of the real plant is complex, the output channel could give a significant response corresponding to the real plant. In this paper, a least squares scheme, which is based on the Gauss-Newton algorithm, is proposed. The aim is to approximate the output that is measured from the real plant. In doing so, an appropriate output measurement from the model used is suggested. During the computation procedure, the control trajectory is updated iteratively by using the Gauss-Newton recursion scheme. Consequently, the output residual between the original output and the suggested output is minimized. Here, the linear model-based optimal control model is considered, so as the optimal control law is constructed. By feed backing the updated control trajectory into the dynamic system, the iterative solution of the model used could approximate to the correct optimal solution of the original optimal control problem, in spite of model-reality differences. For illustration, current converted and isothermal reaction rector problems are studied and the results are demonstrated. In conclusion, the efficiency of the approach proposed is highly presented.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.