Published: 2017
DOI: 10.2316/journal.206.2017.5.206-5112

Model-Free Multi-Kernel Learning Control for Nonlinear Discrete-Time Systems

Abstract: Reinforcement learning (RL) has become an important research topic for solving learning control problems of nonlinear dynamic systems. In RL, feature representation is a critical factor in improving the performance of online or offline learning controllers. Although multi-kernel learning has been studied in supervised learning problems, there is little work on multi-kernel-based feature representation in RL algorithms. In this paper, a model-free multi-kernel learning control (MMLC) approach is proposed for a cla…

Cited by 3 publications (3 citation statements)
References 17 publications
“…Following the same line with [18], setting $\partial J(s)/\partial u(s) = 0$, it is possible to compute the optimal control policy at time $k$ as a function of the next state $s^{+}$:
$$u(s) = -\frac{1}{2}\gamma R^{-1}\left(\frac{\partial s^{+}}{\partial u(s)}\right)^{T}\lambda(s^{+})$$
where $\lambda = \partial J/\partial s$ is the costate that can be obtained as follows:
$$\lambda(s) = \frac{\partial\left[r(s,u(s)) + \gamma J(s^{+})\right]}{\partial s} = 2Qs + \gamma\left(\frac{\partial s^{+}}{\partial s}\right)^{T}\lambda(s^{+})$$
As the system given by (1) is non-linear, it is difficult to analytically calculate $\lambda(s^{+})$, which needs to be used in (4). To solve this problem, the kernel-based least-squares iterative method has been widely used for policy and value evaluation in the framework of DHP.…”
Section: Problem Formulation (mentioning)
confidence: 99%
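
To make the kernel-based least-squares evaluation mentioned in this statement concrete, here is a minimal illustrative sketch in Python: the costate λ(s) is represented as a kernel expansion over sampled states and regressed onto the DHP targets 2Qs + γ(∂s⁺/∂s)ᵀλ(s⁺). The Gaussian kernel, the width, and the ridge regularization are assumptions for illustration, not the implementation of the cited paper.

import numpy as np

def gaussian_kernel(X, Y, width=1.0):
    # Gram matrix K[i, j] = exp(-||X_i - Y_j||^2 / (2 * width^2)).
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-d2 / (2.0 * width ** 2))

def fit_costate(S, lam_targets, width=1.0, reg=1e-6):
    # Least-squares fit of lambda(s) ~ K(s, S) @ W, where
    #   S           : (n, dim_s) array of sampled states
    #   lam_targets : (n, dim_s) DHP targets 2*Q*s + gamma*(ds+/ds)^T lambda(s+),
    #                 evaluated with the previous costate estimate (an assumption
    #                 of this sketch; the cited paper's update rule may differ).
    K = gaussian_kernel(S, S, width)
    # Ridge-regularized normal equations: (K + reg*I) W = lam_targets.
    W = np.linalg.solve(K + reg * np.eye(len(S)), lam_targets)
    return lambda s: gaussian_kernel(np.atleast_2d(s), S, width) @ W

In the iterative scheme, the targets are recomputed with the latest costate estimate and the fit is repeated until the expansion weights stop changing.
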
“…Compared with single-kernel designs, the adopted multi-kernel structure consists of a linear combination of weighted single-kernel functions; hence it is capable of reducing the complexity of kernel-width parameter tuning for feature representation, especially for high-dimensional and heterogeneous data samples. Furthermore, the multi-kernel-based feature representation is used for both the optimal policy and the value function approximation (VFA), which differs from the work in [18].…”
Section: Introduction (mentioning)
confidence: 99%
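
The multi-kernel structure described in this statement is, in essence, a weighted linear combination of single-kernel functions. A minimal sketch follows, assuming Gaussian kernels; the widths and weights are hypothetical placeholders, and in a multi-kernel learning setting the weights would typically be learned rather than fixed.

import numpy as np

def multi_kernel(x, y, widths=(0.5, 1.0, 2.0), weights=(0.3, 0.4, 0.3)):
    # k(x, y) = sum_i weights[i] * exp(-||x - y||^2 / (2 * widths[i]^2))
    d2 = np.sum((np.asarray(x, dtype=float) - np.asarray(y, dtype=float)) ** 2)
    return sum(w * np.exp(-d2 / (2.0 * s ** 2))
               for w, s in zip(weights, widths))

Because every candidate width contributes through its own kernel, tuning reduces to choosing the combination weights rather than searching for a single good width, which is the kind of complexity reduction the statement refers to.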