2020
DOI: 10.1016/j.jprocont.2020.02.003
|View full text |Cite
|
Sign up to set email alerts
|

A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
11
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 46 publications
(11 citation statements)
references
References 40 publications
0
11
0
Order By: Relevance
“…Other approaches utilize metalearning and apprenticeship learning to quickly adapt trained RL models to new processes [27,28]. In the model-based setting, Kim et al [29] incorporate deep neural networks (DNNs) as value function approximators into the globalized dual heuristic programming algorithm. Predictive models have also been augmented with popular DRL algorithms, such as DDPG or TD3, to improve the policy gradient estimation [30] Other approaches to RL-based control postulate a fixed control structure such as PID [31,32,33].…”
Section: Related Workmentioning
confidence: 99%
“…Other approaches utilize metalearning and apprenticeship learning to quickly adapt trained RL models to new processes [27,28]. In the model-based setting, Kim et al [29] incorporate deep neural networks (DNNs) as value function approximators into the globalized dual heuristic programming algorithm. Predictive models have also been augmented with popular DRL algorithms, such as DDPG or TD3, to improve the policy gradient estimation [30] Other approaches to RL-based control postulate a fixed control structure such as PID [31,32,33].…”
Section: Related Workmentioning
confidence: 99%
“…Action refers to the combination of action elements according to a certain order to form an action sequence of a certain type of movement [4,5]. Activity refers to a broader concept, representing the complex movement of the human body, which is closely related to the object and the surrounding environment [6]. Recognition of human behavior is usually combined with multifeature fusion, and its application prospects are very broad.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, there has been growing interest amongst the research community and industry in the development of reinforcement learning (RL) based control schemes [1]. This is underpinned by the ability of RL to naturally account for process stochasticity and handle nonlinear dynamics, and reflected by a growing literature that demonstrates application empirically [2,3,4]. All of these works rely on offline simulation of a process model, with results often validated on the same model that the RL policy was trained.…”
Section: Introductionmentioning
confidence: 99%