1991 American Control Conference 1991
DOI: 10.23919/acc.1991.4791776

Reinforcement Learning is Direct Adaptive Optimal Control

Citations: cited by 267 publications (356 citation statements)
References: 17 publications
“…More details can be found in [21]. A derivative of supervised learning is so-called reinforcement learning, which is based trail and error (and reward) [68] and has backings from psychology.…”
Section: Lateral Inhibition and Hebbian Learning (mentioning)
confidence: 99%
“…In contrast, reinforcement learning operates directly on measured data and rewards from interaction, and can also address cases which are analytically intractable using approximations and data-driven techniques. A concise treatment of reinforcement learning as "adaptive optimal control" is presented in [Sutton et al, 1991].…”
Section: Introduction (mentioning)
confidence: 99%
“…This will lead to an optimization-like problem which cannot be handled by conventional optimal control, e.g., linear quadratic regulator (LQR) [5], due to uncertain and nonlinear system dynamics. In the literature, reinforcement learning, also known as adaptive dynamic programming, has been extensively studied in the control community to address this issue [6,7].…”
Section: Introduction (mentioning)
confidence: 99%