2020
DOI: 10.1016/j.automatica.2020.108922
|View full text |Cite
|
Sign up to set email alerts
|

Data-based reinforcement learning approximate optimal control for an uncertain nonlinear system with control effectiveness faults

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
17
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 30 publications
(18 citation statements)
references
References 20 publications
1
17
0
Order By: Relevance
“…In contrast to (20), the version of the BE in ( 22) is a function of ∇B and therefore selecting the weight estimates to minimize (22) may not correspond to the minimization of the original BE in (20). That is, even if ( Ŵc , Ŵa ) → W , the BE in ( 22) may be large at certain points in the statespace because of the influence of the safeguarding component of ( 21), making ( 22) a non-ideal performance metric for learning.…”
Section: Safe Exploration Via Simulation Of Experiencementioning
confidence: 90%
See 4 more Smart Citations
“…In contrast to (20), the version of the BE in ( 22) is a function of ∇B and therefore selecting the weight estimates to minimize (22) may not correspond to the minimization of the original BE in (20). That is, even if ( Ŵc , Ŵa ) → W , the BE in ( 22) may be large at certain points in the statespace because of the influence of the safeguarding component of ( 21), making ( 22) a non-ideal performance metric for learning.…”
Section: Safe Exploration Via Simulation Of Experiencementioning
confidence: 90%
“…However, since a model of the system is known (or, as discussed in Sec. VI, an approximation of the model is known), the BE can be evaluated at any point in the statespace [28] using a different policy to generate data more representative of (20). To facilitate this approach, define the family of mappings {x i : χ × R ≥t0 → χ} N i=1 such that each x i (x(t), t) ∈ B l (x(t)) maps the current state x(t) to some unexplored point in B l (x(t)).…”
Section: Safe Exploration Via Simulation Of Experiencementioning
confidence: 99%
See 3 more Smart Citations