2010
DOI: 10.1016/j.automatica.2010.02.006
|View full text |Cite
|
Sign up to set email alerts
|

Approximate dynamic programming with a fuzzy parameterization

Abstract: Dynamic programming (DP) is a powerful paradigm for general, nonlinear optimal control. Computing exact DP solutions is in general only possible when the process states and the control actions take values in a small discrete set. In practice, it is necessary to approximate the solutions. Therefore, we propose an algorithm for approximate DP that relies on a fuzzy partition of the state space, and on a discretization of the action space. This fuzzy Q-iteration algorithm works for deterministic processes, under … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
38
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
4
3
3

Relationship

4
6

Authors

Journals

citations
Cited by 55 publications
(38 citation statements)
references
References 18 publications
0
38
0
Order By: Relevance
“…, 15π} Since an exact optimal solution for the inverted pendulum problem is not known, in order to approximate the regret (3), a near-optimal solution is computed instead. To this end, the fuzzy Q-iteration algorithm [7] is modified to work for the sparsely stochastic systems considered in this paper, and applied to the inverted pendulum using a very accurate approximator over the state space. Figure 3, top reports the (approximate) regret of the three algorithms, averaged over the set X 0 .…”
Section: Resultsmentioning
confidence: 99%
“…, 15π} Since an exact optimal solution for the inverted pendulum problem is not known, in order to approximate the regret (3), a near-optimal solution is computed instead. To this end, the fuzzy Q-iteration algorithm [7] is modified to work for the sparsely stochastic systems considered in this paper, and applied to the inverted pendulum using a very accurate approximator over the state space. Figure 3, top reports the (approximate) regret of the three algorithms, averaged over the set X 0 .…”
Section: Resultsmentioning
confidence: 99%
“…. , N. This value iteration is guaranteed to converge (Buşoniu et al, 2010) and terminates when the following condition is satisfied:…”
Section: Preliminariesmentioning
confidence: 99%
“…It can also be simply combined with fuzzy logic and provide the relationship between the states and the accessible action, which is the same as creating the fuzzy logic "if. ..then" engine [23][24][25].…”
Section: Introductionmentioning
confidence: 99%