2004
DOI: 10.1109/tsmcb.2004.825938
|View full text |Cite
|
Sign up to set email alerts
|

Online Tuning of Fuzzy Inference Systems Using Dynamic Fuzzy Q-Learning

Abstract: This paper presents a dynamic fuzzy Q-learning (DFQL) method that is capable of tuning fuzzy inference systems (FIS) online. A novel online self-organizing learning algorithm is developed so that structure and parameters identification are accomplished automatically and simultaneously based only on Q-learning. Self-organizing fuzzy inference is introduced to calculate actions and Q-functions so as to enable us to deal with continuous-valued states and actions. Fuzzy rules provide a natural mean of incorporatin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
90
0
2

Year Published

2005
2005
2019
2019

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 125 publications
(97 citation statements)
references
References 13 publications
0
90
0
2
Order By: Relevance
“…For example, Smith [16] presented a new model for representation and generalization in model-less RL based on the self-organizing map (SOM) and standard Q-learning. The adaptation of Watkins' Q-learning with fuzzy inference systems for problems with large state-action spaces or with continuous state spaces is also proposed [6], [17], [18], [19]. Many specific improvements are also implemented to modify related RL methods in practice [7], [9], [10], [20], [21], [22].…”
Section: Introductionmentioning
confidence: 99%
“…For example, Smith [16] presented a new model for representation and generalization in model-less RL based on the self-organizing map (SOM) and standard Q-learning. The adaptation of Watkins' Q-learning with fuzzy inference systems for problems with large state-action spaces or with continuous state spaces is also proposed [6], [17], [18], [19]. Many specific improvements are also implemented to modify related RL methods in practice [7], [9], [10], [20], [21], [22].…”
Section: Introductionmentioning
confidence: 99%
“…If the TD error is positive, it suggests that the quality of this action should be strengthened for future use, whereas if the TD error is negative, it suggests that the quality should weakened [4]. The learning rule by taking the eligibity traces is given by …”
Section: Qsmentioning
confidence: 99%
“…An important number of choices is given a priori, these choices are carried with empirical methods, and then the design of the FLC can prove to be long and delicate towards the important number of parameters to determine, and can lead then to a solution with poor performance [12]. With this subjective approach, it is difficult for a designer to examine complex systems to find the necessary number of rules, and to determine appropriate parameters of the rules for implementing the fuzzy controller [13]. Also, it isn't easy to design an optimized fuzzy controller.…”
mentioning
confidence: 99%
See 2 more Smart Citations