2011
DOI: 10.1002/acs.1249
|View full text |Cite
|
Sign up to set email alerts
|

Q(λ)-learning adaptive fuzzy logic controllers for pursuit-evasion differential games

Abstract: This paper addresses the problem of tuning the input and the output parameters of a fuzzy logic controller. A novel technique that combines Q( )-learning with function approximation (fuzzy inference system) is proposed. The system learns autonomously without supervision or a priori training data. The proposed technique is applied to three different pursuit-evasion differential games. The proposed technique is compared with the classical control strategy, Q( )-learning only, and the technique proposed by Dai et… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
96
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 32 publications
(96 citation statements)
references
References 41 publications
(76 reference statements)
0
96
0
Order By: Relevance
“…In [4], using velocity vectors of the robot relative to each obstacle, an online navigation method based on calculating the best feasible direction close to an optimal direction to the target is proposed for pursuing a moving target amidst dynamic and static obstacles. Adaptive learning control for pursuit-evasion were presented in [6], [7], and experiments on capturing a moving object using pure pursuit were shown in [8]. …”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In [4], using velocity vectors of the robot relative to each obstacle, an online navigation method based on calculating the best feasible direction close to an optimal direction to the target is proposed for pursuing a moving target amidst dynamic and static obstacles. Adaptive learning control for pursuit-evasion were presented in [6], [7], and experiments on capturing a moving object using pure pursuit were shown in [8]. …”
Section: Related Workmentioning
confidence: 99%
“…Applications of robotics have been applied to home services, health care and military missions such [3]- [5], etc. Developing various intelligence services, for example intelligent surveillance and patrol systems, is of emerging demand to support human society [6]- [7]. As an intelligent mechatronics system, the mobile robot needs to integrate algorithms related to environment sensing for obstacle detection and SLAM, behavior and route planning, controlling and executing [8].…”
Section: Introductionmentioning
confidence: 99%
“…In each episode, we let the robot move forward and backward in front of the obstacle in order to acquire rewards and react according to the control policy (10). The episode finishes (corresponding to k D k f / when the robot enters the shaded zone (shown in Figure 4), that is, the distance threshold th was reached.…”
Section: Learning From Interactions With the Environmentmentioning
confidence: 99%
“…Different from supervised learning, which is learning from input-output data provided by an expert, reinforcement learning is adequate for learning from interaction by using very simple evaluative or critic information instead of instructive information [1]. Reinforcement learning has been used by some authors as a mechanism in tuning and adaptation of the fuzzy logic controllers [3], [7], [12], [13], [22]. Some of the commonly used reinforcement learning algorithms estimate the value function of the state-action pairs, where the estimated value function shows how good it is for the learning agent to perform a given action in a given state.…”
Section: Introductionmentioning
confidence: 99%
“…This is because the Q-learning algorithm can only deal with learning environments that have discrete states and actions. Different algorithms that extend the Q-learning method Mostafa to deal with differential games by using fuzzy inference systems have been proposed in literature [3], [7], [12], [13], [15], [22]. One of these algorithms is the Q-learning fuzzy inference system (QLFIS) algorithm proposed in [7].…”
Section: Introductionmentioning
confidence: 99%