Q(λ)-learning adaptive fuzzy logic controllers for pursuit-evasion differential games

Desouky, Sameh F.; Schwartz, Howard M.

doi:10.1002/acs.1249

Cited by 32 publications

(96 citation statements)

References 41 publications

(76 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In [4], using velocity vectors of the robot relative to each obstacle, an online navigation method based on calculating the best feasible direction close to an optimal direction to the target is proposed for pursuing a moving target amidst dynamic and static obstacles. Adaptive learning control for pursuit-evasion were presented in [6], [7], and experiments on capturing a moving object using pure pursuit were shown in [8]. …”

Section: Related Workmentioning

confidence: 99%

“…Applications of robotics have been applied to home services, health care and military missions such [3]- [5], etc. Developing various intelligence services, for example intelligent surveillance and patrol systems, is of emerging demand to support human society [6]- [7]. As an intelligent mechatronics system, the mobile robot needs to integrate algorithms related to environment sensing for obstacle detection and SLAM, behavior and route planning, controlling and executing [8].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

An Intelligent Control System for Mobile Robot Navigation Tasks in Surveillance

Lin

et al. 2014

Robot Intelligence Technology and Applications 2

View full text Add to dashboard Cite

Abstract. In recent years, the autonomous mobile robot has found diverse applications such as home/health care system, surveillance system in civil and military applications and exhibition robot. For surveillance tasks such as moving target pursuit or following and patrol in a region using mobile robot, this paper presents a fuzzy Q-learning, as an intelligent control for cost-based navigation, for autonomous learning of suitable behaviors without the supervision or external human command. The Q-learning is used to select the appropriate rule of interval type-2 fuzzy rule base. The initial testing of the intelligent control is demonstrated by simulation as well as experiment of a simple wall-following based patrolling task of autonomous mobile robot.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

An Intelligent Control System for Mobile Robot Navigation Tasks in Surveillance

Lin

et al. 2014

Robot Intelligence Technology and Applications 2

View full text Add to dashboard Cite

show abstract

“…In each episode, we let the robot move forward and backward in front of the obstacle in order to acquire rewards and react according to the control policy (10). The episode finishes (corresponding to k D k f / when the robot enters the shaded zone (shown in Figure 4), that is, the distance threshold th was reached.…”

Section: Learning From Interactions With the Environmentmentioning

confidence: 99%

Heuristic dynamic programming using echo state network as online trainable adaptive critic

Koprinkova–Hristova

Oubbati

Palm

2012

Adaptive Control & Signal

View full text Add to dashboard Cite

The present paper proposes an implementation of a relatively new recurrent neural network architecturethe echo state network (ESN)-within the frame of heuristic dynamic programming. The ESN is trained online to estimate the utility function and to adapt the control policy of an embodied agent. With the advantage of an easy training algorithm, the ESN architecture offers a simple way to calculate the derivatives required for adapting the controller. Experimental results are provided to validate the proposed learning approach.

show abstract

“…Different from supervised learning, which is learning from input-output data provided by an expert, reinforcement learning is adequate for learning from interaction by using very simple evaluative or critic information instead of instructive information [1]. Reinforcement learning has been used by some authors as a mechanism in tuning and adaptation of the fuzzy logic controllers [3], [7], [12], [13], [22]. Some of the commonly used reinforcement learning algorithms estimate the value function of the state-action pairs, where the estimated value function shows how good it is for the learning agent to perform a given action in a given state.…”

Section: Introductionmentioning

confidence: 99%

“…This is because the Q-learning algorithm can only deal with learning environments that have discrete states and actions. Different algorithms that extend the Q-learning method Mostafa to deal with differential games by using fuzzy inference systems have been proposed in literature [3], [7], [12], [13], [15], [22]. One of these algorithms is the Q-learning fuzzy inference system (QLFIS) algorithm proposed in [7].…”

Section: Introductionmentioning

confidence: 99%

The residual gradient FACL algorithm for differential games

Awheda

Schwartz

2015

2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE)

Self Cite

View full text Add to dashboard Cite

A new fuzzy reinforcement learning algorithm that tunes the input and the output parameters of a fuzzy logic controller is proposed in this paper. The proposed algorithm uses three fuzzy inference systems (FISs); one is used as an actor (fuzzy logic controller, FLC), and the other two FISs are used as critics. The proposed algorithm uses the residual gradient value iteration algorithm described in [4] to tune the input and the output parameters of the actor (FLC) of the learning robot. The proposed algorithm also tunes the input and the output parameters of the critics. The proposed algorithm is called the residual gradient fuzzy actor critics learning (RGFACL) algorithm. The proposed algorithm is used to learn a single pursuit-evasion differential game. Simulation results show that the performance of the proposed RGFACL algorithm outperforms the performance of the fuzzy actor critic learning (FACL) and the Q-learning fuzzy inference system (QLFIS) algorithms proposed in [3] and [7], respectively, in terms of convergence and speed of learning.

show abstract

Q(λ)-learning adaptive fuzzy logic controllers for pursuit-evasion differential games

Cited by 32 publications

References 41 publications

An Intelligent Control System for Mobile Robot Navigation Tasks in Surveillance

An Intelligent Control System for Mobile Robot Navigation Tasks in Surveillance

Heuristic dynamic programming using echo state network as online trainable adaptive critic

The residual gradient FACL algorithm for differential games

Contact Info

Product

Resources

About