Proceedings of PerAc '94. From Perception to Action
DOI: 10.1109/fpa.1994.636137
|View full text |Cite
|
Sign up to set email alerts
|

Self-organizing map for reinforcement learning: obstacle-avoidance with Khepera

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 1 publication
0
4
0
Order By: Relevance
“…One solution to this problem relies on applying generalization techniques to states. Some systems have used decision trees (Chapman & Kaelbling, 1995), neural networks (Sehad & Touzet, 1994;Smith, 2002;Touzet, 1997;Wedel & Polani, 1996), and statistical clustering (Mahavedan & Connell, 1992). The main drawback of this solution is difficult to control ''perceptual aliasing'' problem due to over-generalization.…”
Section: Related Researchesmentioning
confidence: 97%
“…One solution to this problem relies on applying generalization techniques to states. Some systems have used decision trees (Chapman & Kaelbling, 1995), neural networks (Sehad & Touzet, 1994;Smith, 2002;Touzet, 1997;Wedel & Polani, 1996), and statistical clustering (Mahavedan & Connell, 1992). The main drawback of this solution is difficult to control ''perceptual aliasing'' problem due to over-generalization.…”
Section: Related Researchesmentioning
confidence: 97%
“…Many researchers have extended the Q-learning structure to solve continuous state and action problems. Lin developed a Q-learning structure based on a neural network [7], Saito and Fukuda proposed a Q-learning structure based on a cerebellar model articulation controller [8], and Sehad and Touzet applied a self-organizing map (SOM) to the Q-learning structure [9]. Notably, all these algorithms still have different shortcomings.…”
Section: Introductionmentioning
confidence: 99%
“…Multilayer perceptron implementations of the Q-learning have been proposed early [3], due to the interest of the restricted memory need and the generalization capability [4]. Self-organizing map implementation of the Q-learning is more recent [5]. We propose to study the use and discuss the interest of this implementation comparing to a multilayer perceptron implementation or more classical ones.…”
mentioning
confidence: 99%