2007
DOI: 10.1002/ecjb.20383

An adjustment method of the number of states on Q‐learning segmenting state space adaptively

Abstract: The results of imposing limitations on the number of states and of promoting the splitting of states in Q-learning are presented. Q-learning is a common reinforcement learning method in which the learning agent autonomously segments the environment states. In situations where the designer of an agent is unable to explicitly provide the agent with the boundaries of states in the environment in which the agent is acting, the agent needs to simultaneously learn while autonomously determining the internal d…
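The abstract describes Q-learning in which the agent segments a continuous state space on its own while the total number of states stays bounded. As a rough illustration only, and not the authors' algorithm, the Python sketch below pairs a standard tabular Q-learning update with a hypothetical StateMapper that assigns each percept vector to its nearest stored prototype and creates a new state only when the percept is far from every prototype and the cap max_states has not been reached; the class names, the distance threshold split_dist, and the default hyperparameters are all assumptions.

```python
import numpy as np

class StateMapper:
    """Maps continuous percept vectors to discrete state indices.

    Illustrative only: a new prototype (state) is created when a percept
    lies farther than `split_dist` from every existing prototype, and only
    while fewer than `max_states` prototypes exist.
    """
    def __init__(self, max_states, split_dist):
        self.max_states = max_states   # hard limit on the number of states
        self.split_dist = split_dist   # distance that triggers a split
        self.prototypes = []           # one prototype vector per state

    def state_of(self, percept):
        percept = np.asarray(percept, dtype=float)
        if not self.prototypes:
            self.prototypes.append(percept)
            return 0
        dists = [np.linalg.norm(percept - p) for p in self.prototypes]
        nearest = int(np.argmin(dists))
        if dists[nearest] > self.split_dist and len(self.prototypes) < self.max_states:
            self.prototypes.append(percept)   # split: allocate a new state
            return len(self.prototypes) - 1
        return nearest

class QLearner:
    """Plain tabular Q-learning over the indices produced by StateMapper."""
    def __init__(self, n_actions, max_states, alpha=0.1, gamma=0.95, epsilon=0.1):
        self.q = np.zeros((max_states, n_actions))
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.n_actions = n_actions

    def act(self, s, rng):
        # epsilon-greedy action selection
        if rng.random() < self.epsilon:
            return int(rng.integers(self.n_actions))
        return int(np.argmax(self.q[s]))

    def update(self, s, a, r, s_next):
        # standard one-step Q-learning update
        target = r + self.gamma * np.max(self.q[s_next])
        self.q[s, a] += self.alpha * (target - self.q[s, a])
```

A caller would map each percept with s = mapper.state_of(percept), choose an action with learner.act(s, rng) where rng = np.random.default_rng(), and call learner.update(s, a, r, s_next) after observing the reward and the next mapped state.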

Cited by 7 publications (10 citation statements)
References 7 publications (7 reference statements)
“…To avoid the curse of dimensionality, modular hierarchical learning [10,11,16], which constructs the learning model as a combination of subspaces, has been proposed. Adaptive segmentation [12,13], which constructs a learning model that validly corresponds to the environment, has also been studied. However, more effective techniques based on different approaches are still necessary in order to apply reinforcement learning to problems of realistic size.…”
Section: Introduction
confidence: 99%
“…To avoid the curse of dimensionality, modular hierarchical learning [10,11], which constructs the learning model as a combination of subspaces, has been proposed. Adaptive segmentation [12,13], which constructs a learning model that validly corresponds to the environment, has also been studied. However, more effective techniques based on different approaches are still necessary in order to apply reinforcement learning to problems of realistic size.…”
Section: Introduction
confidence: 99%
“…QLASS (Q-Learning with Adaptive State Segmentation) is an on-line categorization method [8,24]. QLASS categorizes percept vectors on the basis of a Voronoi diagram, where each Voronoi cell corresponds to a category.…”
Section: Introduction
confidence: 99%
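The quoted statement says that QLASS categorizes percept vectors by the Voronoi cell they fall into, with each cell generated by a category center. Under Euclidean distance, cell membership reduces to nearest-neighbor assignment to the centers; the short sketch below shows only that geometric step and is not the published QLASS implementation (the center list and the metric are assumptions).

```python
import numpy as np

def voronoi_category(percept, centers):
    """Return the index of the Voronoi cell containing `percept`.

    With Euclidean distance, membership in a Voronoi cell is simply
    nearest-neighbor assignment to the cell's generating center.
    """
    percept = np.asarray(percept, dtype=float)
    dists = np.linalg.norm(np.asarray(centers, dtype=float) - percept, axis=1)
    return int(np.argmin(dists))

# Example: three category centers in a 2-D percept space
centers = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
print(voronoi_category((0.2, 0.9), centers))  # -> 2, nearest to (0.0, 1.0)
```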
“…Since each category is treated as a state in reinforcement learning, generating too many categories deteriorates the performance of reinforcement learning. In [8], Hamagami and his group proposed some heuristics to reduce the number of categories. In their method, however, the maximum number of categories must be specified, although it is difficult to decide its optimal value.…”
Section: Introduction
confidence: 99%
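The quoted statement notes that the heuristics in [8] bound the number of categories by a user-specified maximum. Purely as an illustration of one way such a cap could be enforced, and not as a reproduction of those heuristics, the sketch below merges the two closest category centers into their midpoint whenever the cap is exceeded; the function name and the merge rule are assumptions.

```python
import numpy as np

def merge_closest(centers, max_categories):
    """Reduce the center list to at most `max_categories` entries.

    Illustrative stand-in for a category-reduction heuristic: the two
    closest centers are repeatedly replaced by their midpoint until the
    cap is satisfied.
    """
    centers = [np.asarray(c, dtype=float) for c in centers]
    while len(centers) > max_categories:
        best, best_d = (0, 1), np.inf
        for i in range(len(centers)):
            for j in range(i + 1, len(centers)):
                d = np.linalg.norm(centers[i] - centers[j])
                if d < best_d:
                    best_d, best = d, (i, j)
        i, j = best
        merged = (centers[i] + centers[j]) / 2.0   # replace the pair by its midpoint
        centers = [c for k, c in enumerate(centers) if k not in (i, j)]
        centers.append(merged)
    return centers
```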