2012
DOI: 10.1007/978-3-642-27645-3_18
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning in Robotics: A Survey

Abstract: Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this article, we attempt to strengthen the links between the two research communities by providing a survey of work in rei… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
265
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 265 publications
(274 citation statements)
references
References 80 publications
1
265
0
Order By: Relevance
“…A sequence of raw frames is used as input to the network and trial and error is used to learn a policy. Trial and error methods such as reinforcement learning have been extensively used to learn policies for intelligent agents [17]. However, providing demonstrations of correct behavior can greatly expedite the learning rate.…”
Section: Deep Learningmentioning
confidence: 99%
“…A sequence of raw frames is used as input to the network and trial and error is used to learn a policy. Trial and error methods such as reinforcement learning have been extensively used to learn policies for intelligent agents [17]. However, providing demonstrations of correct behavior can greatly expedite the learning rate.…”
Section: Deep Learningmentioning
confidence: 99%
“…Reinforcement learning is a promising approach to deal with control of physical robot with ever increasing complexity of hardware [22], [23] through experience and observations. Q-learning algorithm is a popular model-free reinforcement learning that have been demonstrated to give good results for some instances of robot tasks over the years.…”
Section: Q-learning Algorithmmentioning
confidence: 99%
“…It has been widely applied to the design of robot speed and orientation steering controller because of the following reasons: 1) Control rules are more flexible, thus it can simplify the complex system; 2) The controller can emulate the human decision making; 3) It does not need a detailed model of the plant, and it replaces the mathematical values in describing control system by using the linguistic ambiguous labels for designing robust controllers. On the other hand, reinforcement learning, in particular Q-learning, shows good learning results in designing control input for performing constrained tasks by robots without knowing the system dynamics [22], [23]. The approaches of combining type-1 fuzzy logic and Q-learning for optimization of the consequence parts of fuzzy rules are promising due to the ease of implementation on mobile robot navigation [12]- [17] in which Q value is a cost for each navigation behavior.…”
Section: Introductionmentioning
confidence: 99%
“…Many HRL methods have been proposed in order to reduce the complexity of the task [1]- [4]. The HAM framework [5] can learn complex hierarchical sub-routines.…”
Section: Introductionmentioning
confidence: 99%