2014
DOI: 10.3389/fnbot.2013.00025
|View full text |Cite
|
Sign up to set email alerts
|

Curiosity driven reinforcement learning for motion planning on humanoids

Abstract: Most previous work on artificial curiosity (AC) and intrinsic motivation focuses on basic concepts and theory. Experimental results are generally limited to toy scenarios, such as navigation in a simulated maze, or control of a simple mechanical system with one or two degrees of freedom. To study AC in a more realistic setting, we embody a curious agent in the complex iCub humanoid robot. Our novel reinforcement learning (RL) framework consists of a state-of-the-art, low-level, reactive control layer, which co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
60
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 66 publications
(60 citation statements)
references
References 43 publications
0
60
0
Order By: Relevance
“…This way we are able to send arbitrary motions to our system, while ensuring the safety of our robot. Even with just these static objects, this has been shown to provide an interesting way to learn robot reaching behaviors through reinforcement (Pathak et al, 2013;Frank et al, 2014). The presented system has the same functionality also for arbitrary, non-static objects.…”
Section: Example: Reaching While Avoiding a Moving Obstaclementioning
confidence: 90%
“…This way we are able to send arbitrary motions to our system, while ensuring the safety of our robot. Even with just these static objects, this has been shown to provide an interesting way to learn robot reaching behaviors through reinforcement (Pathak et al, 2013;Frank et al, 2014). The presented system has the same functionality also for arbitrary, non-static objects.…”
Section: Example: Reaching While Avoiding a Moving Obstaclementioning
confidence: 90%
“…So, limited applications to all body movements (1) Temporal Awareness Model/Timing Model; Olsen and Goodrich [122] (2) Human-Robot Team (HRT) Modelling or Tele-Operated Multiple Robot Model Burke et al [14] (3) Robot Awareness Model Kanda et al [80,82] (4) Model of Integrated Humans' shared Intentions, e.g. Haptic Channel, Motion Planning Model through Play Interactions, and the like Steinfeld et al [158] Bethel et al [8] Bethel and Murphy [5] Mutlu et al [119] Cooney et al [25,26] Frank et al [39] Pateraki et al [126] [118] Ease of Classification (EOC) Formula Scoring method for evaluating the Societal Acceptance towards robot…”
Section: Strengths Of This Primary Evaluation Methodology Weaknesses mentioning
confidence: 99%
“…Kanda et al [83] (4) Task performance metrics [5,8,14,119] [122, 126,158] (may incorporate the subevaluation methodologies of comparisons of the measurement of body movement interaction in between a humanoid robot and humans with subjective evaluation results [25] [25,26,39,82] [83])…”
Section: Strengths Of This Primary Evaluation Methodology Weaknesses mentioning
confidence: 99%
See 1 more Smart Citation
“…Curiosity-based intrinsic motivation systems are increasingly used in the development of autonomous computational systems [5], and a growing body of computational and robotic work explores the reward mechanisms which subserve a range of cognitive functions and behaviors, for example low-level perceptual encoding [6], novelty detection [7], reaching [8] and motion planning [9]. Various computational implementations of intrinsic motivation have been proposed, such as the drive to increase the ability to predict outcomes e.g., [10], a "creative" drive to find systematicity in input [11], [12], or competence-based systems which seek out maximally-or minimally difficult tasks (for an in-depth review, see [13]).…”
Section: Introductionmentioning
confidence: 99%