2017
DOI: 10.3389/fnbot.2017.00010
|View full text |Cite
|
Sign up to set email alerts
|

Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals

Abstract: Both nociception and punishment signals have been used in robotics. However, the potential for using these negatively valenced types of reinforcement learning signals for robot learning has not been exploited in detail yet. Nociceptive signals are primarily used as triggers of preprogrammed action sequences. Punishment signals are typically disembodied, i.e., with no or little relation to the agent-intrinsic limitations, and they are often used to impose behavioral constraints. Here, we provide an alternative … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 15 publications
(13 citation statements)
references
References 37 publications
0
8
0
Order By: Relevance
“…DEVELOPMENT Besides intrinsic motivations, some works have highlighted the relevance of other mechanisms to sensorimotor explo-ration. For instance, proprioception is mentioned in [11,28] and nociception in [29]. Proprioception endows agents with a sense of their own movements.…”
Section: The Role Of Somesthetic Senses In Vocalmentioning
confidence: 99%
“…DEVELOPMENT Besides intrinsic motivations, some works have highlighted the relevance of other mechanisms to sensorimotor explo-ration. For instance, proprioception is mentioned in [11,28] and nociception in [29]. Proprioception endows agents with a sense of their own movements.…”
Section: The Role Of Somesthetic Senses In Vocalmentioning
confidence: 99%
“…They also o®er the possibility of combining some sort of random search while utilizing the results of previous evaluations. Miikkulainen et al, 25 Navarro-Guerrero et al, 26,27 Real et al, 28 and Xie et al 29 apply evolutionary algorithms to optimize a subset of hyperparameters for neural networks, while Lorenzo et al 30 use a particle swarm optimization algorithm.…”
Section: Related Workmentioning
confidence: 99%
“…I would add (4) Value systems; extracting saliency from the environment and responding appropriately (Friston et al, 1994 ; Krichmar, 2008 ), and (5) Prediction; using past experience to be more successful in the future (Clark, 2013 ). In the area of value systems, models of neuromodulation have been used to simulate value prediction and drive action selection (Sporns and Alexander, 2002 ; Cox and Krichmar, 2009 ; Vargas et al, 2009 ; Krichmar, 2013 ; Navarro-Guerrero et al, 2017 ). Predictive coding strategies using hierarchical Bayesian systems and recurrent neural networks have been used for robots to develop internal models that predict movement of object and of other robots (Park et al, 2012 ; Murata et al, 2017 ).…”
Section: Future Outlookmentioning
confidence: 99%