Teachable robots: Understanding human teaching behavior to build more effective robot learners

Thomaz, Andrea L.; Breazeal, Cynthia

doi:10.1016/j.artint.2007.09.009

Cited by 304 publications

(248 citation statements)

References 23 publications

(27 reference statements)

Citation types: Supporting, Mentioning (240), Contrasting, Unclassified

“…The human's feedback is the only source of feedback or evaluation that the agent receives. However, TAMER and other methods for learning from human reward can be useful even when other evaluative information is available, as has been shown previously [21,5,17,11]. The TAMER algorithm described below has additionally been extended to learn in continuous action spaces through an actor-critic algorithm [22] and to provide additional information to the trainer (either action confidence or summaries of past performance), creating changes in the quantity of reward instances given and in learned performance [14]. Motivation and philosophy of TAMER: The TAMER framework is designed around two insights.…”

Section: Background on TAMER (mentioning)

confidence: 99%

“…Accordingly, other algorithms for learning from human reward [4,21,20,16,18,13] do not directly account for delay, do not model human reward explicitly, and are not fully myopic (i.e., they employ discount factors greater than 0).…”

Section: Background on TAMER (mentioning)

confidence: 99%

“…Though a few past projects have considered this problem of learning from human reward [4,21,20,16,18,13,9], only two of these implemented their solution for a robotic agent. In one such project [13], the agent learned partially in simulation and from hardcoded reward, demonstrations, and human reward.…”

Section: Introduction (mentioning)

confidence: 99%

Training a Robot via Human Feedback: A Case Study

2013

Self Cite

Abstract. We present a case study of applying a framework for learning from numeric human feedback, TAMER, to a physically embodied robot. In doing so, we also provide the first demonstration of the ability to train multiple behaviors by such feedback without algorithmic modifications and of a robot learning from free-form human-generated feedback without any further guidance or evaluative feedback. We describe transparency challenges specific to a physically embodied robot learning from human feedback and adjustments that address these challenges.

“…Our HRI studies with an interactive RL agent revealed that people use the reward signal not only to provide feedback on past actions (what is commonly assumed in the design of RL algorithms) but also to guide future action (Thomaz & Breazeal 2008). Further, we discovered a strong bias of positive over negative feedback over the entire duration of the training, even in the beginning when the agent was doing many things wrong (Thomaz & Breazeal 2008). This suggests that people were using the feedback channel to motivate and encourage the robot.…”

Section: (D) Challenge (mentioning)

confidence: 99%

“…In a series of human participant studies where human teachers guide a robot to perform a simple task (learning to operate a control panel with a lever, toggle and button), we have found that humans readily coordinate their teaching behaviour with the robot's gaze behaviour: waiting until the robot re-establishes eye contact before offering their next guidance cue, adaptively re-orienting their guidance cue to be in alignment with the robot's current visual focus, actively trying to re-direct the robot's gaze through deictic cues or offering more guidance if the robot's gaze behaviour conveys uncertainty in what to do next (e.g. looking back and forth among several possible alternatives) (Breazeal & Thomaz 2008a; Thomaz & Breazeal 2008). These findings suggest that people read the robot's gaze as an indicator of its internal state of attention as well as solicitations for help, and intuitively coordinate their teaching acts to support the robot's learning process.…”

Section: Expression in Social Robots (mentioning)

confidence: 99%

Role of expressive behaviour for robots that learn from people

Breazeal

2009

Phil. Trans. R. Soc. B

Self Cite

Robotics has traditionally focused on developing intelligent machines that can manipulate and interact with objects. The promise of personal robots, however, challenges researchers to develop socially intelligent robots that can collaborate with people to do things. In the future, robots are envisioned to assist people with a wide range of activities such as domestic chores, helping elders to live independently longer, serving a therapeutic role to help children with autism, assisting people undergoing physical rehabilitation and much more. Many of these activities shall require robots to learn new tasks, skills and individual preferences while 'on the job' from people with little expertise in the underlying technology. This paper identifies four key challenges in developing social robots that can learn from natural interpersonal interaction. The author highlights the important role that expressive behaviour plays in this process, drawing on examples from the past 8 years of her research group, the Personal Robots Group at the MIT Media Lab.

Machine Behaviour (Originally Published 2019 by Springer Nature)

Rahwan, Cebrián, Obradovich, et al.

2022

Machine Learning and the City

Self Cite

Machines powered by Artificial Intelligence (AI) increasingly mediate our social, cultural, economic, and political interactions. Understanding the behaviour of AI systems is essential to our ability to control their actions, reap their benefits, and minimize their harms. We argue this necessitates a broad scientific research agenda to study machine behaviour that incorporates but expands beyond the discipline of computer science and requires insights from across the sciences. Here we first outline a set of questions fundamental to this emerging field. We then explore the technical, legal, and institutional constraints facing the study of machine behaviour.
