Facial feedback for reinforcement learning: a case study and offline analysis using the TAMER framework

Li, Guangliang; Dibeklioğlu, Hamdi; Whiteson, Shimon; Hung, Hayley

doi:10.1007/s10458-020-09447-w

Cited by 19 publications

(12 citation statements)

References 43 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Li et al trained a mapping model that can map implicit emotions to different explicit feedback data. Facial expressions were marked with different types of feedback in advance, such as 1 for "happy", 0 or -1 for "sadness" [40]. Based on this work, Gadanho introduced a facial feedback reinforcement learning method, which is based on an emotion recognition system.…”

Section: Implicit Interactive Methodsmentioning

confidence: 99%

A Survey of Collaborative Reinforcement Learning: Interactive Methods and Design Patterns

Shi

Cristea

et al. 2021

Designing Interactive Systems Conference 2021

View full text Add to dashboard Cite

The full-text may be used and/or reproduced, and given to third parties in any format or medium, without prior permission or charge, for personal research or study, educational, or not-for-prot purposes provided that:• a full bibliographic reference is made to the original source • a link is made to the metadata record in DRO • the full-text is not changed in any way The full-text must not be sold in any format or medium without the formal permission of the copyright holders.

show abstract

Section: Implicit Interactive Methodsmentioning

confidence: 99%

A Survey of Collaborative Reinforcement Learning: Interactive Methods and Design Patterns

Shi

Cristea

et al. 2021

Designing Interactive Systems Conference 2021

View full text Add to dashboard Cite

show abstract

“…Li et al trained a prediction model mapping the facial feedback to explicit keypress feedback with collected data. Their simulated experiment showed that with enough recognition accuracy, agents can learn a comparative performance from solely facial feedback compared to learning from explicit keypress feedback [50].…”

Section: ) Natural Interactionmentioning

confidence: 99%

“…In addition, these natural interactive feedback can even be combined with hardware delivered feedback to train agents. For example, Li et al mapped the facial expressions to explicit keystroke feedback and proposed to allow an agent to learn from both the predicted facial feedback and keystroke feedback [50].…”

Section: B Multimodal Sensory Feedbackmentioning

confidence: 99%

A Review on Interactive Reinforcement Learning From Human Social Feedback

Lin

Gómez

et al. 2020

IEEE Access

Self Cite

View full text Add to dashboard Cite

Reinforcement learning agent learns how to perform a task by interacting with the environment. The use of reinforcement learning in real-life applications has been limited because of the sample efficiency problem. Interactive reinforcement learning has been developed to speed up the agent's learning and facilitate to learn from ordinary people by allowing them to provide social feedback, e.g, evaluative feedback, advice or instruction. Inspired by real-life biological learning scenarios, there could be many ways to provide feedback for agent learning, such as via hardware delivered, natural interaction like facial expressions, speech or gestures. The agent can even learn from feedback via unimodal or multimodal sensory input. This paper reviews methods for interactive reinforcement learning agent to learn from human social feedback and the ways of delivering feedback. Finally, we discuss some open problems and possible future research directions.

show abstract

“…Our work relates closely to the growing literature of interactive reinforcement learning (RL), or humancentered RL [2,21,22,23,24,25,26,27,28,29] , in which agents learn from interactions with humans in addition to, or instead of, predefined environmental rewards. In the EMPATHIC framework, we use the term implicit human feedback to refer to any multi-modal evaluative signals humans naturally emit during social interactions, including facial expressions, tone of voice, head gestures, hand gestures and other body-language and vocalization modalities not aimed at explicit communication.…”

Section: Related Workmentioning

confidence: 99%

The EMPATHIC Framework for Task Learning from Implicit Human Feedback

Cui¹,

Zhang²,

Allievi³

et al. 2020

Preprint

View full text Add to dashboard Cite

Reactions such as gestures, facial expressions, and vocalizations are an abundant, naturally occurring channel of information that humans provide during interactions. A robot or other agent could leverage an understanding of such implicit human feedback to improve its task performance at no cost to the human. This approach contrasts with common agent teaching methods based on demonstrations, critiques, or other guidance that need to be attentively and intentionally provided. In this paper, we first define the general problem of learning from implicit human feedback and then propose to address this problem through a novel data-driven framework, EMPATHIC. This two-stage method consists of (1) mapping implicit human feedback to relevant task statistics such as rewards, optimality, and advantage; and (2) using such a mapping to learn a task. We instantiate the first stage and three second-stage evaluations of the learned mapping. To do so, we collect a dataset of human facial reactions while participants observe an agent execute a sub-optimal policy for a prescribed training task. We train a deep neural network on this data and demonstrate its ability to (1) infer relative reward ranking of events in the training task from prerecorded human facial reactions; (2) improve the policy of an agent in the training task using live human facial reactions; and (3) transfer to a novel domain in which it evaluates robot manipulation trajectories.

show abstract

Facial feedback for reinforcement learning: a case study and offline analysis using the TAMER framework

Cited by 19 publications

References 43 publications

A Survey of Collaborative Reinforcement Learning: Interactive Methods and Design Patterns

A Survey of Collaborative Reinforcement Learning: Interactive Methods and Design Patterns

A Review on Interactive Reinforcement Learning From Human Social Feedback

The EMPATHIC Framework for Task Learning from Implicit Human Feedback

Contact Info

Product

Resources

About