“…Our work relates closely to the growing literature of interactive reinforcement learning (RL), or human-centered RL [2,21,22,23,24,25,26,27,28,29], in which agents learn from interactions with humans in addition to, or instead of, predefined environmental rewards. In the EMPATHIC framework, we use the term implicit human feedback to refer to any multi-modal evaluative signals humans naturally emit during social interactions, including facial expressions, tone of voice, head gestures, hand gestures and other body-language and vocalization modalities not aimed at explicit communication.…”