Machine Teaching for Human Inverse Reinforcement Learning

Lee, Michael S.; Admoni, Henny; Simmons, Reid

doi:10.3389/frobt.2021.693050

Cited by 8 publications

(11 citation statements)

References 36 publications

“…IRL) but also their beliefs and subsequently what counterfactuals they would consider. We thus extend our previous work [13] to evaluate a demonstration's informativeness based on counterfactuals generated via potential reward functions on the human's mind as opposed to counterfactuals generated via one-action deviations, and scaffold by showing demonstrations of increasing informativeness.…”

Section: Proposed Techniques For Teaching Humans (mentioning)

confidence: 81%

“…Brown and Niekum [12] proposed the Set Cover Optimal Teaching (SCOT) algorithm for selecting demonstrations that provide the tightest constraints on a target reward function for a pure IRL learner. However, human learning is more multifaceted and our prior work [13] tailored SCOT for humans by incorporating human learning techniques and concepts such as scaffolding. Our initial method of scaffolding via IRL did not yield significant learning gains, which we aim to improve in this work by incorporating counterfactuals that are based on the human's beliefs regarding the robot's reward function.…”

Section: Related Work (mentioning)

confidence: 99%

“…We test learners accordingly by asking them to predict the robot's optimal behavior in unseen instances of a domain. In our prior work [13], we showed that informativeness of a demonstration during teaching could simply be inverted to measure the expected difficulty of correctly predicting it during testing. In this work, we propose to update the difficulty measure by explicitly conditioning on the learner's beliefs of the robot's reward function.…”

Section: Related Work (mentioning)

confidence: 99%

“…Generalizing to higher dimensions, we can […] Testing: The area of a demonstration's BEC intuitively correlates with its informativeness during teaching as smaller areas indicate less uncertainty regarding w*. Our prior work [13] showed that a demonstration's BEC area may also be inverted to measure the difficulty of correctly predicting the demonstration as a test if the human has not seen it before (e.g. so that a smaller BEC area indicates a difficult test).…”

Section: Proposed Techniques For Teaching Humans (mentioning)

confidence: 99%

Reasoning about Counterfactuals to Improve Human Inverse Reinforcement Learning

Lee, Admoni, Simmons

2022

Preprint

Self Cite

To collaborate well with robots, we must be able to understand their decision making. Humans naturally infer other agents' beliefs and desires by reasoning about their observable behavior in a way that resembles inverse reinforcement learning (IRL). Thus, robots can convey their beliefs and desires by providing demonstrations that are informative for a human's IRL. An informative demonstration is one that differs strongly from the learner's expectations of what the robot will do given their current understanding of the robot's decision making. However, standard IRL does not model the learner's existing expectations, and thus cannot do this counterfactual reasoning. We propose to incorporate the learner's current understanding of the robot's decision making into our model of human IRL, so that our robot can select demonstrations that maximize the human's understanding. We also propose a novel measure for estimating the difficulty for a human to predict instances of a robot's behavior in unseen environments. A user study finds that our test difficulty measure correlates well with human performance and confidence. Interestingly, considering human beliefs and counterfactuals when selecting demonstrations decreases human performance on easy tests, but increases performance on difficult tests, providing insight on how to best utilize such models.

“…Explanations help establish a connection between what has been observed and its causes, and serve as a principled basis for generalization [14]. Consequently, explanations scaffold causal learning and have a crucial role in inference [44]. Following this idea, our work also generate explanations in the form of sentence-trajectories and uses maximum likelihood inverse reinforcement learning to find a weighting of the state features that (locally) maximizes the probability of these trajectories.…”

Section: Learning Rewards From Explanations (mentioning)

confidence: 99%

Learning from Explanations With Maximum Likelihood Inverse Reinforcement Learning

Tulli, Melo, Paiva, et al.

2022

Preprint

Our research effort takes inspiration from human social learning mechanisms to focus on situations in which an expert guides a learner through explanations. The proposed approach incorporates explanations into maximum likelihood inverse reinforcement learning. We computationally evaluate explanations against other teaching signals (reward, demonstration and explanation) in three navigational scenarios. The generated explanations are also evaluated in a user study with 150 participants. The user study investigates participants' preferences between the different types of teaching signals and the impact of contextual situations, i.e., distance from the task's goal, on their preferences. Our simulations' results show that explanations lead to better performance compared to reward and demonstration signals, and that explanations are preferred by human teachers in situations where the goal is far from the learner.
