Learning Models of Sequential Decision-Making with Partial Specification of Agent Behavior

Unhelkar, Vaibhav V.; Shah, Julie A.

doi:10.1609/aaai.v33i01.33012522

Cited by 10 publications

(7 citation statements)

References 15 publications

(20 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For instance, the proposed approach assumes accurate specification of team members' policies and complete observablity of their actions, both of which might be difficult to meet in practice. Hence, we are exploring learning-based approaches to arrive at team policies in presence of latent states [20], [21]. To address the challenges associated with state and action observability, the development of an AI Coach would be greatly enhanced by nuanced surgical tool detection and people tracking methodology.…”

Section: Discussionmentioning

confidence: 99%

Towards an AI Coach to Infer Team Mental Model Alignment in Healthcare

Seo,

Kennedy-Metz,

Zenati

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Shared mental models are critical to team success; however, in practice, team members may have misaligned models due to a variety of factors. In safety-critical domains (e.g., aviation, healthcare), lack of shared mental models can lead to preventable errors and harm. Towards the goal of mitigating such preventable errors, here, we present a Bayesian approach to infer misalignment in team members' mental models during complex healthcare task execution. As an exemplary application, we demonstrate our approach using two simulated team-based scenarios, derived from actual teamwork in cardiac surgery. In these simulated experiments, our approach inferred model misalignment with over 75% recall, thereby providing a building block for enabling computer-assisted interventions to augment human cognition in the operating room and improve teamwork.

show abstract

Section: Discussionmentioning

confidence: 99%

Towards an AI Coach to Infer Team Mental Model Alignment in Healthcare

Seo,

Kennedy-Metz,

Zenati

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Konidaris et al (2012), Niekum et al (2015), and Ranchod et al (2015) frame the IRL problem in a semi-Markov setting. Further Unhelkar and Shah (2019) proposed agent Markov models (AMM), a hierarchical approach that models the demonstrator’s policy as piecewise Markov with discrete control modes inferred using a non-parametric prior. These approaches utilize reward functions or policies to represent the task specification implicitly.…”

Section: Related Workmentioning

confidence: 99%

Supervised Bayesian specification inference from demonstrations

Shah,

Kamath,

et al. 2023

The International Journal of Robotics Research

View full text Add to dashboard Cite

When observing task demonstrations, human apprentices are able to identify whether a given task is executed correctly long before they gain expertise in actually performing that task. Prior research into learning from demonstrations (LfD) has failed to capture this notion of the acceptability of a task’s execution; meanwhile, temporal logics provide a flexible language for expressing task specifications. Inspired by this, we present Bayesian specification inference, a probabilistic model for inferring task specification as a temporal logic formula. We incorporate methods from probabilistic programming to define our priors, along with a domain-independent likelihood function to enable sampling-based inference. We demonstrate the efficacy of our model for inferring specifications, with over 90% similarity observed between the inferred specification and the ground truth—both within a synthetic domain and during a real-world table setting task.

show abstract

“…The literature investigated how to create a reasonable model of humans and how to obtain task knowledge, e.g., [22]. Hierarchical models consist of layered abstractions and are considered suitable or close to human intuitions.…”

Section: Related Work A) Theory Of Mind In Hrcmentioning

confidence: 99%

Models and Algorithms for Human-Aware Task Planning with Integrated Theory of Mind

Favier,

Shekhar,

Alami

2023

2023 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)

View full text Add to dashboard Cite

It is essential for a collaborative robot to consider the Theory of Mind (ToM) when interacting with humans. Indeed, performing an action in the absence of another agent may create false beliefs like in the well-known Sally & Anne Task [1]. The robot should be able to detect, react to, and even anticipate false beliefs of other agents with a detrimental impact on the task to achieve. Currently, ToM is mainly used to control the task execution and resolve in a reactive way the detrimental false beliefs. Some works introduce ToM at the planning level by considering distinct beliefs, and we are in this context. This work proposes an extension of an existing human-aware task planner and effectively allows the robot to anticipate a false human belief ensuring a smooth collaboration through an implicitly coordinated plan. First, we propose to capture the observability properties of the environment in the state description using two observability types and the notion of co-presence. They allow us to maintain distinct agent beliefs by reasoning directly on what agents can observe through specifically modeled Situation Assessment processes, instead of reasoning of action effects. Then, thanks to the better estimated human beliefs, we can predict if a false belief with adverse impact will occur. If that is the case then, first, the robot's plan can be to communicate minimally and proactively. Second, if this false belief is due to a non-observed robot action, the robot's plan can be to postpone this action until it can be observed by the human, avoiding the creation of the false belief. We implemented our new conceptual approach, discuss its effectiveness qualitatively, and show experimental results on three novel domains.

show abstract

Learning Models of Sequential Decision-Making with Partial Specification of Agent Behavior

Cited by 10 publications

References 15 publications

Towards an AI Coach to Infer Team Mental Model Alignment in Healthcare

Towards an AI Coach to Infer Team Mental Model Alignment in Healthcare

Supervised Bayesian specification inference from demonstrations

Models and Algorithms for Human-Aware Task Planning with Integrated Theory of Mind

Contact Info

Product

Resources

About