2019
DOI: 10.1609/aaai.v33i01.33012522
|View full text |Cite
|
Sign up to set email alerts
|

Learning Models of Sequential Decision-Making with Partial Specification of Agent Behavior

Abstract: Artificial agents that interact with other (human or artificial) agents require models in order to reason about those other agents’ behavior. In addition to the predictive utility of these models, maintaining a model that is aligned with an agent’s true generative model of behavior is critical for effective human-agent interaction. In applications wherein observations and partial specification of the agent’s behavior are available, achieving model alignment is challenging for a variety of reasons. For one, the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3
1

Relationship

1
9

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 15 publications
(20 reference statements)
0
7
0
Order By: Relevance
“…For instance, the proposed approach assumes accurate specification of team members' policies and complete observablity of their actions, both of which might be difficult to meet in practice. Hence, we are exploring learning-based approaches to arrive at team policies in presence of latent states [20], [21]. To address the challenges associated with state and action observability, the development of an AI Coach would be greatly enhanced by nuanced surgical tool detection and people tracking methodology.…”
Section: Discussionmentioning
confidence: 99%
“…For instance, the proposed approach assumes accurate specification of team members' policies and complete observablity of their actions, both of which might be difficult to meet in practice. Hence, we are exploring learning-based approaches to arrive at team policies in presence of latent states [20], [21]. To address the challenges associated with state and action observability, the development of an AI Coach would be greatly enhanced by nuanced surgical tool detection and people tracking methodology.…”
Section: Discussionmentioning
confidence: 99%
“…Konidaris et al (2012), Niekum et al (2015), and Ranchod et al (2015) frame the IRL problem in a semi-Markov setting. Further Unhelkar and Shah (2019) proposed agent Markov models (AMM), a hierarchical approach that models the demonstrator’s policy as piecewise Markov with discrete control modes inferred using a non-parametric prior. These approaches utilize reward functions or policies to represent the task specification implicitly.…”
Section: Related Workmentioning
confidence: 99%
“…The literature investigated how to create a reasonable model of humans and how to obtain task knowledge, e.g., [22]. Hierarchical models consist of layered abstractions and are considered suitable or close to human intuitions.…”
Section: Related Work A) Theory Of Mind In Hrcmentioning
confidence: 99%