2021
DOI: 10.1609/icaps.v29i1.3463

Explicability? Legibility? Predictability? Transparency? Privacy? Security? The Emerging Landscape of Interpretable Agent Behavior

Abstract: There has been significant interest of late in generating behavior of agents that is interpretable to the human (observer) in the loop. However, the work in this area has typically lacked coherence on the topic, with proposed solutions for “explicable”, “legible”, “predictable” and “transparent” planning with overlapping, and sometimes conflicting, semantics all aimed at some notion of understanding what intentions the observer will ascribe to an agent by observing its behavior. This is also true for the recen…

Cited by 48 publications (35 citation statements). References 27 publications.

“…Explanations are essential for humans to understand the outputs and decisions made by AI systems (Core et al 2006; Miller 2019). There exist many works that provide explanations for different AI use cases ranging from automated planning (Fox, Long, and Magazzeni 2017; Chakraborti et al 2019) to machine learning (Carvalho, Pereira, and Cardoso 2019) or deep learning (Samek, Wiegand, and Müller 2017). Explanations are also crucial in multi-agent environments where some extra challenges arise, such as privacy preservation or fairness (Kraus et al 2020).…”
Section: Related Work (mentioning, confidence: 99%)
“…In the AI and robotics communities, there has been growing interest in interpretable agent behavior in the past few years (Dragan, Lee, and Srinivasa 2013; Langley et al 2017; Gunning and Aha 2019; Chakraborti et al 2019; Sreedharan et al 2021), stemming from the consideration that rarely, if ever, agents act in isolation from humans. Synthesizing interpretable behavior facilitates smoother Human-AI interaction and also supports trust in autonomy (Bhatt, Ravikumar, and Moura 2019).…”
Section: Related Work (mentioning, confidence: 99%)
“…Synthesizing interpretable behavior facilitates smoother Human-AI interaction and also supports trust in autonomy (Bhatt, Ravikumar, and Moura 2019). Interpretability has been studied along three main dimensions, legibility, explicability and predictability (Chakraborti et al 2019), but, lately, some effort has been made to connect and integrate these concepts in unified frameworks (Sreedharan et al 2021; Miura and Zilberstein 2021). We will limit our discussion to legibility and the most relevant related work.…”
Section: Related Work (mentioning, confidence: 99%)
“…Generally speaking, each method regularizes a specific part of the agent's behavior to match an observer's expectations, therefore reducing the ambiguity that the agent's intentions have in the observer model (see Figure 1). Depending on the specific technique, the observer model is designed to be interested in different parts of intentions, such as goals, future plans, or underlying beliefs [6], and thus each interpretability technique regularizes corresponding parts of the agent's intentional model.…”
Section: Introduction (mentioning, confidence: 99%)
“…In addition, the agent has an estimate of the intentional model about itself that is possessed by the observer, P^H_R, which provides the agent information on how its intention is being understood. P^H_R is a second-order theory of mind focused on the observer's inferences about the agent [6]. In this context, the behavior of the agent is therefore a balance between three types of behavior: optimal behavior, interpretable behavior and explanations, all together having the general objective of fulfilling the agent's intention while keeping |P_R − P^H_R|, the distance between the intentional models, low.…”
Section: Introduction (mentioning, confidence: 99%)
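The trade-off described in the last statement (fulfilling the agent's objective while keeping the distance between the agent's own intentional model P_R and the observer's estimate P^H_R low) can be illustrated with a minimal sketch. This is only an illustration under assumed definitions: the function name select_interpretable_plan, the model_distance callable, and the trade_off weight are hypothetical placeholders, not the formulation used in the cited papers.

```python
# Illustrative sketch (not the cited papers' formulation): pick the plan that
# balances task cost against the divergence |P_R - P^H_R| that executing the
# plan would leave between the agent's model and the observer's estimate of it.

from typing import Callable, Optional, Sequence, Tuple


def select_interpretable_plan(
    plans: Sequence[Tuple[str, float]],          # (plan_id, task_cost) pairs
    model_distance: Callable[[str], float],      # assumed: plan -> resulting |P_R - P^H_R|
    trade_off: float = 1.0,                      # assumed weight between cost and interpretability
) -> Optional[str]:
    """Return the plan minimizing task_cost + trade_off * model_distance."""
    best_plan: Optional[str] = None
    best_score = float("inf")
    for plan_id, task_cost in plans:
        score = task_cost + trade_off * model_distance(plan_id)
        if score < best_score:
            best_plan, best_score = plan_id, score
    return best_plan


if __name__ == "__main__":
    # Toy numbers: a slightly costlier plan wins if it keeps the observer's
    # model of the agent much closer to the agent's own.
    toy_plans = [("direct", 10.0), ("legible_detour", 12.0)]
    toy_distance = {"direct": 5.0, "legible_detour": 1.0}
    print(select_interpretable_plan(toy_plans, lambda p: toy_distance[p], trade_off=1.0))
```

Under these assumed numbers the "legible_detour" plan is selected, mirroring the cited idea that an agent may accept a small loss of optimality in exchange for behavior whose intention the observer interprets correctly.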