2022
DOI: 10.48550/arxiv.2202.08434
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Survey of Explainable Reinforcement Learning

Abstract: Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine learning that has attracted considerable attention in recent years. The goal of XRL is to elucidate the decision-making process of learning agents in sequential decision-making settings. In this survey, we propose a novel taxonomy for organizing the XRL literature that prioritizes the RL setting. We overview techniques according to this taxonomy. We point out gaps in the literature, which we use to motivate and outline a ro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 17 publications
(24 citation statements)
references
References 6 publications
0
24
0
Order By: Relevance
“…The second one relates to the models not being built off labelled training data (which would simplify explainability). Further inspiration could be taken from relevant survey papers and implementations [10,49,74,75,80,82,91,95,96,109,116]. Failure to address this challenge will lead to the automated blue agent not being certified by industrial employees within networked systems since the trust towards the agent will be low.…”
Section: Explainable Rl (A24)mentioning
confidence: 99%
“…The second one relates to the models not being built off labelled training data (which would simplify explainability). Further inspiration could be taken from relevant survey papers and implementations [10,49,74,75,80,82,91,95,96,109,116]. Failure to address this challenge will lead to the automated blue agent not being certified by industrial employees within networked systems since the trust towards the agent will be low.…”
Section: Explainable Rl (A24)mentioning
confidence: 99%
“…Recent work from Milani et al [27] summarizes different methods of explanations in RL algorithms under a new taxonomy based on three main groups:…”
Section: Explainable Reinforcement Learningmentioning
confidence: 99%
“…Most work on interpretable RL focuses on the single-agent setting [23]. We first discuss techniques that directly learn DT policies.…”
Section: Related Workmentioning
confidence: 99%