2020
DOI: 10.48550/arxiv.2011.07553
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

CDT: Cascading Decision Trees for Explainable Reinforcement Learning

Zihan Ding,
Pablo Hernandez-Leal,
Gavin Weiguang Ding
et al.

Abstract: Deep Reinforcement Learning (DRL) has recently achieved significant advances in various domains. However, explaining the policy of RL agents still remains an open problem due to several factors, one being the complexity of explaining neural networks decisions. Recently, a group of works have used decision-tree-based models to learn explainable policies. Soft decision trees (SDTs) and discretized differentiable decision trees (DDTs) have been demonstrated to achieve both good performance and share the benefit o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…Univariate nodes are introduced to discretize SDTs for better interpretability (Silva et al, 2020). A feature learning tree is integrated with the standard SDT to improve model expressivity (Ding et al, 2020). A series of novel metrics are proposed for a comprehensive evaluation (Dahlin et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Univariate nodes are introduced to discretize SDTs for better interpretability (Silva et al, 2020). A feature learning tree is integrated with the standard SDT to improve model expressivity (Ding et al, 2020). A series of novel metrics are proposed for a comprehensive evaluation (Dahlin et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
“…2) Our work concentrates on the aircraft separation assurance task with a complex high-dimensional input space. But the previous works mainly focus on tasks with simple low-dimensional input [e.g., CartPole (Ding et al, 2020), LunarLander (Silva et al, 2020)] or tasks with pixel-based input [e.g., Mario AI Benchmark (Coppens et al, 2019), Wildfire Tracking (Haksar and Schwager, 2018)]. 3) Our work focuses on a complex multi-agent problem so the proposed methods need to consider the cooperation among all agents in the system.…”
Section: Related Workmentioning
confidence: 99%