2020
DOI: 10.1016/j.engappai.2020.103559
|View full text |Cite
|
Sign up to set email alerts
|

Interpretable policies for reinforcement learning by empirical fuzzy sets

Abstract: This paper proposes a method and an algorithm to implement interpretable fuzzy reinforcement learning (IFRL). It provides alternative solutions to common problems in RL, like function approximation and continuous action space. The learning process resembles that of human beings by clustering the encountered states, developing experiences for each of the typical cases, and making decisions fuzzily. The learned policy can be expressed as human-intelligible IF-THEN rules, which facilitates further investigation a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…In a typical reinforcement learning framework, an agent learns to achieve a goal by interacting with the environment, which is defined in the form of a Markov decision process. The agent gets either rewards or penalties for the actions it performs, and its main goal is to maximize the long-term reward (Huang et al 2020).…”
Section: Reinforcement Learning-based Fuzzy Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…In a typical reinforcement learning framework, an agent learns to achieve a goal by interacting with the environment, which is defined in the form of a Markov decision process. The agent gets either rewards or penalties for the actions it performs, and its main goal is to maximize the long-term reward (Huang et al 2020).…”
Section: Reinforcement Learning-based Fuzzy Systemsmentioning
confidence: 99%
“…By treating the combined return value of a series of actions as the fitness value to be maximized, a particle swarm reinforcement learning method is presented in Hein et al (2017) to learn the best policy represented by fuzzy rules. Since the majority of existing fuzzy reinforcement learning methods are implemented on the basis of (fuzzy) neural networks with very limited interpretability, an interpretable reinforcement learning scheme is proposed in Huang et al (2020), where the learned policy can be expressed as human-intelligible IF-THEN rules and the value function is approximated through the AnYa type fuzzy rule-based system.…”
Section: Reinforcement Learning-based Fuzzy Systemsmentioning
confidence: 99%
“…It is a type of decision maker for handling vague inputs. It has a capability of decision making similar to human that is by framing set of rules [21]. Several inputs are considered at an interval and appropriate action is obtained [22].…”
Section: Modelmentioning
confidence: 99%
“…As, for instance, knowing that two actions have nearly the same probability in a given state indicates that they are both equally good, which allows our rule mining algorithm to chose one action or the other, depending on what leads to the simplest rule set. This exploitation of the meta-information of an RL process also differentiates our work from approaches that translate the RL policy in a set of fuzzy rules, such as [11] and [9].…”
Section: Related Workmentioning
confidence: 99%
“…0 0 0 0 0 0 9 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 4 4 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 16 0 0 0 0 0 0 0 0 0 0 9 0 9 9 0 0 0 23 0 11 11 As explained above, we propose a two-phased distillation algorithm, to produce meaningful rules from a black-box policy learned with Deep Reinforcement Learning. The first phase produces a list of rules that approximate how the Deep RL policy maps states to actions.…”
Section: Explaining Policies Through Distillationmentioning
confidence: 99%