2013
DOI: 10.1016/j.neucom.2012.08.039

Transferring task models in Reinforcement Learning agents

Abstract: The main objective of Transfer Learning is to reuse knowledge acquired in a previously learned task in order to enhance the learning procedure in a new and more complex task. Transfer learning is a suitable solution for speeding up the learning procedure in Reinforcement Learning tasks. This work proposes a novel method for transferring models to Reinforcement Learning agents. The models of the transition and reward functions of a source task will be transferred to a relevant but different target task. …
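The abstract describes transferring the transition and reward models of a source task and then learning in the target task with a hybrid model-free/model-based agent. As a rough illustration of that idea only, the following Python sketch shows a Dyna-style agent that mixes Q-learning on real target-task experience with planning updates drawn from a transferred source model; every name here (ModelTransferAgent, source_model, map_state, map_action, unmap_state, planning_steps) is a hypothetical assumption for illustration and not the TiMRLA algorithm as specified in the paper.

```python
import random
from collections import defaultdict

# Conceptual sketch only (hypothetical names, not the paper's TiMRLA code):
# a Dyna-style agent that learns model-free from real target-task transitions
# and additionally performs planning updates using a transferred source-task
# model accessed through user-supplied inter-task mappings.

class ModelTransferAgent:
    def __init__(self, actions, source_model, map_state, map_action, unmap_state,
                 alpha=0.1, gamma=0.99, epsilon=0.1, planning_steps=5):
        self.q = defaultdict(float)       # tabular Q(s, a) for the target task
        self.actions = actions
        self.source_model = source_model  # (s_src, a_src) -> (s_src', r) or None
        self.map_state = map_state        # target state  -> source state
        self.map_action = map_action      # target action -> source action
        self.unmap_state = unmap_state    # source state  -> target state
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.planning_steps = planning_steps
        self.visited = []                 # (state, action) pairs seen in the target task

    def act(self, state):
        # epsilon-greedy action selection in the target task
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def _td_update(self, s, a, r, s_next):
        best_next = max(self.q[(s_next, b)] for b in self.actions)
        self.q[(s, a)] += self.alpha * (r + self.gamma * best_next - self.q[(s, a)])

    def learn(self, s, a, r, s_next):
        # 1) model-free update from real target-task experience
        self._td_update(s, a, r, s_next)
        self.visited.append((s, a))
        # 2) model-based updates from transitions simulated by the source model
        for _ in range(self.planning_steps):
            ps, pa = random.choice(self.visited)
            sim = self.source_model(self.map_state(ps), self.map_action(pa))
            if sim is None:
                continue                  # mapped pair not covered by the source model
            s_src_next, sim_r = sim
            self._td_update(ps, pa, sim_r, self.unmap_state(s_src_next))
```

The inter-task mappings are left as user-supplied callables here, which mirrors the dependence on a hand-coded mapping noted in the citation statements below.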

Cited by 18 publications (14 citation statements)
References 21 publications (34 reference statements)
“…The experiments verified the initial claim that L3 outperforms its non-transfer-learning versions, as well as state-of-the-art transfer learning algorithms. The results show that L3 outperforms traditional RL algorithms such as Q-Learning and SARSA(λ), HARL algorithms such as HAQL and HA-SARSA(λ), and TL algorithms such as Taylor's MASTER (Taylor, 2008) and the TiMRLA Value-Addition algorithm (Fachantidis et al, 2013). It is worth pointing out that, in contrast to L3, HAQL and HA-SARSA(λ) presuppose user-defined domain knowledge.…”
Section: Discussion
confidence: 95%
“…L3 was compared with other state-of-the-art transfer learning algorithms: the TiMRLA Value-Addition transfer learning algorithm (Fachantidis et al, 2013) and Taylor's MASTER algorithm (Taylor, 2008), on the Mountain Car experiment (Section 5.1). We did not find any competing algorithm in the literature to compare against on the Humanoid Robot Stabilisation experiment (presented in Section 5.2), a domain included in this paper to illustrate the generality of the proposed algorithm.…”
Section: Discussion
confidence: 99%
“…Fachantidis et al [10] transfer the model of the transition and reward functions of a source task to a relevant but different target task; the agent then takes a hybrid approach, implementing both model-free and model-based learning. However, both methods are limited by the need for an inter-task mapping.…”
Section: Related Work
confidence: 99%