A typical goal for transfer learning algorithms is to utilize knowledge gained in a source task to learn a target task faster. Recently introduced transfer methods in reinforcement learning settings have shown considerable promise, but they typically transfer between pairs of very similar tasks. This work introduces Rule Transfer, a transfer algorithm that first learns rules to summarize a source task policy and then leverages those rules to learn faster in a target task. This paper demonstrates that Rule Transfer can effectively speed up learning in Keepaway, a benchmark RL problem in the robot soccer domain, based on experience from source tasks in the gridworld domain. We empirically show, through the use of three distinct transfer metrics, that Rule Transfer is effective across these domains.
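The abstract describes a two-step scheme: learn rules that summarize a source-task policy, then use those rules to speed learning in the target task. The sketch below illustrates one plausible reading of that idea; the rule predicates, action names, and the probability of following a rule are all illustrative assumptions, not details from the paper.

```python
import random

# Hypothetical hand-off from the source task: rules summarizing a
# learned policy. Each rule maps a state predicate to a suggested
# action; the predicates and actions here are illustrative only.
rules = [
    (lambda s: s["dist_to_goal"] < 2.0, "move_forward"),
    (lambda s: s["obstacle_ahead"], "turn_left"),
]

def rule_action(state):
    """Return the first matching rule's suggested action, or None."""
    for predicate, action in rules:
        if predicate(state):
            return action
    return None

def choose_action(state, q_values, actions, follow_rule_prob=0.5):
    """Bias early target-task exploration toward rule suggestions,
    falling back to greedy selection over the learner's own Q-values."""
    suggestion = rule_action(state)
    if suggestion is not None and random.random() < follow_rule_prob:
        return suggestion
    key = tuple(sorted(state.items()))
    return max(actions, key=lambda a: q_values.get((key, a), 0.0))
```

As the target-task learner gains experience, `follow_rule_prob` would typically be decayed so the agent eventually relies on its own learned values rather than the transferred rules.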
Reinforcement learning agents typically require a significant amount of data before performing well on complex tasks. Transfer learning methods have made progress in reducing sample complexity, but they have primarily been applied to model-free learning methods rather than to more data-efficient model-based methods. This paper introduces TIMBREL, a novel method capable of transferring information effectively into a model-based reinforcement learning algorithm. We demonstrate that TIMBREL can significantly improve the sample efficiency and asymptotic performance of a model-based algorithm learning in a continuous state space. Additionally, we conduct experiments to test the limits of TIMBREL's effectiveness.
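Transferring into a model-based learner typically means seeding the target-task model with experience gathered in the source task. The sketch below shows one minimal way this could work, assuming hand-coded inter-task mappings that translate source states and actions into target-task coordinates; the mapping functions and transition format are assumptions for illustration, not the paper's own interfaces.

```python
def map_instance(source_instance, state_map, action_map):
    """Translate one recorded (s, a, r, s') source-task transition
    into target-task coordinates via inter-task mappings. Rewards
    are assumed to carry over unchanged in this sketch."""
    s, a, r, s_next = source_instance
    return (state_map(s), action_map(a), r, state_map(s_next))

def seed_target_model(source_instances, state_map, action_map):
    """Produce a list of mapped transitions that a model-based
    learner could use as initial training data in the target task."""
    return [map_instance(i, state_map, action_map)
            for i in source_instances]
```

The target-task model learner would then refine or overwrite these seeded transitions as it collects real target-task experience.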
Empirical evaluations play an important role in machine learning. However, the usefulness of any evaluation depends on the empirical methodology employed. Designing good empirical methodologies is difficult in part because agents can overfit test evaluations and thereby obtain misleadingly high scores. We argue that reinforcement learning is particularly vulnerable to environment overfitting and propose as a remedy generalized methodologies, in which evaluations are based on multiple environments sampled from a distribution. In addition, we consider how to summarize performance when scores from different environments may not have commensurate values. Finally, we present proof-of-concept results demonstrating how these methodologies can validate an intuitively useful range-adaptive tile coding method.
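The abstract raises two concrete issues: evaluating over environments sampled from a distribution, and summarizing scores that are not commensurate across environments. One standard way to handle the latter is to compare agents by per-environment rank rather than raw score. The sketch below illustrates both ideas under assumed, toy definitions of the environment distribution and scoring function.

```python
import random

def sample_environment(rng):
    # Hypothetical environment distribution: each sample varies
    # parameters the agent cannot have tuned against in advance.
    return {"goal_reward": rng.uniform(1, 10),
            "step_cost": rng.uniform(0.01, 0.1)}

def evaluate(agent, env):
    # Stand-in score; a real evaluation would run learning episodes.
    return agent["skill"] * env["goal_reward"] - env["step_cost"]

def generalized_rank_eval(agents, n_envs=30, seed=0):
    """Rank agents within each sampled environment, then average
    the ranks, avoiding direct comparison of raw scores whose
    scales may differ from environment to environment."""
    rng = random.Random(seed)
    totals = {name: 0.0 for name in agents}
    for _ in range(n_envs):
        env = sample_environment(rng)
        scores = {name: evaluate(a, env) for name, a in agents.items()}
        # Rank 1 = best in this environment.
        ordered = sorted(scores, key=scores.get, reverse=True)
        for rank, name in enumerate(ordered, 1):
            totals[name] += rank
    return {name: totals[name] / n_envs for name in totals}
```

Because only within-environment orderings are aggregated, an environment with unusually large reward magnitudes cannot dominate the summary.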
This article introduces a teacher-student framework for reinforcement learning, synthesising and extending material that appeared in conference proceedings [Torrey, L., & Taylor, M. E. (2013). Teaching on a budget: Agents advising agents in reinforcement learning. Proceedings of the international conference on autonomous agents and multiagent systems] and in a non-archival workshop paper [Carboni, N., & Taylor, M. E. (2013, May). Preliminary results for 1 vs. 1 tactics in StarCraft. Proceedings of the adaptive and learning agents workshop (at AAMAS-13)]. In this framework, a teacher agent instructs a student agent by suggesting actions the student should take as it learns. However, the teacher may only give such advice a limited number of times. We present several novel algorithms that teachers can use to budget their advice effectively, and we evaluate them in two complex video games: StarCraft and Pac-Man. Our results show that the same amount of advice, given at different moments, can have different effects on student learning, and that teachers can significantly affect student learning even when students use different learning methods and state representations.
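One natural way to budget advice, in the spirit of this framework, is for the teacher to spend advice only in states where its own value estimates say the choice of action matters most. The sketch below illustrates that idea; the importance measure (spread of the teacher's Q-values), the threshold, and the Q-table format are assumptions for illustration rather than a faithful reproduction of the paper's algorithms.

```python
class BudgetedTeacher:
    """Sketch of importance-based advising: spend a limited advice
    budget in states where the teacher's Q-values indicate the
    action choice has large consequences."""

    def __init__(self, q_table, budget, threshold):
        self.q_table = q_table      # state -> {action: value}
        self.budget = budget        # remaining pieces of advice
        self.threshold = threshold  # minimum importance to advise

    def advise(self, state):
        """Return a suggested action, or None to save the budget."""
        if self.budget <= 0 or state not in self.q_table:
            return None
        q = self.q_table[state]
        importance = max(q.values()) - min(q.values())
        if importance < self.threshold:
            return None
        self.budget -= 1
        return max(q, key=q.get)
```

The abstract's finding that the same amount of advice has different effects depending on when it is given corresponds here to how the threshold concentrates the budget on high-importance moments.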