2018 IEEE Conference on Computational Intelligence and Games (CIG)
DOI: 10.1109/cig.2018.8490438

Learning to Play General Video-Games via an Object Embedding Network

Abstract: Deep reinforcement learning (DRL) has proven to be an effective tool for creating general video-game AI. However, most current DRL video-game agents learn end-to-end from the video output of the game, which is superfluous for many applications and creates a number of additional problems. More importantly, working directly on pixel-based raw video data is substantially distinct from what a human player does. In this paper, we present a novel method which enables DRL agents to learn directly from object informati…
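Although the abstract is truncated, the core idea of learning directly from per-object information can be illustrated with a minimal sketch. Assuming a DeepSets-style design (the class name, dimensions, and the choice of max-pooling below are illustrative assumptions, not the authors' exact architecture), a shared MLP embeds each object's feature vector and a permutation-invariant pooling step yields a fixed-size state representation:

```python
# Minimal sketch of an object embedding network (names/dims are assumptions).
import torch
import torch.nn as nn

class ObjectEmbeddingNet(nn.Module):
    def __init__(self, obj_dim: int, embed_dim: int = 64):
        super().__init__()
        # Shared per-object encoder: the same weights process every object.
        self.encoder = nn.Sequential(
            nn.Linear(obj_dim, embed_dim), nn.ReLU(),
            nn.Linear(embed_dim, embed_dim), nn.ReLU(),
        )

    def forward(self, objects: torch.Tensor) -> torch.Tensor:
        # objects: (batch, num_objects, obj_dim); num_objects may vary,
        # since the pooling below is permutation- and size-invariant.
        per_object = self.encoder(objects)   # (B, N, E)
        return per_object.max(dim=1).values  # (B, E) fixed-size state
```

The pooled vector can then feed any standard policy network, independent of how many objects appear on screen.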

Cited by 8 publications (9 citation statements). References 18 publications.
“…From experimental results, we do observe that there are notable differences in performance between pixel-level and object-based representations. It is evident from the experimental results that our object-based learnable agent is capable of autonomous learning in all five games and leads to better generalisation performance and robustness to noise and background variation in raw video data (see [24] for detailed results).…”
Section: Object-based Learnable Agent (mentioning)
confidence: 97%
“…Abundant evidence suggests that human players perceive a video game in terms of objects, via the perceptual organisation of coherent pixels, rather than by treating all the pixels in a frame independently. To this end, we have developed object-based reinforcement learning techniques [24] to create "human-like" learnable agents.…”
Section: Object-based Learnable Agent (mentioning)
confidence: 99%
“…without a history-based policy) Proximal Policy Optimisation algorithm [32], using our JSON network model to encode the environment state into a single latent vector, which we branch into separate policy and value heads, each via a single linear transformation. Since the game state consists of multiple object-level descriptions, we encode this final list with the architecture used in [33] (a variant of the architecture of [1]), applied via path-specific function mapping. Additionally, since various object attributes are cosmetic or otherwise irrelevant, we ignore these via path-specific function mappings.…”
Section: Reinforcement Learning Task (mentioning)
confidence: 99%
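As a rough illustration of the statement above, the single latent vector can branch into policy and value heads, each through one linear transformation. This is a sketch under assumed names and shapes, not the cited authors' code:

```python
# Hypothetical actor-critic heads over a shared latent state vector.
import torch
import torch.nn as nn

class PolicyValueHeads(nn.Module):
    def __init__(self, latent_dim: int, num_actions: int):
        super().__init__()
        self.policy_head = nn.Linear(latent_dim, num_actions)  # action logits
        self.value_head = nn.Linear(latent_dim, 1)              # state value

    def forward(self, latent: torch.Tensor):
        logits = self.policy_head(latent)
        value = self.value_head(latent).squeeze(-1)
        return logits, value  # consumed by a PPO-style update
```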
“…For our baseline we use the object-based method of [33] with the same object-level features, but adapted for the PPO algorithm. This uses the same root-level architecture, but with hand-defined object-level feature vectors.…”
Section: Reinforcement Learning Task (mentioning)
confidence: 99%
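The hand-defined object-level feature vectors mentioned above could be built along the following lines; the specific attributes (normalised geometry plus a category one-hot) are assumptions chosen for illustration, not the exact features used in [33]:

```python
# Illustrative hand-defined per-object features (attribute set is assumed).
import numpy as np

NUM_CATEGORIES = 8  # assumed number of distinct object types

def object_features(x, y, w, h, category, screen_w, screen_h):
    one_hot = np.zeros(NUM_CATEGORIES, dtype=np.float32)
    one_hot[category] = 1.0
    # Normalise geometry so features are resolution-independent.
    geometry = np.array([x / screen_w, y / screen_h,
                         w / screen_w, h / screen_h], dtype=np.float32)
    return np.concatenate([geometry, one_hot])  # fixed length per object
```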
“…Tasks with varying numbers of objects are often solved with ad-hoc approaches such as input zero-padding. These methods can often lead to training inefficiencies [15]. We show that the proposed attention mechanism can accept varying numbers of input objects without ad-hoc approximation.…”
Section: Introduction (mentioning)
confidence: 95%
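One way to realise such an attention mechanism is masked attention pooling: a learned query attends over per-object keys and values, and a boolean mask excludes padding slots so that zero-padded entries never influence the result. The sketch below illustrates the general technique under assumed names, not the cited paper's exact mechanism:

```python
# Sketch of attention pooling over a variable-size set of objects.
import torch
import torch.nn as nn

class AttentionPool(nn.Module):
    def __init__(self, obj_dim: int):
        super().__init__()
        self.query = nn.Parameter(torch.randn(obj_dim))  # learned query
        self.key = nn.Linear(obj_dim, obj_dim)
        self.value = nn.Linear(obj_dim, obj_dim)

    def forward(self, objects: torch.Tensor, mask: torch.Tensor):
        # objects: (B, N, D); mask: (B, N), True where a real object exists.
        scores = self.key(objects) @ self.query / objects.shape[-1] ** 0.5
        scores = scores.masked_fill(~mask, float("-inf"))  # ignore padding
        weights = torch.softmax(scores, dim=1)             # (B, N)
        return (weights.unsqueeze(-1) * self.value(objects)).sum(dim=1)
```

Because masked-out scores become negative infinity before the softmax, padded slots receive zero weight, so batches can mix scenes with different object counts.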