Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2020
DOI: 10.18653/v1/2020.emnlp-main.241
Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games

Abstract: We show that Reinforcement Learning (RL) methods for solving Text-Based Games (TBGs) often fail to generalize on unseen games, especially in small data regimes. To address this issue, we propose Context Relevant Episodic State Truncation (CREST) for irrelevant token removal in observation text for improved generalization. Our method first trains a base model using Q-learning, which typically overfits the training games. The base model's action token distribution is used to perform observation pruning that remo…
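The pruning idea in the abstract can be illustrated with a minimal sketch. This is not the paper's actual CREST procedure (which derives token relevance from a trained base model); the `prune_observation` helper and the toy `action_counts` table below are hypothetical, standing in for the base model's action token distribution. Tokens of the observation that never appear among the base model's action tokens are treated as context-irrelevant and dropped.

```python
from collections import Counter


def prune_observation(observation: str, action_counts: Counter, min_count: int = 1) -> str:
    """Keep only observation tokens that occur at least min_count times
    in the (hypothetical) action-token counts; drop everything else as
    context-irrelevant. Counter returns 0 for unseen tokens."""
    kept = [tok for tok in observation.split()
            if action_counts[tok.lower()] >= min_count]
    return " ".join(kept)


# Toy stand-in for the base policy's action token distribution.
action_counts = Counter({"take": 5, "key": 3, "open": 4, "door": 4, "go": 6, "north": 2})

obs = "You see a rusty key on the dusty table next to a wooden door"
print(prune_observation(obs, action_counts))  # → "key door"
```

Only "key" and "door" survive, since they are the only observation tokens that also occur in the action-token table; the pruned observation is what a downstream policy would then consume.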

Cited by 9 publications (10 citation statements) · References 18 publications
“…LeDeepChef (Adolphs and Hofmann, 2020) used recurrent feature extraction along with A2C (Mnih et al., 2016). CREST (Chaudhury et al., 2020) was proposed for pruning observation information. TWC (Murugesan et al., 2021) was proposed for utilizing common sense reasoning.…”
Section: Related Work
confidence: 99%
“…The field of text-based and interactive games has seen a lot of recent interest and work, thanks in large part to the creation and availability of pioneering environments such as TextWorld [11] and the Jericho [17] collection. Based on these domains, several interesting approaches have been proposed that seek to improve the efficiency of agents in these environments [4,12,9,22]. We mention and discuss this prior work in context in the earlier parts of this paper.…”
Section: Related Work
confidence: 99%
“…Specifically, we consider RL agents in the TextWorld and Jericho TBG environments, and the additional information that can be provided to such agents to improve their performance. Past work has focused on using external knowledge to either limit [9] or enhance [22] the space of actions; however, this has also been restricted to the text modality. At their crux, these efforts all fundamentally try to solve the problem of relationships within the environment: how are different things in the world related to each other?…”
Section: Introduction
confidence: 99%
“…Under certain controls necessary for studying RL, text-based games provide complex, interactive, and varied simulated environments where the game state observation is obtained through a text description, and the agent is expected to make progress by entering text commands. In addition to language understanding (Ammanabrolu and Riedl, 2019; Adhikari et al., 2020), successful play requires skills such as long-term memory (Narasimhan et al., 2015), exploration, observation pruning (Chaudhury et al., 2020), and common sense reasoning (Keerthiram Murugesan and Campbell, 2021). However, these studies do not use a neuro-symbolic approach, which combines a neural network with a symbolic framework.…”
Section: Introduction
confidence: 99%