Robotics: Science and Systems XI 2015
DOI: 10.15607/rss.2015.xi.018

Grounding English Commands to Reward Functions

Abstract: As intelligent robots become more prevalent, methods that make interaction with them more accessible are increasingly important. Communicating the tasks a person wants the robot to carry out via natural language, and training the robot to ground that language through demonstration, are especially appealing approaches for interaction, since they do not require a technical background. However, existing approaches map natural language commands to robot command languages that directly expr…

Cited by 49 publications (66 citation statements). References 19 publications.
“…The question of how to effectively convert between natural language instructions and robot behavior has been widely studied in previous work [50,34,24,14,9,47,8,18,11,1,33,28,36,27,40,7,2,19,37]. So far, there have been three categories of behavior specifications that these works have mapped natural language to: action sequences, goal states, and LTL specifications.…”
Section: Related Work
confidence: 99%
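The three target representations named in that excerpt (action sequences, goal states, and LTL specifications) can be made concrete with a small sketch. The class and field names below are illustrative assumptions, not taken from any of the cited works.

```python
# Illustrative sketch of the three behavior-specification targets that prior
# language-grounding work has mapped commands to; all names are hypothetical.
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class ActionSequence:
    """A fixed plan executed step by step."""
    actions: List[str]            # e.g. ["go_north", "pick_up(block1)"]

@dataclass
class GoalState:
    """A desired world state; a planner searches for actions that reach it."""
    predicates: Dict[str, bool]   # e.g. {"blockIn(block1, room2)": True}

@dataclass
class LTLSpecification:
    """A linear temporal logic formula constraining whole trajectories."""
    formula: str                  # e.g. "F (in_room2 & F holding_block)"
```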
“…Following the framework introduced by MacGlashan et al [33], we treat natural language as the specification of a latent reward function that completes the definition of an otherwise fully-specified MDP. We use a language grounding model to arrive at a more consolidated, semantic representation of that reward function, thereby completing the MDP and allowing it to be passed to an arbitrary planning algorithm for generating robot behavior.…”
Section: Problem Setting
confidence: 99%
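The excerpt above summarizes the framing borrowed from MacGlashan et al.: language specifies a latent reward function, grounding recovers it, and the completed MDP can then be handed to any planner. The sketch below is a hypothetical illustration of that pipeline, not their implementation; the names (`MDP`, `ground_command`, `plan`) and the toy grounding rule are assumptions.

```python
# Sketch: a command is grounded to a reward function, which completes an
# otherwise fully specified MDP so an arbitrary planner can produce behavior.
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

State = str
Action = str
RewardFn = Callable[[State, Action, State], float]

@dataclass
class MDP:
    states: List[State]
    actions: List[Action]
    transition: Callable[[State, Action], Dict[State, float]]  # (s, a) -> P(s')
    reward: Optional[RewardFn] = None   # left unspecified until language is grounded
    gamma: float = 0.95

def ground_command(command: str) -> RewardFn:
    """Stand-in for a learned grounding model mapping a command to a reward."""
    target = command.split()[-1]        # toy rule: last word names the goal state
    return lambda s, a, s_next: 1.0 if s_next == target else 0.0

def plan(mdp: MDP, start: State) -> List[Action]:
    """Placeholder for an arbitrary planner (value iteration, search, ...)."""
    raise NotImplementedError

# Usage: the reward slot is filled by grounding, then the completed MDP is planned over.
# mdp.reward = ground_command("go to the kitchen"); plan(mdp, "hallway")
```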
“…Here, we learn an adaptive strategy that aims at maximising the overall learning performance simultaneously, by properly adjusting the positive confidence threshold in the range of 0.65 to 0.95. We train the optimization using an RL library, BURLAP (MacGlashan, 2015), as follows, in detail:…”
Section: Adaptive Confidence Threshold
confidence: 99%
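The excerpt gives only the high-level idea; below is a minimal, hedged sketch of one way an RL loop could adapt such a confidence threshold. It uses plain-Python Q-learning rather than BURLAP (a Java library), and `run_episode` is a stub standing in for the downstream learner's measured performance; the state space is the discretized 0.65–0.95 threshold range mentioned in the quote.

```python
# Hedged sketch: treat the positive-confidence threshold as something a
# Q-learning agent tunes by nudging it up or down and observing performance.
import random
from collections import defaultdict

THRESHOLDS = [round(0.65 + 0.05 * i, 2) for i in range(7)]   # 0.65 .. 0.95
ACTIONS = [-1, 0, +1]                                         # lower / keep / raise

def run_episode(threshold: float) -> float:
    """Stub reward: would run the downstream learner with this threshold
    and return its measured performance."""
    return random.random()

Q = defaultdict(float)                                        # (state, action) -> value
alpha, gamma, epsilon = 0.1, 0.9, 0.2
state = THRESHOLDS.index(0.80)                                # start mid-range

for episode in range(200):
    # epsilon-greedy choice over threshold adjustments
    if random.random() < epsilon:
        action = random.choice(ACTIONS)
    else:
        action = max(ACTIONS, key=lambda a: Q[(state, a)])
    next_state = min(max(state + action, 0), len(THRESHOLDS) - 1)
    reward = run_episode(THRESHOLDS[next_state])
    # standard Q-learning update
    best_next = max(Q[(next_state, a)] for a in ACTIONS)
    Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])
    state = next_state
```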
“…There has been a broad and diverse set of work examining how best to interpret and execute natural language instructions on a robot platform (Vogel and Jurafsky, 2010; Tellex et al., 2011; Artzi and Zettlemoyer, 2013; Howard et al., 2014; Andreas and Klein, 2015; MacGlashan et al., 2015; Paul et al., 2016; Mei et al., 2016; Arumugam et al., 2017). Vogel and Jurafsky (2010) produce policies using rewards based on language and expert trajectories, which allow for planning within a stochastic environment along with re-planning in case of failure.…”
Section: Related Work
confidence: 99%