2020
DOI: 10.1177/0278364920917755
Multimodal estimation and communication of latent semantic knowledge for robust execution of robot instructions

Abstract: The goal of this article is to enable robots to perform robust task execution following human instructions in partially observable environments. A robot’s ability to interpret and execute commands is fundamentally tied to its semantic world knowledge. Commonly, robots use exteroceptive sensors, such as cameras or LiDAR, to detect entities in the workspace and infer their visual properties and spatial relationships. However, semantic world properties are often visually imperceptible. We posit the use of non-ext…

Cited by 36 publications (26 citation statements)
References 73 publications
“…Efficient and accurate interpretation of instructions is particularly important for space missions involving robotic partners where communication or interaction between humans and robots is intermittent and bandwidth limited since the robot may not always have the ability to request a clarification when performing the task. Arkin et al have developed transparent and computationally efficient models for verifiable grounded language communication [54].…”
Section: Human-Robot Communication
confidence: 99%
“…Bayesian logic networks have been used to cope with noise and non-deterministic data from different data sources [13]. More recently, knowledge graph (KG) embedding models were introduced as scalable frameworks to model object knowledge encoded in multi-relational KGs [16,4]. Although the above techniques effectively model objects, they only support reasoning about binary class-level facts, therefore lacking the discriminative features needed to model object semantics in realistic environments.…”
Section: A. Semantic Reasoning in Robotics
confidence: 99%
“…In this paper, we compare two variants of TuckER:
- The regular TuckER model follows existing work [16,4] to model binary relations between object class and object properties.
- TuckER+ is a TuckER embedding model we implement to model binary relations between all pairs of property types (e.g., color and material, shape and location); it approximates an n-ary relation with a combination of binary relations.…”
Section: Experiments on Link Dataset
confidence: 99%
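The excerpt above refers to TuckER-style knowledge graph embeddings for scoring binary facts such as (object class, hasProperty, value). As a rough illustration only — not the cited authors' implementation — a minimal sketch of the standard TuckER scoring form, where a learnable core tensor W is contracted with subject, relation, and object embeddings, might look like this (all embedding names and dimensions below are hypothetical):

```python
import numpy as np

def tucker_score(W, e_s, w_r, e_o):
    """Score a (subject, relation, object) triple with the TuckER form
    phi(s, r, o) = W x_1 e_s x_2 w_r x_3 e_o; higher scores indicate
    more plausible facts. W is the shared core tensor; e_s, w_r, e_o
    are the subject-entity, relation, and object-entity embeddings."""
    return float(np.einsum('ijk,i,j,k->', W, e_s, w_r, e_o))

# Toy usage for a hypothetical binary fact like (mug, hasColor, red).
rng = np.random.default_rng(0)
d_e, d_r = 4, 3                              # entity / relation embedding sizes (illustrative)
W = rng.standard_normal((d_e, d_r, d_e))     # core tensor (learned in practice)
e_mug = rng.standard_normal(d_e)
w_color = rng.standard_normal(d_r)
e_red = rng.standard_normal(d_e)
s = tucker_score(W, e_mug, w_color, e_red)   # a single plausibility score
```

In a trained model these embeddings are fit so that observed triples score higher than corrupted ones; the TuckER+ variant described in the excerpt would apply the same binary scoring over pairs of property types rather than class–property pairs.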
“…To address this problem, several lines of research have shown that incorporating a variety of sensory modalities is the key to further enhance the robotic capabilities in recognizing multisensory object properties (see [4] and [21] for a review). For example, visual and physical interaction data yields more accurate haptic classification for objects [11], and non-visual sensory modalities (e.g., audio, haptics) coupled with exploratory actions (e.g., touch or grasp) have been shown useful for recognizing objects and their properties [5,10,15,24,30], as well as grounding natural language descriptors that people use to refer to objects [3,39]. More recently, researchers have developed end-to-end systems to enable robots to learn to perceive the environment and perform actions at the same time [20,42].…”
Section: Data Augmentation
confidence: 99%