Table-top scene analysis using knowledge-supervised MCMC

The International Journal of Robotics Research

Xiang

Jenkins

et al. 2017

Performing robust goal-directed manipulation tasks remains a crucial challenge for autonomous robots. In an ideal case, shared autonomous control of manipulators would allow human users to specify their intent as a goal state and have the robot reason over the actions and motions to achieve this goal. However, realizing this goal remains elusive due to the problem of perceiving the robot’s environment. We describe and address the problem of axiomatic scene estimation (AxScEs) for robot manipulation in cluttered scenes, which is the estimation of a tree-structured scene graph describing the configuration of objects observed from robot sensing. We propose generative approaches to inferring the robot’s environment as a scene graph: the axiomatic particle filter, and axiomatic scene estimation by a Markov chain Monte Carlo based sampler. The results of AxScEs estimation are axioms amenable to goal-directed manipulation through symbolic inference for task planning and collision-free motion planning and execution. We demonstrate the results for goal-directed manipulation of multi-object scenes by a PR2 robot.
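As a rough illustration of how a tree-structured scene graph can yield axioms for a task planner, the following minimal Python sketch may help. It is not the authors' implementation: the `SceneNode` class, the `to_axioms` function, the pose format, and the `on`/`clear` predicate names are all assumptions made for this example.

```python
from dataclasses import dataclass, field

@dataclass
class SceneNode:
    """One object in a hypothetical tree-structured scene graph."""
    name: str
    pose: tuple  # assumed (x, y, z, yaw); the pose format is illustrative only
    children: list = field(default_factory=list)  # objects supported by this one

def to_axioms(node):
    """Walk the tree depth-first and emit symbolic assertions.

    Each parent/child edge becomes on(child, parent); a node that
    supports nothing is asserted clear.
    """
    axioms = []
    for child in node.children:
        axioms.append(f"on({child.name}, {node.name})")
        axioms.extend(to_axioms(child))
    if not node.children:
        axioms.append(f"clear({node.name})")
    return axioms

# Hypothetical scene: block_c is stacked on block_a; block_b sits alone on the table.
block_c = SceneNode("block_c", (0.40, 0.10, 0.85, 0.0))
block_a = SceneNode("block_a", (0.40, 0.10, 0.80, 0.0), [block_c])
block_b = SceneNode("block_b", (0.55, -0.20, 0.80, 1.57))
scene = SceneNode("table", (0.0, 0.0, 0.75, 0.0), [block_a, block_b])

axioms = to_axioms(scene)
for axiom in axioms:
    print(axiom)
```

Assertions of this kind are the sort of symbolic state a task planner can consume, while the stored poses remain available for motion planning.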

Section: AxScEs Scene Estimation Methods (mentioning)

confidence: 99%

Goal-directed robot manipulation through axiomatic scene estimation

The International Journal of Robotics Research

Xiang

Jenkins

et al. 2017

“…Given observations of the scene, our work estimates a scene graph that represents the scene structure. Liu et al. [23] also estimate a scene graph given observations; however, their approach approximates objects as oriented bounding boxes. Sui et al. proposed a generative approach (AxMC) [35] for scene graph estimation and used Markov chain Monte Carlo (MCMC) to search for the best scene graph hypothesis that explains the observations.…”
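The idea of using MCMC to search for the hypothesis that best explains the observations can be sketched with a deliberately simplified Metropolis sampler. This is only a toy under stated assumptions: the hypothesis is a single 1D object position rather than a full scene graph, the Gaussian likelihood stands in for a comparison between rendered and observed data, and `log_likelihood`, `mcmc_search`, and all numeric values are hypothetical.

```python
import math
import random

def log_likelihood(x, observed_x, sigma=0.05):
    """Assumed Gaussian log-likelihood of hypothesis x given one observation."""
    return -((x - observed_x) ** 2) / (2 * sigma ** 2)

def mcmc_search(observed_x, steps=2000, step_size=0.1, seed=0):
    """Metropolis sampling over a 1D pose, tracking the best hypothesis visited."""
    rng = random.Random(seed)
    current = 0.0  # arbitrary initial hypothesis
    current_ll = log_likelihood(current, observed_x)
    best, best_ll = current, current_ll
    for _ in range(steps):
        # Symmetric Gaussian random-walk proposal
        proposal = current + rng.gauss(0.0, step_size)
        proposal_ll = log_likelihood(proposal, observed_x)
        # Accept with probability min(1, likelihood ratio)
        if rng.random() < math.exp(min(0.0, proposal_ll - current_ll)):
            current, current_ll = proposal, proposal_ll
            if current_ll > best_ll:
                best, best_ll = current, current_ll
    return best

best_x = mcmc_search(observed_x=0.7)
print(f"best hypothesis: x = {best_x:.3f}")
```

In a scene-graph setting the proposal step would instead perturb object poses or support relations, but the accept/reject logic would follow the same pattern.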

Section: B Scene Perception For Manipulation (mentioning)

confidence: 99%

Semantic Robot Programming for Goal-Directed Manipulation in Cluttered Scenes

Zeng

Zhou

2018 IEEE International Conference on Robotics and Automation (ICRA)

et al. 2018

We present the Semantic Robot Programming (SRP) paradigm as a convergence of robot programming by demonstration and semantic mapping. In SRP, a user can directly program a robot manipulator by demonstrating a snapshot of their intended goal scene in the workspace. The robot then parses this goal as a scene graph comprised of object poses and inter-object relations, assuming known object geometries. Task and motion planning is then used to realize the user's goal from an arbitrary initial scene configuration. Even when faced with different initial scene configurations, SRP enables the robot to seamlessly adapt to reach the user's demonstrated goal. For scene perception, we propose the Discriminatively-Informed Generative Estimation of Scenes and Transforms (DIGEST) method to infer the initial and goal states of the world from RGBD images. The efficacy of SRP with DIGEST perception is demonstrated for the task of tray-setting with a Michigan Progress Fetch robot. Scene perception and task execution are evaluated with a public household occlusion dataset and our cluttered scene dataset.

“…The efficiency of deterministic inference without its fragility to uncertainty [11], [12]. Generative-discriminative algorithms may be especially advantageous when exposed to adversarial attack, building on foundational ideas in this space [13], [14], [15], [16], [17]. Furthermore, we expect our approach will be more generally applicable to guard against broad categories of attack with a clear pathway for explainability of the resulting perceptual estimates.…”

Section: Introduction (mentioning)

confidence: 99%

GRIP: Generative Robust Inference and Perception for Semantic Robot Manipulation in Adversarial Environments

Chen

2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

et al. 2019

Recent advancements have led to a proliferation of machine learning systems used to assist humans in a wide range of tasks. However, we are still far from accurate, reliable, and resource-efficient operations of these systems. For robot perception, convolutional neural networks (CNNs) for object detection and pose estimation have recently come into widespread use. However, neural networks are known to suffer from overfitting during the training process and are less robust under unforeseen conditions (which makes them especially vulnerable to adversarial scenarios). In this work, we propose Generative Robust Inference and Perception (GRIP) as a two-stage object detection and pose estimation system that aims to combine the relative strengths of discriminative CNNs and generative inference methods to achieve robust estimation. Our results show that a second stage of sample-based generative inference is able to recover from false object detections by CNNs, and produce robust estimations in adversarial conditions. We demonstrate the efficacy of GRIP robustness through comparison with state-of-the-art learning-based pose estimators and pick-and-place manipulation in dark and cluttered environments.
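The two-stage idea, in which a generative second stage can overrule a wrong first-stage detection, can be illustrated with a very small toy. This is not the GRIP algorithm: the detector output is hard-coded, the "templates" are made-up three-value profiles standing in for rendered object hypotheses, and every name and number below is an assumption for illustration.

```python
# Hypothetical per-object templates standing in for rendered hypotheses.
TEMPLATES = {
    "mug": [0.2, 0.8, 0.2],
    "bowl": [0.6, 0.4, 0.6],
}

def stage_one_detections():
    """Stand-in for CNN output as (label, confidence); the top detection is deliberately wrong."""
    return [("mug", 0.9), ("bowl", 0.6)]

def generative_score(label, observation):
    """Negative squared error between a label's template and the observation (higher is better)."""
    template = TEMPLATES[label]
    return -sum((t - o) ** 2 for t, o in zip(template, observation))

def two_stage_estimate(observation):
    """Keep every stage-one candidate, then pick the one that best explains the observation."""
    candidates = stage_one_detections()
    best_label, _ = max(candidates, key=lambda c: generative_score(c[0], observation))
    return best_label

observation = [0.58, 0.42, 0.61]  # made-up measurement that resembles the bowl template
estimate = two_stage_estimate(observation)
print(estimate)
```

Here the first stage ranks "mug" highest, but the second stage selects "bowl" because its template matches the observation far better, which mirrors the recovery from false detections described in the abstract.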