Hierarchical Semantic Labeling for Task-Relevant RGB-D Perception

Wu, Chenxia; Lenz, Ian; Saxena, Ashutosh

doi:10.15607/rss.2014.x.006

Cited by 64 publications

(49 citation statements)

References 42 publications

“…Many variants augment CRFs with latent variables in order to model hidden states, such as latent CRFs [33] that have been applied to object recognition [36,14,42], scene understanding [35], gesture recognition [40] and grounding natural language to robotic tasks [29]. However, in these models, the predefined latent space is discrete and small to keep the learning and inference tractable.…”

Section: Related Work (mentioning)

confidence: 99%

Modeling High-Dimensional Humans for Activity Anticipation using Gaussian Process Latent CRFs

Jiang; Saxena (2014)

Robotics: Science and Systems X

Self Cite


Abstract: For robots, the ability to model human configurations and temporal dynamics is crucial for the task of anticipating future human activities, yet requires conflicting properties: On one hand, we need a detailed high-dimensional description of human configurations to reason about the physical plausibility of the prediction; on the other hand, we need a compact representation to be able to parsimoniously model the relations between the human and the environment. We therefore propose a new model, GP-LCRF, which admits both the high-dimensional and low-dimensional representation of humans. It assumes that the high-dimensional representation is generated from a latent variable corresponding to its low-dimensional representation using a Gaussian process. The generative process not only defines the mapping function between the high- and low-dimensional spaces, but also models a distribution of humans embedded as a potential function in GP-LCRF along with other potentials to jointly model the rich context among humans, objects and the activity. Through extensive experiments on activity anticipation, we show that our GP-LCRF consistently outperforms the state-of-the-art results and reduces the predicted human trajectory error by 11.6%.

“…In the area of computer vision, some works have considered relating phrases and attributes to images and videos [39,15,26,25,21,50]. These works focus primarily on labeling the image/video by modeling the rich perceptual data rather than modeling the relations in the language and the entities in the environment.…”

Section: Related Work (mentioning)

confidence: 99%

Tell Me Dave: Context-Sensitive Grounding of Natural Language to Manipulation Instructions

Misra; Sung; Lee; et al. (2014)

Robotics: Science and Systems X

Self Cite


Abstract: We consider performing a sequence of mobile manipulation tasks with instructions given in natural language (NL). Given a new environment, even a simple task such as boiling water would be performed quite differently depending on the presence, location and state of the objects. We start by collecting a dataset of task descriptions in free-form natural language and the corresponding grounded task-logs of the tasks performed in an online robot simulator. We then build a library of verb-environment-instructions that represents the possible instructions for each verb in that environment; these may or may not be valid for a different environment and task context. We present a model that takes into account the variations in natural language, and ambiguities in grounding them to robotic instructions with appropriate environment context and task constraints. Our model also handles incomplete or noisy NL instructions. Our model is based on an energy function that encodes such properties in a form isomorphic to a conditional random field. In evaluation, we show that our model produces sequences that perform the task successfully in a simulator and also significantly outperforms the state of the art. We also verify by executing our output instruction sequences on a PR2 robot.


“…While most methods use 2D visual information only [2], there are numerous 3D shape based recognition techniques [3,4], as well as methods that use both visual and shape information [5,6]. Object detection methods are essential for scene understanding [7], which has a number of applications in different fields, such as robotics [1] or augmented reality [8].…”

Section: Introduction (mentioning)

confidence: 99%

3D Object Detection and Scene Optimization for Tangible Augmented Reality

Szemenyei; Vajda (2018)

Period. Polytech. Elec. Eng. Comp. Sci.


Object recognition in 3D scenes is one of the fundamental tasks in computer vision. It is used frequently in robotics and augmented reality applications [1]. In our work we intend to apply 3D shape recognition to create a Tangible Augmented Reality system that is able to pair virtual and real objects in natural indoor scenes. In this paper we present a method for arranging virtual objects in a real-world scene based on primitive shape graphs. For our scheme, we propose a graph node embedding algorithm for graphs with vectorial nodes and edges, and genetic operators designed to improve the quality of the global setup of virtual objects. We show that our methods improve the quality of the arrangement significantly.
