2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016
DOI: 10.1109/cvpr.2016.94
|View full text |Cite
|
Sign up to set email alerts
|

Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images

Abstract: We focus on the task of amodal 3D object detection in RGB-D images, which aims to produce a 3D bounding box of an object in metric form at its full extent. We introduce Deep Sliding Shapes, a 3D ConvNet formulation that takes a 3D volumetric scene from a RGB-D image as input and outputs 3D object bounding boxes. In our approach, we propose the first 3D Region Proposal Network (RPN) to learn objectness from geometric shapes and the first joint Object Recognition Network (ORN) to extract geometric features in 3D… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
566
0
1

Year Published

2017
2017
2018
2018

Publication Types

Select...
4
3
3

Relationship

1
9

Authors

Journals

citations
Cited by 648 publications
(568 citation statements)
references
References 26 publications
(70 reference statements)
1
566
0
1
Order By: Relevance
“…Sliding Shapes [30]). Because the depth from these sensors is very reliable, 3D shape can play a more important role in a recognition pipeline.…”
Section: D Shapenetsmentioning
confidence: 99%
“…Sliding Shapes [30]). Because the depth from these sensors is very reliable, 3D shape can play a more important role in a recognition pipeline.…”
Section: D Shapenetsmentioning
confidence: 99%
“…On the contrary, our approach is designed towards indoor scenes with no prior training or knowledge about objects we detect in the scenes. Song and Xiao [10], [11] train exemplar-SVMs on synthetic data on hundreds of rendered views. They slide these exemplars in 3D space to search for objects by densely evaluating 3D windows.…”
Section: Related Workmentioning
confidence: 99%
“…Other approaches exist which aim to provide 3D completion and classification [35], [34], although only Wu et al [35] uses a single representation for both tasks and is able to do partial-object classification. Amodal perception approaches have also been proposed [28]. While these methods do not provide detailed 3D completions, they do estimate a coarse 3D bounding box from a 2.5D image.…”
Section: D Object Completionmentioning
confidence: 99%