2015 IEEE International Conference on Computer Vision (ICCV) 2015
DOI: 10.1109/iccv.2015.259
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Higher-Order Regression Forest Fields: An Application to 3D Indoor Scene Labelling

Abstract: This paper addresses the problem of semantic segmentation of 3D indoor scenes reconstructed from RGB-D images. Traditionally label prediction for 3D points is tackled by employing graphical models that capture scene features and complex relations between different class labels. However, the existing work is restricted to pairwise conditional random fields, which are insufficient when encoding rich scene context. In this work we propose models with higher-order potentials to describe complex relational informat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 22 publications
(12 citation statements)
references
References 28 publications
0
12
0
Order By: Relevance
“…3D point, patches). The graphical model like Conditional Random Fields (CRF) is always employed to capture scene features and different categories (Pham et al, 2015;Vosselman et al, 2017). This encoded 3D contextual information hinders its wide application to construct and optimize such a complex graphical model.…”
Section: Supervised Methodsmentioning
confidence: 99%
“…3D point, patches). The graphical model like Conditional Random Fields (CRF) is always employed to capture scene features and different categories (Pham et al, 2015;Vosselman et al, 2017). This encoded 3D contextual information hinders its wide application to construct and optimize such a complex graphical model.…”
Section: Supervised Methodsmentioning
confidence: 99%
“…In recent years, we have seen impressive progress in semantic aware reconstruction. Early works [13], [14], [15] use graphical models to assign semantic labels to a geometric reconstruction. SemanticFusion [16] employs a deep network to predict pixel-wise semantic labels given RGB frames, which are then fused into a semantic mapping by leveraging the geometric reconstruction from a RGBD SLAM.…”
Section: Related Workmentioning
confidence: 99%
“…State-of-the-art methods such as Mask-RCNN [1], YOLO9000 [3] fail to detect such unknown objects. This behavior is detrimental to the goal of robotic scene understanding that would ideally result in a semantically meaningful map [5]- [8] comprising all objects, environmental structures, and their various complex relations. The ability to extract information about objects (e.g., semantic classes and affordances [9]) and the scene geometry in complex environments under realistic, open-set conditions is increasingly important for robotics.…”
Section: Introductionmentioning
confidence: 99%
“…In addition, SceneCut reasons jointly over objects and scene geometry and therefore produces a segmentation of non-object surfaces such as ceilings and walls, as well as supporting surfaces such as floor or tabletops. Unlike methods [5], [10]- [12] that deal with geometric segmentation and object segmentation in isolation, our method simultaneously segments objects and planes of a scene in an unified formulation. SceneCut therefore ensures the segmentation of geometric surfaces is consistent with the discovered objects.…”
Section: Introductionmentioning
confidence: 99%