2015
DOI: 10.1007/s11263-015-0843-8
|View full text |Cite
|
Sign up to set email alerts
|

Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images

Abstract: Inexpensive structured light sensors can capture rich information from indoor scenes, and scene labeling problems provide a compelling opportunity to make use of this information. In this paper we present a novel conditional random field (CRF) model to effectively utilize depth information for semantic labeling of indoor scenes. At the core of the model, we propose a novel and efficient plane detection algorithm which is robust to erroneous depth maps. Our CRF formulation defines local, pairwise and higher ord… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 22 publications
(16 citation statements)
references
References 49 publications
0
16
0
Order By: Relevance
“…IMLS systems, in addition to the point clouds, provide a continuous trajectory of device locations instead of few discrete station points in TLS. Current methods for indoor reconstruction and semantic labelling use mainly TLSs (Becker et al, 2015;Mura et al, 2014a;Oesau et al, 2014) or RGB-Depth data (Armeni et al, 2016;Khan et al, 2015). If MLS data is used as in (Xiao and Furukawa, 2014), the benefit of trajectory data is not exploited.…”
Section: Introductionmentioning
confidence: 99%
“…IMLS systems, in addition to the point clouds, provide a continuous trajectory of device locations instead of few discrete station points in TLS. Current methods for indoor reconstruction and semantic labelling use mainly TLSs (Becker et al, 2015;Mura et al, 2014a;Oesau et al, 2014) or RGB-Depth data (Armeni et al, 2016;Khan et al, 2015). If MLS data is used as in (Xiao and Furukawa, 2014), the benefit of trajectory data is not exploited.…”
Section: Introductionmentioning
confidence: 99%
“…Note that the 12-class accuracy of our network is calculated through the model previously trained for 37 classes. Our model substantially outperforms the one from [9] on large planar regions such as those labeled as floors and ceilings. This also results from the incorporated convolutional features and the fused global contexts.…”
Section: Results and Comparisonsmentioning
confidence: 90%
“…Specifically, kernel descriptions based on traditional multi-channel features, such as color, depth gradient, and surface normal, are used as photometric and depth features [24]. A rich feature set containing various traditional features, e.g., SIFT, HOG, LBP and plane orientation, are used as local appearance features and plane appearance features in [9]. HOG features of RGB images and HOG+HH (histogram of height) features of depth images are extracted as representations in [25] for training successive classifiers.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations