2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2015
DOI: 10.1109/cvpr.2015.7299075
|View full text |Cite
|
Sign up to set email alerts
|

3D all the way: Semantic segmentation of urban scenes from start to end in 3D

Abstract: We propose a new approach for semantic segmentation of 3D city models. Starting from an SfM reconstruction of a street-side scene, we perform classification and facade splitting purely in 3D, obviating the need for slow imagebased semantic segmentation methods. We show that a properly trained pure-3D approach produces high quality labelings, with significant speed benefits (20x faster) allowing us to analyze entire streets in a matter of minutes. Additionally, if speed is not of the essence, the 3D labeling ca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
96
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 122 publications
(104 citation statements)
references
References 56 publications
0
96
0
Order By: Relevance
“…• Recent advances in augmented reality [392] and virtual reality [393]; developments in the fusion of computer graphics, GIS and BIM (e.g., [394][395][396][397][398][399]); and advances in procedural modelling [21,22,[400][401][402] appear as promising catalysts that will contribute to providing 3D city models to practitioners.…”
Section: Discussionmentioning
confidence: 99%
“…• Recent advances in augmented reality [392] and virtual reality [393]; developments in the fusion of computer graphics, GIS and BIM (e.g., [394][395][396][397][398][399]); and advances in procedural modelling [21,22,[400][401][402] appear as promising catalysts that will contribute to providing 3D city models to practitioners.…”
Section: Discussionmentioning
confidence: 99%
“…A CRF model is proposed where unary potentials are from random forest and Potts model is set to calculate pairwise potentials. Instead of assigning labels to image pixels, Martinović et al (2015) design a 3D pipeline to take advantages of 2D images and 3D point clouds from Structure from Motion for 3D labeling. Height, depth, normal vector and spin image descriptors at different scales are 3D features used in a random forest classifier.…”
Section: Related Workmentioning
confidence: 99%
“…Comparing with those traditional datasets that only include a single view for each façade, this point cloud provides additional 3D geometrical cues to solve the problem. Although some previous studies employ multi-view façade images for façade segmentation (Gadde et al, 2017;Martinović et al, 2015), they only focus on terrestrial images and no semantic segmentation has been done from airborne images. This work aims to explore the potential of airborne images to address the problem.…”
Section: Introductionmentioning
confidence: 99%
“…For Haussmannian architecture, strong priors or regular floors may be sufficient to model the buildings [47]. For more general architecture, more relaxed structural principles such as symmetries have to be used [33,41,32]. Further even, in the case of real cities with regular planar buildings and complex shapes like statues, a hybrid model [26] or a topology joining approach [31] may be applied.…”
Section: Related Workmentioning
confidence: 99%