2018
DOI: 10.5194/isprs-archives-xlii-4-87-2018
|View full text |Cite
|
Sign up to set email alerts
|

Building Classification of VHR Airborne Stereo Images Using Fully Convolutional Networks and Free Training Samples

Abstract: <p><strong>Abstract.</strong> Semantic segmentation, especially for buildings, from the very high resolution (VHR) airborne images is an important task in urban mapping applications. Nowadays, the deep learning has significantly improved and applied in computer vision applications. Fully Convolutional Networks (FCN) is one of the tops voted method due to their good performance and high computational efficiency. However, the state-of-art results of deep nets depend on the training on large-sca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 9 publications
(7 reference statements)
0
3
0
Order By: Relevance
“…All images were also rescaled to 1024 × 1024 with a batch size of 2 for 100 epoches in FCN. The learning rate was set to 1e-5 according to (Chen et al, 2018). The same weight decay and momentum were defined as in the case of the Mask R-CNN.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…All images were also rescaled to 1024 × 1024 with a batch size of 2 for 100 epoches in FCN. The learning rate was set to 1e-5 according to (Chen et al, 2018). The same weight decay and momentum were defined as in the case of the Mask R-CNN.…”
Section: Methodsmentioning
confidence: 99%
“…Topographic maps have been used to generate training samples automatically for VHR images (Maggiori et al, 2017, Kaiser et al, 2017, Chen et al, 2018. However, the topographic maps are often mismatched with images due to time differences.…”
Section: Introductionmentioning
confidence: 99%
“…Since the appearance of a normal virtual model is similar to but not the same as a photograph, it is difficult to obtain highly accurate detection results in the real world even if the image is used for deep learning training. A method is also proposed for generating label images by using aerial photographs and map information (Chen et al 2018). In this research, we aim to improve the representation of the VR models by using textured aerial photographs on 3D models.…”
Section: Introduction Backgroundmentioning
confidence: 99%