2019
DOI: 10.1109/access.2019.2940527
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Building Extraction on High-Resolution Remote Sensing Imagery Using Deep Convolutional Encoder-Decoder With Spatial Pyramid Pooling

Abstract: Automatic extraction of buildings from remote sensing imagery plays a significant role in many applications, such as urban planning and monitoring changes to land cover. Various building segmentation methods have been proposed for visible remote sensing images, especially state-of-the-art methods based on convolutional neural networks (CNNs). However, high-accuracy building segmentation from high-resolution remote sensing imagery is still a challenging task due to the potentially complex texture of buildings i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
71
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 102 publications
(71 citation statements)
references
References 51 publications
0
71
0
Order By: Relevance
“…As presented in Table 6, the model with ASPP shows an obvious improvement over the model without ASPP across all evaluation metrics. The comparison result demonstrates the efficiency and applicability of the ASPP module as a connector for building extraction from high-resolution aerial images [38,71].…”
Section: B the Effect Of Atrous Spatial Pyramid Poolingmentioning
confidence: 76%
See 1 more Smart Citation
“…As presented in Table 6, the model with ASPP shows an obvious improvement over the model without ASPP across all evaluation metrics. The comparison result demonstrates the efficiency and applicability of the ASPP module as a connector for building extraction from high-resolution aerial images [38,71].…”
Section: B the Effect Of Atrous Spatial Pyramid Poolingmentioning
confidence: 76%
“…The DeepLab_v2 [37] employs atrous convolution and atrous spatial pyramid pooling (ASPP) to enlarge the receptive field on different levels. Liu et al [38] merged the spatial pyramid pooling module into the encoder-decoder architecture with a particular focus on building extraction. The JointNet [39] introduced a new, dense atrous convolution block combining a dense connectivity block and atrous convolution to obtain multi-scale features.…”
Section: Introductionmentioning
confidence: 99%
“…The improvement after the combination of the bright detector and the dark detector of the image is obvious. (Liu et al, 2019). All models used the same test data.…”
Section: Resultsmentioning
confidence: 99%
“…We further conducted a quantitative comparison with different models on the Inria Aerial Image Labeling Dataset. The literature (Liu et al, 2019) quantifies the performance of different methods including SegNet, FCN, U-Net, Tiramisu, FRRN, and USPP on the dataset, using consistent evaluation metrics and the same test data with our method. The results are summarized in Table 2.…”
Section: Resultsmentioning
confidence: 99%
“…3,4 The full convolutional network (FCN) 5 and its improved network have been applied for land cover classification tasks in high-resolution remote sensing images, which have achieved a certain effect. 6,7 However, the distortion caused by the convolution structure of upsampling and downsampling will inevitably lead to errors, such as edge blurring and holes. Therefore, FCNs often confuse categories and provide unclear boundaries, 8 thereby leading to poor performance in extracting land cover from dense and variable areas.…”
Section: Introductionmentioning
confidence: 99%