Pixel-Wise Classification Method for High Resolution Remote Sensing Imagery Using Deep Neural Networks

Guo, Rui; Li, Jianbo; Li, Na; Liu, Shibin; Fu, Chen; Cheng, Bo; Duan, Jianbo; Li, Xinpeng; Ma, Caihong

doi:10.3390/ijgi7030110

Cited by 42 publications

(14 citation statements)

References 50 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Marmanis [32] extracted scale-dependent class boundaries before each pooling level, with the class boundaries fused into the final multi-scale boundary prediction. Guo [33] extracted bounding boxes of potential ground objects which augmented the training dataset before training the DCNNs. Tian [34] presented DFCNet(Dense Fusion Classmate Network) which was jointly trained with auxiliary road dataset properly compensates the lack of mid-level information.…”

Section: Related Workmentioning

confidence: 99%

DV3+HED+: a DCNN-based framework to monitor temporary works and ESAs in railway construction project using VHR satellite images

Guo

Liu

et al. 2020

J. Appl. Rem. Sens.

Self Cite

View full text Add to dashboard Cite

Current VHR(Very High Resolution) satellite images enable the detailed monitoring of the earth and can capture the ongoing works of railway construction. In this paper, we present an integrated framework applied to monitoring the railway construction in China, using QuickBird, GF-2 and Google Earth VHR satellite images. We also construct a novel DCNNs-based (Deep Convolutional Neural Networks) semantic segmentation network to label the temporary works such as borrow & spoil area, camp, beam yard and ESAs(Environmental Sensitive Areas) such as resident houses throughout the whole railway construction project using VHR satellite images. In addition, we employ HED edge detection sub-network to refine the boundary details and attention cross entropy loss function to fit the sample class disequilibrium problem. Our semantic segmentation network is trained on 572 VHR true color images, and tested on the 15 QuickBird true color images along Ruichang-Jiujiang railway during 2015-2017. The experiment results show that compared with the existing state-of-the-art approach, our approach has obvious improvements with an overall accuracy of more than 80%.

show abstract

Section: Related Workmentioning

confidence: 99%

DV3+HED+: a DCNN-based framework to monitor temporary works and ESAs in railway construction project using VHR satellite images

Guo

Liu

et al. 2020

J. Appl. Rem. Sens.

Self Cite

View full text Add to dashboard Cite

show abstract

“…This method achieved high overall accuracy as well as good performance for small objects segmentation. Guo et al [19] exploited FCN with atrous convolution to perform semantic segmentation for high-resolution remote sensing images. They used graph-based segmentation and selective search method to augment the training data and conditional random fields(CRF) to refine the segmentation results.…”

Section: Related Workmentioning

confidence: 99%

A Dual-Path and Lightweight Convolutional Neural Network for High-Resolution Aerial Image Segmentation

Zhang

Leí

Cui

et al. 2019

IJGI

View full text Add to dashboard Cite

Semantic segmentation on high-resolution aerial images plays a significant role in many remote sensing applications. Although the Deep Convolutional Neural Network (DCNN) has shown great performance in this task, it still faces the following two challenges: intra-class heterogeneity and inter-class homogeneity. To overcome these two problems, a novel dual-path DCNN, which contains a spatial path and an edge path, is proposed for high-resolution aerial image segmentation. The spatial path, which combines the multi-level and global context features to encode the local and global information, is used to address the intra-class heterogeneity challenge. For inter-class homogeneity problem, a Holistically-nested Edge Detection (HED)-like edge path is employed to detect the semantic boundaries for the guidance of feature learning. Furthermore, we improve the computational efficiency of the network by employing the backbone of MobileNetV2. We enhance the performance of MobileNetV2 with two modifications: (1) replacing the standard convolution in the last four Bottleneck Residual Blocks (BRBs) with atrous convolution; and (2) removing the convolution stride of 2 in the first layer of BRBs 4 and 6. Experimental results on the ISPRS Vaihingen and Potsdam 2D labeling dataset show that the proposed DCNN achieved real-time inference speed on a single GPU card with better performance, compared with the state-of-the-art baselines.by these methods are not good at discriminating: (1) two objects which are classified into the same semantic label but with different appearances, named intra-class heterogeneity, as shown in Figure 1a, where the houses (or cars) have different shapes, sizes, and colors, but they belong to the same semantic label; and (2) two adjacent objects which are categorized into two different semantic labels but with similar appearances, named inter-class homogeneity, as shown in Figure 1b, where the low vegetation and trees are similar in colors, but their semantic labels are distinct. To tackle these two challenges, we need to consider each category of pixels as a whole, instead of assigning semantic label to each single pixel independently. To address the intra-class heterogeneity issue, we need to combine the multi-level and global context features to encode the local and global information, which can learn the discriminative and effective features to correctly categorize variant objects belonged to the same semantic label. Semantic boundaries can detect the feature variations on adjacent objects with similar appearance but different semantic labels. We can integrate it into the training process to help the network to learn the discriminative features to enlarge the inter-class differences. Based on the above two points, we propose a novel Deep Convolutional Neural Network (DCNN) that contains a spatial path and an edge path to tackle the problems of intra-class heterogeneity and inter-class homogeneity in high-resolution aerial images simultaneously.(a) intra-class heterogeneity (b) inter-class homogeneity

show abstract

“…FCN modifies CNN to obtain the classification results of each pixel and implement semantic segmentation. In general, CRF usually serves as a postprocessing method of FCN [17,18] to improve the segmentation results and DeepLab is a typical case. DeepLab performs semantic segmentation with atrous convolution, deep convolutional nets, and fully connected CRFs.…”

Section: Introductionmentioning

confidence: 99%

Edge Prior Multilayer Segmentation Network Based on Bayesian Framework

Shi

Fang

et al. 2020

Journal of Sensors

View full text Add to dashboard Cite

In recent years, methods based on neural network have achieved excellent performance for image segmentation. However, segmentation around the edge area is still unsatisfactory when dealing with complex boundaries. This paper proposes an edge prior semantic segmentation architecture based on Bayesian framework. The entire framework is composed of three network structures, a likelihood network and an edge prior network at the front, followed by a constraint network. The likelihood network produces a rough segmentation result, which is later optimized by edge prior information, including the edge map and the edge distance. For the constraint network, the modified domain transform method is proposed, in which the diffusion direction is revised through the newly defined distance map and some added constraint conditions. Experiments about the proposed approach and several contrastive methods show that our proposed method had good performance and outperformed FCN in terms of average accuracy for 0.0209 on ESAR data set.

show abstract

Pixel-Wise Classification Method for High Resolution Remote Sensing Imagery Using Deep Neural Networks

Cited by 42 publications

References 50 publications

DV3+HED+: a DCNN-based framework to monitor temporary works and ESAs in railway construction project using VHR satellite images

DV3+HED+: a DCNN-based framework to monitor temporary works and ESAs in railway construction project using VHR satellite images

A Dual-Path and Lightweight Convolutional Neural Network for High-Resolution Aerial Image Segmentation

Edge Prior Multilayer Segmentation Network Based on Bayesian Framework

Contact Info

Product

Resources

About