Big Map R-CNN for object detection in large-scale remote sensing images

Wang, Linfei; Tao, Dapeng; Wang, Ruonan; Wang, Ruxin; Li, Hao

doi:10.3934/mfc.2019019

Cited by 9 publications

(3 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Sampling focuses therefore on target objects only; no background or negative samples are needed, which is an advantage over other approaches in a time‐critical, operational setting. Mask R‐CNN has been successfully applied in similar satellite image analysis tasks, for example for sparse and multi‐sized object detection in VHR images (Wang, Tao, Wang, Wang, & Li, 2019), building extraction (Wen et al., 2019) or within the DeepGlobe Building Extraction Challenge (Zhao, Kang, Jung, & Sohn, 2018).…”

Section: Methodsmentioning

confidence: 99%

Mask R‐CNN‐based building extraction from VHR satellite data in operational humanitarian action: An example related to Covid‐19 response in Khartoum, Sudan

Tiede

Schwendemann²,

Alobaidi³

et al. 2021

Transactions in GIS

View full text Add to dashboard Cite

Within the constraints of operational work supporting humanitarian organizations in their response to the Covid‐19 pandemic, we conducted building extraction for Khartoum, Sudan. We extracted approximately 1.2 million dwellings and buildings, using a Mask R‐CNN deep learning approach from a Pléiades very high‐resolution satellite image with 0.5 m pixel resolution. Starting from an untrained network, we digitized a few hundred samples and iteratively increased the number of samples by validating initial classification results and adding them to the sample collection. We were able to strike a balance between the need for timely information and the accuracy of the result by combining the output from three different models, each aiming at distinctive types of buildings, in a post‐processing workflow. We obtained a recall of 0.78, precision of 0.77 and F 1 score of 0.78, and were able to deliver first results in only 10 days after the initial request. The procedure shows the great potential of convolutional neural network frameworks in combination with GIS routines for dwelling extraction even in an operational setting.

show abstract

Section: Methodsmentioning

confidence: 99%

Mask R‐CNN‐based building extraction from VHR satellite data in operational humanitarian action: An example related to Covid‐19 response in Khartoum, Sudan

Tiede

Schwendemann²,

Alobaidi³

et al. 2021

Transactions in GIS

View full text Add to dashboard Cite

show abstract

“…Current data augmentation for STD can be classified into two categories. The first category is to use image-level data augmentation, such as using random rotation, random flipping, random clipping, and color or contrast jitter on images [2][3] [4]. However, such a manner requires changing the content of the whole image and belongs to image-level variation; it ignores the possible usefulness of the instance-level variations.…”

Section: Introductionmentioning

confidence: 99%

Self-compositional data augmentation for scene text detection

Zhu¹,

Wang²,

Tao³

2023

International Conference on Cloud Computing, Performance Computing, and Deep Learning (CCPCDL 2023)

View full text Add to dashboard Cite

“…[Available]: http://jiong.tea.ac.cn/people/JunweiHan/NWPUVHR 10dataset.html2 Online. [Available]: https://github.com/RSIA-LIESMARS-WHU/RSOD-Dataset-3 Online. [Available]: http://www.escience.cn/people/gongcheng/DIOR.html University.…”

mentioning

confidence: 99%

A Contextual Bidirectional Enhancement Method for Remote Sensing Image Object Detection

Zhang

Xie

et al. 2020

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

View full text Add to dashboard Cite

In remote sensing images, the backgrounds of objects include crucial contextual information that may contribute to distinguishing objects. However, there are at least two issues that should be addressed: not all the backgrounds are beneficial, and object information may be suppressed by backgrounds. To address these problems, in this article, we propose the contextual bidirectional enhancement (CBD-E) method to simultaneously remove unexpected background information and enhance objects' features. CBD-E integrates the features of different background regions sequentially in two directions. On the one hand, a gate function is used to filter out unexpected information in the background and thus improve the recall of detection. On the other hand, a spatial-group-based visual attention mechanism is adopted to enhance the features of objects to reduce the false alarm. The gate function provides an approach to selecting meaningful information in the background, while the spatial-group-based visual attention mechanism enhances the information control ability of the gate function. In the experiments, we have validated the effectiveness of both the gate function and the visual attention mechanism and further demonstrated that the proposed contextual fusion strategy performs well on two published data sets.

show abstract

Big Map R-CNN for object detection in large-scale remote sensing images

Cited by 9 publications

References 37 publications

Mask R‐CNN‐based building extraction from VHR satellite data in operational humanitarian action: An example related to Covid‐19 response in Khartoum, Sudan

Mask R‐CNN‐based building extraction from VHR satellite data in operational humanitarian action: An example related to Covid‐19 response in Khartoum, Sudan

Self-compositional data augmentation for scene text detection

A Contextual Bidirectional Enhancement Method for Remote Sensing Image Object Detection

Contact Info

Product

Resources

About