2017
DOI: 10.3390/rs9121312
|View full text |Cite
|
Sign up to set email alerts
|

Deformable ConvNet with Aspect Ratio Constrained NMS for Object Detection in Remote Sensing Imagery

Abstract: Convolutional neural networks (CNNs) have demonstrated their ability object detection of very high resolution remote sensing images. However, CNNs have obvious limitations for modeling geometric variations in remote sensing targets. In this paper, we introduced a CNN structure, namely deformable ConvNet, to address geometric modeling in object recognition. By adding offsets to the convolution layers, feature mapping of CNN can be applied to unfixed locations, enhancing CNNs' visual appearance understanding. In… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
61
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 104 publications
(68 citation statements)
references
References 27 publications
0
61
0
Order By: Relevance
“…The R-P-Faster R-CNN framework is developed in [14] for small objects. While both deformable convolution layers [6] and R-FCN are combined by [40] to improve detection accuracy. More recently, the authors in [40] adopt top-down and skipped connections to produce a single high-level feature map of a fine resolution, improving the performance of the deformable Faster R-CNN.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The R-P-Faster R-CNN framework is developed in [14] for small objects. While both deformable convolution layers [6] and R-FCN are combined by [40] to improve detection accuracy. More recently, the authors in [40] adopt top-down and skipped connections to produce a single high-level feature map of a fine resolution, improving the performance of the deformable Faster R-CNN.…”
Section: Related Workmentioning
confidence: 99%
“…While both deformable convolution layers [6] and R-FCN are combined by [40] to improve detection accuracy. More recently, the authors in [40] adopt top-down and skipped connections to produce a single high-level feature map of a fine resolution, improving the performance of the deformable Faster R-CNN. However such horizontal region based detectors still are confronted with the challenges for the aforementioned bottlenecks in terms of scale, orientation and density, which call for more principled methods beyond the setting for horizontal region detection.…”
Section: Related Workmentioning
confidence: 99%
“…As a result, the proposed SD-MS framework outperforms all comparison approaches for all ten classes of the NWPU VHR-10 dataset, which demonstrates the superiority of the proposed method compared with the eight other methods. In addition, to quantitatively evaluate the proposed SD-MS model, this study compared it with eight existing methods: rotation-invariant CNN (RICNN) [15], region proposal networks with faster R-CNN (R-P-faster R-CNN) (R-P-F-R-CNN ) [50], deformable R-FCN (D-R-FCN) [51], collection of part detectors (COPD) [11], position-sensitive balancing (PSB) [20], deformable faster R-CNN (D-F-R-CNN) [52], recurrent detection with activated semantics (RDAS512) [53], and multi-scale CNN (MS-CNN) [19]. As can be seen from Table 4, the proposed SD-MS obtains the best mAP.…”
Section: Experimental Results and Comparisonsmentioning
confidence: 99%
“…It will be done before it is applied to compensate radar image. For example, in other neural network related papers [24,35,37], they do not consider the training time when listing total performance. Thus, the time consumption of training procedure should not be considered in the total performance.…”
Section: Discussionmentioning
confidence: 99%