2018
DOI: 10.3390/app8050813
|View full text |Cite
|
Sign up to set email alerts
|

Small Object Detection in Optical Remote Sensing Images via Modified Faster R-CNN

Abstract: The PASCAL VOC Challenge performance has been significantly boosted by the prevalently CNN-based pipelines like Faster R-CNN. However, directly applying the Faster R-CNN to the small remote sensing objects usually renders poor performance. To address this issue, this paper investigates on how to modify Faster R-CNN for the task of small object detection in optical remote sensing images. First of all, we not only modify the RPN stage of Faster R-CNN by setting appropriate anchors but also leverage a single high… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
122
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 218 publications
(123 citation statements)
references
References 29 publications
1
122
0
Order By: Relevance
“…This deletes most of the cobbles and fine boulder information from the data if large scale mosaics were fed directly into the network. Hence, the smaller tiles exported from the backscatter mosaics were upscaled to values between 300 and 1200 pixels, which is the simplest approach to facilitate small object detection [30,31]. The size of anchor boxes used to determine the bounding box of objects were left at their standard settings of 32, 64, 128, For classification and object detection, we use an open source RetinaNet [27] implementation in Python, available on GitHub (https://github.com/fizyr/keras-retinanet, last accessed on 6 February 2019).…”
Section: Preparation Of Train Validation and Test Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…This deletes most of the cobbles and fine boulder information from the data if large scale mosaics were fed directly into the network. Hence, the smaller tiles exported from the backscatter mosaics were upscaled to values between 300 and 1200 pixels, which is the simplest approach to facilitate small object detection [30,31]. The size of anchor boxes used to determine the bounding box of objects were left at their standard settings of 32, 64, 128, For classification and object detection, we use an open source RetinaNet [27] implementation in Python, available on GitHub (https://github.com/fizyr/keras-retinanet, last accessed on 6 February 2019).…”
Section: Preparation Of Train Validation and Test Datasetsmentioning
confidence: 99%
“…Therefore, it is mandatory to detect objects of the smallest possible size and to consider the minimum object size detectable by the trained models. The minimum size of objects whose detection can be trained by RetinaNet depends on a) the resolution of the input backscatter mosaic and b) the minimum anchor box of the network measured in pixels multiplied by the threshold of areal overlap of 0.5 required for a positive training [30,31]. For a minimum anchor box of 32 pixels, this results in a theoretical minimum threshold for positive training of 23 × 23 pixels.…”
Section: Constraining the Minimum Size Of Detected Bouldersmentioning
confidence: 99%
“…Two-stage method originated from R-CNN [33], then successively arise Fast R-CNN [34] and Faster R-CNN [28]. R-CNN is the first object detection framework based on deep convolutional neural networks [35], which uses the selective search algorithm (SS) to extract the candidate regions and computes features by CNN. A set of class-specific linear SVMs [36] and regressors are used to classify and fine-tune the bounding boxes, respectively.…”
Section: Introductionmentioning
confidence: 99%
“…Region proposal-based detection networks have been extensively applied in the target detection field, including Region-Convolutional Neural Networks (R-CNN) [16], Fast-Region Convolutional Neural Networks (Fast-RCNN) [17], and Faster-Region Convolutional Neural Networks (Faster-RCNN) [18]. Through analyzing these works, scientists put forward some small target detection algorithms [19][20][21]. Moreover, Cai et al [14] proposed a Multi-scale Deep Convolutional Neural Network (MS-CNN) which predicts objects at different layers of the feature hierarchy.…”
Section: Introductionmentioning
confidence: 99%