Multiple-Oriented and Small Object Detection with Convolutional Neural Networks for Aerial Image

Chen, Chao; Zhong, Jiandan; Tan, Yi

doi:10.3390/rs11182176

Cited by 35 publications

(20 citation statements)

References 56 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In (Chen et al, 2019), expect for minimizing the classification error, Cheng et al impose a rotationinvariant regularizer and a Fisher discrimination regularizer on the FC7 layer of VGGNet-16 (Simonyan & Zisserman, 2014) to enforce the CNN features to be rotation-invariant and have powerful discriminative capability. In (Li et al, 2020), an unified object detection framework is proposed for combining the RPN and the contextual feature fusion network to extract the proposals and to simultaneously locate the geospatial objects.…”

Section: Related Workmentioning

confidence: 99%

“…We now compare the performance of our method with the four state-of-the-art approaches Faster R-CNN (Ren et al, 2017), RIFD-CNN (Chen et al, 2019), YOLOv4 (Bochkovskiy et al, 2020), and R2CNN (Jiang et al, 2017). We present the detection results of different object detection algorithms on the two datasets in Tables 7 and Tables 8. Generally, we can see from the table that compared to other comparison algorithms, our R-FRCNN algorithm has the lowest MAR and FAR on both data sets, and its F1 index is the highest.…”

Section: Algorithm Comparisonmentioning

confidence: 99%

See 1 more Smart Citation

Arbitrary-angle bounding box based location for object detection in remote sensing image

Sun

Liu

et al. 2021

European Journal of Remote Sensing

View full text Add to dashboard Cite

Object location is a fundamental yet challenging problem in object detection. In the remote sensing image, different imaging projection directions make the same object have various rotation angles, and in some scenes, the object distribution is relatively dense. Most of the existing deep learning-based object detection algorithms utilize horizontal bounding box to locate objects, which causes inaccurate location of the objects with dense distribution or arbitrary direction, thus leading to the detection misses. In this paper, we propose an arbitraryangle bounding box based object location and embed it into the Faster R-CNN, developing a new framework called Rotated Faster R-CNN (R-FRCNN) for object detection in remote sensing image. In R-FRCNN, we specially improve anchor ratios to adapt to the objects like ship with large aspect ratio and increase the weights of the horizontal bounding box regression to reduce the interference of the arbitrary-angle bounding box on the horizontal bounding box prediction. Comprehensive experiments on a public dataset and a self-assembled dataset (which we make publically available) show the superior performance of our method compared to standalone state-of-the-art object detectors.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Algorithm Comparisonmentioning

confidence: 99%

Arbitrary-angle bounding box based location for object detection in remote sensing image

Sun

Liu

et al. 2021

European Journal of Remote Sensing

View full text Add to dashboard Cite

show abstract

“…In recent years, many rotation detectors have been proposed to introduce the additional orientation prediction to detect arbitrary-oriented objects in aerial images [8][9][10][11][12][13][14][15]. These detectors first densely preset a large number of prior boxes (also called anchors) to align with the ground-truth (GT) objects.…”

Section: Introductionmentioning

confidence: 99%

Sparse Label Assignment for Oriented Object Detection in Aerial Images

Miao

Zhou

et al. 2021

Remote Sensing

View full text Add to dashboard Cite

Object detection in aerial images has received extensive attention in recent years. The current mainstream anchor-based methods directly divide the training samples into positives and negatives according to the intersection-over-unit (IoU) of the preset anchors. This label assignment strategy assigns densely arranged samples for training, which leads to a suboptimal learning process and cause the model to suffer serious duplicate detections and missed detections. In this paper, we propose a sparse label assignment strategy (SLA) to select high-quality sparse anchors based on the posterior IoU of detections. In this way, the inconsistency between classification and regression is alleviated, and better performance can be achieved through balanced training. Next, to accurately detect small and densely arranged objects, we use a position-sensitive feature pyramid network (PS-FPN) with a coordinate attention module to extract position-sensitive features for accurate localization. Finally, the distance rotated IoU loss is proposed to eliminate the inconsistency between the training loss and the evaluation metric for better bounding box regression. Extensive experiments on the DOTA, HRSC2016, and UCAS-AOD datasets demonstrate the superiority of the proposed approach.

show abstract

“…Recently, many methods [21][22][23][24][25][26] have been proposed and try to solve the issue of small objects. However, in the UAV-captured image, the object detection based on deep learning still faces severe challenges.…”

Section: Introductionmentioning

confidence: 99%

Small-Object Detection in UAV-Captured Images via Multi-Branch Parallel Feature Pyramid Networks

2020

View full text Add to dashboard Cite

Small object is one of the primary challenges in the field of object detection, which is notably pronounced to the detection in the images from Unmanned Aerial Vehicles (UAV). Existing detectors based on deep-learning methods usually apply the feature extraction networks with a large down-sampling factor to obtain higher-level features. However, such big stride tends to make the feature information of small objects become the little point or even vanish in the low-resolution feature maps due to the limitation of pixels. Therefore, a novel structure called Multi-branch Parallel Feature Pyramid Networks (MPFPN) is proposed in this paper, which aims to extract more abundant feature information of the objects with a small size. Specifically, the parallel branch is designed to recover the features that missed in the deeper layers. Meanwhile, a supervised spatial attention module (SSAM) is applied to weaken the impact of background noise inference and focus object information. Furthermore, we adopt cascade architecture in the Fast R-CNN stage for a more powerful localization capability. Experiments on the public drone-based datasets named VisDrone-DET demonstrate that our method achieves competitive performance compared with other state-of-the-art detection algorithms.INDEX TERMS Unmanned aerial vehicle, object detection, multi-branch parallel feature pyramid networks (MPFPN), feature fusion, cascade architecture

show abstract

Multiple-Oriented and Small Object Detection with Convolutional Neural Networks for Aerial Image

Cited by 35 publications

References 56 publications

Arbitrary-angle bounding box based location for object detection in remote sensing image

Arbitrary-angle bounding box based location for object detection in remote sensing image

Sparse Label Assignment for Oriented Object Detection in Aerial Images

Small-Object Detection in UAV-Captured Images via Multi-Branch Parallel Feature Pyramid Networks

Contact Info

Product

Resources

About