2020
DOI: 10.3390/rs12050784
|View full text |Cite
|
Sign up to set email alerts
|

Extended Feature Pyramid Network with Adaptive Scale Training Strategy and Anchors for Object Detection in Aerial Images

Abstract: Multi-scale object detection is a basic challenge in computer vision. Although many advanced methods based on convolutional neural networks have succeeded in natural images, the progress in aerial images has been relatively slow mainly due to the considerably huge scale variations of objects and many densely distributed small objects. In this paper, considering that the semantic information of the small objects may be weakened or even disappear in the deeper layers of neural network, we propose a new detection… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(9 citation statements)
references
References 35 publications
0
9
0
Order By: Relevance
“…e process of the meta-testing stage is the same as that of meta-evaluation. Formulas (10) and (11) are used for meta-testing. e algorithm flow is shown in Figure 3.…”
Section: Meta-evaluation and Meta-testmentioning
confidence: 99%
See 1 more Smart Citation
“…e process of the meta-testing stage is the same as that of meta-evaluation. Formulas (10) and (11) are used for meta-testing. e algorithm flow is shown in Figure 3.…”
Section: Meta-evaluation and Meta-testmentioning
confidence: 99%
“…e characteristics of different layers of the deep convolutional neural network comprise different information. e bottom layer comprises high amounts of low-level visual information, whereas the top layer comprises semantic information [11,12]. Low-level visual information and semantic information are important in visual recognition tasks.…”
Section: Introductionmentioning
confidence: 99%
“…Regarding Semantic Segmentation [84,147,148,149], there are some use cases, such as building footprint extraction [11,12,150,13,14,15,16,17,18], road extraction [151,152,153] and land use and land cover (LULC) analysis [154,155]. • SpaceNet [158,159]: dataset with satellite imagery of the following cities: Rio de Janeiro, Las Vegas, Paris, Khartoum, and Shanghai.…”
Section: Applications On Remote Sensing and Examples Of Available Datasetsmentioning
confidence: 99%
“…Particularly in the context of semantic Segmentation, neural networks have achieved outstanding results [11,12,13,14,15,16,17,18]. Unlike traditional pixel-wise classification, semantic Segmentation using CNNs can preserve the object boundaries producing sharp, fine-scale Segmentation.…”
Section: Introductionmentioning
confidence: 99%
“…We also used data augmentation to balance their sizes. k-means [15] is then used to cluster different numbers of anchor boxes to find the optimised number and size for better results. Finally, the model is retrained by zoom out-category data.…”
Section: Pest Animal Annotation Model Training and Detectionmentioning
confidence: 99%