2023
DOI: 10.1016/j.imavis.2023.104697
|View full text |Cite
|
Sign up to set email alerts
|

Improved YOLOX-X based UAV aerial photography object detection algorithm

Xin Wang,
Ning He,
Chen Hong
et al.
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 26 publications
(3 citation statements)
references
References 49 publications
0
3
0
Order By: Relevance
“…The neck network structure mainly uses the Feature Pyramid Network (FPN) [33] and the Pyramid Attention Network (PAN) [34]. FPN adopts the top-down paths and lateral connections and fuses the underlying high-resolution features with the top-level semantic information.…”
Section: Yolov5smentioning
confidence: 99%
“…The neck network structure mainly uses the Feature Pyramid Network (FPN) [33] and the Pyramid Attention Network (PAN) [34]. FPN adopts the top-down paths and lateral connections and fuses the underlying high-resolution features with the top-level semantic information.…”
Section: Yolov5smentioning
confidence: 99%
“…Wu et al [20] proposed a multi-branch parallel network that utilizes multi-branch up-sampling and down-sampling to reduce information loss when the size of a feature map changes. Wang et al [21] added an ultra-lightweight subspace attention module (ULSAM) to a path aggregation network to highlight object features. Huang et al [22] proposed a feature-guided enhancement (FGE) module that designs two nonlinear operators to learn discriminant information.…”
Section: Introductionmentioning
confidence: 99%
“…Although data augmentation improved the detection of small objects to some extent, it merely increased the proportion of small objects in the data, lacking the integration and utilization of semantic information. Wang et al [ 15 ] introduced the Ultra-lightweight Subspace Attention Module (ULSAM) into the network structure, with an emphasis on target features and the attenuation of background features. However, this module primarily incorporated spatial information, neglecting channel information, and resulting in suboptimal small object detection performance, especially in densely occluded scenes.…”
Section: Introductionmentioning
confidence: 99%