2023
DOI: 10.3390/drones7070439
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Branch Parallel Networks for Object Detection in High-Resolution UAV Remote Sensing Images

Abstract: Uncrewed Aerial Vehicles (UAVs) are instrumental in advancing the field of remote sensing. Nevertheless, the complexity of the background and the dense distribution of objects both present considerable challenges for object detection in UAV remote sensing images. This paper proposes a Multi-Branch Parallel Network (MBPN) based on the ViTDet (Visual Transformer for Object Detection) model, which aims to improve object detection accuracy in UAV remote sensing images. Initially, the discriminative ability of the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 42 publications
0
2
0
Order By: Relevance
“…After the computation outlined in Equation ( 9), we obtain intermediate variables, as shown in Equation (10).…”
Section: 𝑓 = 𝛿(𝐹 ([𝑧 𝑧 ]))mentioning
confidence: 99%
See 1 more Smart Citation
“…After the computation outlined in Equation ( 9), we obtain intermediate variables, as shown in Equation (10).…”
Section: 𝑓 = 𝛿(𝐹 ([𝑧 𝑧 ]))mentioning
confidence: 99%
“…One of the primary challenges of UAV remote sensing imaging technology lies in the inherent variability in data acquisition conditions. Unlike fixed surveillance cameras, UAVs capture images from varying altitudes, angles, and distances, leading to significant geometric distortions and scale variations in the acquired imagery [10]. These variations pose substantial challenges for pedestrian detection algorithms, which must adapt to the diverse spatial resolutions and perspectives encountered in UAV imagery.…”
Section: Introductionmentioning
confidence: 99%
“…Anchor Frame Configuration. P2 (3,4), (6,5), (4,8), (11,6) P3 (6,12), (11,11), (10,20), (20,10) P4 (17,18), (31,16), (17,32) In the domain of small target detection in UAV imagery, a significant challenge involves effectively combining multi-scale features [20]. As shown in Figure 2, The original YOLOv5 algorithm used a cascade architecture comprising the feature pyramid network (FPN) [21] and pyramid attention network (PANet) [22] for feature fusion.…”
Section: Detection Branchmentioning
confidence: 99%
“…However, due to significant disparities in spatial and semantic information among feature maps at different levels, the fusion process easily introduced redundant information and noise, potentially leading to the loss of small object details in different levels. To address the problem of semantic disparities in feature maps at different levels, Wu et al [ 17 ], based on the use of a multi-branch parallel pyramid network, introduced a feature concatenation fusion module. Nevertheless, this method introduced a significant number of additional parameters, which consequently reduced detection speed.…”
Section: Introductionmentioning
confidence: 99%
“…Many excellent researchers are working to solve the difficulties of UAV object detection. Wu et al [20] proposed a multi-branch parallel network that utilizes multi-branch up-sampling and down-sampling to reduce information loss when the size of a feature map changes. Wang et al [21] added an ultra-lightweight subspace attention module (ULSAM) to a path aggregation network to highlight object features.…”
Section: Introductionmentioning
confidence: 99%