2022
DOI: 10.1109/lgrs.2022.3178479
|View full text |Cite
|
Sign up to set email alerts
|

Multiscale Deformable Attention and Multilevel Features Aggregation for Remote Sensing Object Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(11 citation statements)
references
References 19 publications
0
11
0
Order By: Relevance
“…LAG [ 24 ] proposes a hierarchical anchor generation algorithm that generates anchors in different layers based on the diagonal and aspect ratio of the object, making the anchors in each layer match better with the detection range of that layer. The authors of [ 25 ] proposed a new multi-scale deformable attention module and a multi-level feature aggregation module and inserts them into the feature pyramid network (FPN) to improve the detection performance of various shapes and sizes of remote sensing objects. RSADet [ 26 ] considers the spatial distribution, scale, and orientation changes of the objects in remote sensing images by introducing deformable convolution and a new bounding box confidence prediction branch.…”
Section: Related Workmentioning
confidence: 99%
“…LAG [ 24 ] proposes a hierarchical anchor generation algorithm that generates anchors in different layers based on the diagonal and aspect ratio of the object, making the anchors in each layer match better with the detection range of that layer. The authors of [ 25 ] proposed a new multi-scale deformable attention module and a multi-level feature aggregation module and inserts them into the feature pyramid network (FPN) to improve the detection performance of various shapes and sizes of remote sensing objects. RSADet [ 26 ] considers the spatial distribution, scale, and orientation changes of the objects in remote sensing images by introducing deformable convolution and a new bounding box confidence prediction branch.…”
Section: Related Workmentioning
confidence: 99%
“…UAV remote sensing images often contain noise originating from complex scenes, which can interfere with detection outcomes [2]. Additionally, the large feature map resolution of shallow networks tends to produce a lower level of feature abstraction and weaker semantic information, thereby containing more fine-grained details.…”
Section: Rfe and Csamentioning
confidence: 99%
“…In the domain of remote sensing image object detection, multi-scale features enhance the model's detection accuracy [2]. However, some semantic information is lost during the sampling operation performed for feature fusion.…”
Section: Mbus and Mbdsmentioning
confidence: 99%
See 1 more Smart Citation
“…In drone ground detection, single-modality data, such as RGB images [6][7][8], infrared images [9][10][11], and other spectral or radar data [12,13], are predominantly used. Multimodal data for target detection has received limited research [14,15].…”
Section: Introductionmentioning
confidence: 99%