2023
DOI: 10.1109/jstars.2022.3230797

Dual-Resolution and Deformable Multihead Network for Oriented Object Detection in Remote Sensing Images

Abstract: Compared with general object detection, the scale variations, arbitrary orientations, and complex backgrounds of objects in remote sensing images make it more challenging to detect oriented objects. Especially for oriented objects that have large aspect ratios, it is more difficult to accurately detect their boundary. Many methods show excellent performance on oriented object detection, most of which are anchor-based algorithms. To mitigate the performance gap between anchor-free algorithms and anchor-based al…

citations
Cited by 3 publications
(2 citation statements)
references
References 67 publications
(98 reference statements)
“…Meanwhile, attention mechanism, which focuses on important features and suppresses unnecessary ones, has been widely integrated in CNNs, especially in U-Net like or other variants of the encoder-decoder architecture, for improving the representation of interests and the segmentation results. For example, Cui et al [23] created a reverse attention module that suppresses seawater features, enabling the learning characteristics for both apparent and inapparent aquaculture sites. Qin et al [24] embedded the convolutional block attention module (CBAM) [25] into the decoder of the network they proposed to gain accurate feature maps for offshore farm extraction, etc.…”
Section: Introduction
Citation type: mentioning
confidence: 99%
“…Different methods have been developed to address these problems. Yu et al [15] employed deformable convolution to align feature maps of different scales, and designed a feature fusion module using dilated convolution to enhance the perception of object shape and direction. Hou et al [16] designed an asymmetric feature pyramid network to enrich the spatial representation of features and improve the detection of objects with extreme aspect ratios.…”
Section: Introduction
Citation type: mentioning
confidence: 99%