A Semantic Segmentation Method for Remote Sensing Images Based on the Swin Transformer Fusion Gabor Filter

Feng, Dongdong; Zhang, Zhihua; Yan, Kun

doi:10.1109/access.2022.3193248

Cited by 20 publications

(15 citation statements)

References 69 publications

(79 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Because self-attention can calculate within the window, its computational complexity increases linearly with the size of the picture rather than quadratic. Therefore, in RS semantic segmentation, it is widely used (Panboonyuen et al, 2021;Xu et al, 2021;Feng et al, 2022;Gu et al, 2022;Liu Y. et al, 2022;Meng et al, 2022;Xu Y. et al, 2022). ST first used the module to segment the data into many non-overlapping different patches.…”

Section: Transformer-based Methodsmentioning

confidence: 99%

Deep learning-based semantic segmentation of remote sensing images: a review

Shen

et al. 2023

Front. Ecol. Evol.

View full text Add to dashboard Cite

Semantic segmentation is a fundamental but challenging problem of pixel-level remote sensing (RS) data analysis. Semantic segmentation tasks based on aerial and satellite images play an important role in a wide range of applications. Recently, with the successful applications of deep learning (DL) in the computer vision (CV) field, more and more researchers have introduced and improved DL methods to the task of RS data semantic segmentation and achieved excellent results. Although there are a large number of DL methods, there remains a deficiency in the evaluation and advancement of semantic segmentation techniques for RS data. To solve the problem, this paper surveys more than 100 papers in this field in the past 5 years and elaborates in detail on the aspects of technical framework classification discussion, datasets, experimental evaluation, research challenges, and future research directions. Different from several previously published surveys, this paper first focuses on comprehensively summarizing the advantages and disadvantages of techniques and models based on the important and difficult points. This research will help beginners quickly establish research ideas and processes in this field, allowing them to focus on algorithm innovation without paying too much attention to datasets, evaluation indicators, and research frameworks.

show abstract

Section: Transformer-based Methodsmentioning

confidence: 99%

Deep learning-based semantic segmentation of remote sensing images: a review

Shen

et al. 2023

Front. Ecol. Evol.

View full text Add to dashboard Cite

show abstract

“…The Detection Transformer (DEtection TRansformer, DE-TR) [40], [41] with an ensemble global loss that makes predictions through bilateral match and a classical encoderdecoder architecture, which containing three components: a CNN based backbone to extracte feature representations, a Transformer pretraining model to enhance features, and a simple feedforward network (FFN) for performing the object detection prediction.The detail structure is shown as Figure 3. Starting from an initial image x img ∈ R 3×H0×W0 (3 color channels, To batch the input images together with sufficient 0 padding to have the same dimension (H 0 ,W 0 ) as the largest image in same batch), a convolutional network then to generate a activation map f ∈ R C×H×W with lower resolution.…”

Section: A Real-time Target Detection Based On Transformermentioning

confidence: 99%

“…For a decoder, it transforms N embeddings with size d by multi-head attention mechanism. The authors in [40] adopted an auto-regressive model to predict one element of the output sequence at once. Because the decoder is also permutationindependent (order-independent), hence thedifferent results will be produced according to N input embeddings.…”

Section: A Real-time Target Detection Based On Transformermentioning

confidence: 99%

A Robust Visual SLAM Method for Additive Manufacturing of Vehicular Parts Under Dynamic Scenes

Wen-bo

Fan

et al. 2023

IEEE Access

View full text Add to dashboard Cite

“…Remote sensing image 1 – 4 object detection refers to the automatic identification and localization of target objects of interest in remote sensing images. It is a fundamental and crucial task in optical remote sensing image processing.…”

Section: Introductionmentioning

confidence: 99%

GODANet: an object detection model for remote sensing images fusing contextual information and dynamic convolution

Rong,

Zhang,

Yuan

et al. 2024

J. Appl. Rem. Sens.

Self Cite

View full text Add to dashboard Cite

Remote sensing images are characterized by complex feature backgrounds and large target scale differences, so object detection for remote sensing images is a challenging problem. This work proposes a one-stage structure remote sensing image object detection model called GODANet. First, the GODANet incorporates a Global Context Network (GCNet) in the feature extraction structure. The GCNet focuses the model on the image region of interest from a global perspective. Second, the output layer utilizes an omni-dimensional dynamic convolution technique, allowing for more flexible adaptation to targets or edges in specific regions. Finally, an adaptive spatial feature fusion structure, IR-ASFF, which fuses improved-RFB (IRFB) modules is proposed to fuse the critical information of multiple levels of features to realize the adaptability to object detection at different scales. The GODANet efficiently aggregates network performance and possesses two main advantages: adaptability to multi-scale targets and focus on features of interest. The mean average precision (mAP) on the DIOR dataset and the NWPU VHR-10 dataset reached 93.7% and 92.9%, respectively, and compared with YOLOv7, the mAP was improved by 3.1% and 0.3%, respectively. Therefore, we believe the GODANet suits remote sensing image detection tasks.

show abstract

A Semantic Segmentation Method for Remote Sensing Images Based on the Swin Transformer Fusion Gabor Filter

Cited by 20 publications

References 69 publications

Deep learning-based semantic segmentation of remote sensing images: a review

Deep learning-based semantic segmentation of remote sensing images: a review

A Robust Visual SLAM Method for Additive Manufacturing of Vehicular Parts Under Dynamic Scenes

GODANet: an object detection model for remote sensing images fusing contextual information and dynamic convolution

Contact Info

Product

Resources

About