2022
DOI: 10.1109/access.2022.3193248
|View full text |Cite
|
Sign up to set email alerts
|

A Semantic Segmentation Method for Remote Sensing Images Based on the Swin Transformer Fusion Gabor Filter

Abstract: Semantic segmentation of remote sensing images is increasingly important in urban planning, autonomous driving, disaster monitoring, and land cover classification. With the development of high-resolution remote sensing satellite technology, multilevel, large-scale, and high-precision segmentation has become the focus of current research. High-resolution remote sensing images have high intraclass diversity and low interclass separability, which pose challenges to the precision of the detailed representation of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 20 publications
(15 citation statements)
references
References 69 publications
(79 reference statements)
0
8
0
Order By: Relevance
“…Because self-attention can calculate within the window, its computational complexity increases linearly with the size of the picture rather than quadratic. Therefore, in RS semantic segmentation, it is widely used (Panboonyuen et al, 2021;Xu et al, 2021;Feng et al, 2022;Gu et al, 2022;Liu Y. et al, 2022;Meng et al, 2022;Xu Y. et al, 2022). ST first used the module to segment the data into many non-overlapping different patches.…”
Section: Transformer-based Methodsmentioning
confidence: 99%
“…Because self-attention can calculate within the window, its computational complexity increases linearly with the size of the picture rather than quadratic. Therefore, in RS semantic segmentation, it is widely used (Panboonyuen et al, 2021;Xu et al, 2021;Feng et al, 2022;Gu et al, 2022;Liu Y. et al, 2022;Meng et al, 2022;Xu Y. et al, 2022). ST first used the module to segment the data into many non-overlapping different patches.…”
Section: Transformer-based Methodsmentioning
confidence: 99%
“…The Detection Transformer (DEtection TRansformer, DE-TR) [40], [41] with an ensemble global loss that makes predictions through bilateral match and a classical encoderdecoder architecture, which containing three components: a CNN based backbone to extracte feature representations, a Transformer pretraining model to enhance features, and a simple feedforward network (FFN) for performing the object detection prediction.The detail structure is shown as Figure 3. Starting from an initial image x img ∈ R 3×H0×W0 (3 color channels, To batch the input images together with sufficient 0 padding to have the same dimension (H 0 ,W 0 ) as the largest image in same batch), a convolutional network then to generate a activation map f ∈ R C×H×W with lower resolution.…”
Section: A Real-time Target Detection Based On Transformermentioning
confidence: 99%
“…For a decoder, it transforms N embeddings with size d by multi-head attention mechanism. The authors in [40] adopted an auto-regressive model to predict one element of the output sequence at once. Because the decoder is also permutationindependent (order-independent), hence thedifferent results will be produced according to N input embeddings.…”
Section: A Real-time Target Detection Based On Transformermentioning
confidence: 99%
“…Remote sensing image 1 4 object detection refers to the automatic identification and localization of target objects of interest in remote sensing images. It is a fundamental and crucial task in optical remote sensing image processing.…”
Section: Introductionmentioning
confidence: 99%