2023
DOI: 10.1117/1.jrs.17.026507
|View full text |Cite
|
Sign up to set email alerts
|

MST-UNet: a modified Swin Transformer for water bodies’ mapping using Sentinel-2 images

Abstract: Deep learning is widely used in remote sensing field of feature recognition.Symmetric encoder-decoder network, such as UNet, is one of the most commonly used image segmentation networks, but the accuracy is often low due to its simple structure. We combine two neural network models of convolutional neural network (CNN) and Swin Transformer called modified Swin Transformer using UNet structure (MST-UNet) to achieve accurate segmentation of water bodies from remote sensing data, with Xiamen City as study area. M… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 32 publications
(41 reference statements)
0
0
0
Order By: Relevance
“…Another study [16] uses Sentinel-1 images and Swin transformers to perform water body detection for agricultural reservoirs, while [17] compares Swin transformers with CNNs for wetland classification, using Sentinel-1 and Sentinel-2 images, demonstrating that the former outperforms the latter. Last but not least, [18] combines two models-a Swin transformer and a CNN-to perform water body mapping in remote sensing images. This literature review successfully demonstrates that vision models, especially vision transformers, can be used efficiently for flood detection, segmentation, and mapping.…”
Section: Introductionmentioning
confidence: 99%
“…Another study [16] uses Sentinel-1 images and Swin transformers to perform water body detection for agricultural reservoirs, while [17] compares Swin transformers with CNNs for wetland classification, using Sentinel-1 and Sentinel-2 images, demonstrating that the former outperforms the latter. Last but not least, [18] combines two models-a Swin transformer and a CNN-to perform water body mapping in remote sensing images. This literature review successfully demonstrates that vision models, especially vision transformers, can be used efficiently for flood detection, segmentation, and mapping.…”
Section: Introductionmentioning
confidence: 99%