MST-UNet: a modified Swin Transformer for water bodies’ mapping using Sentinel-2 images

Li, Jiakai; Xie, Tong; Wu, Zebin

doi:10.1117/1.jrs.17.026507

Cited by 3 publications

(1 citation statement)

References 32 publications

(41 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Another study [16] uses Sentinel-1 images and Swin transformers to perform water body detection for agricultural reservoirs, while [17] compares Swin transformers with CNNs for wetland classification, using Sentinel-1 and Sentinel-2 images, demonstrating that the former outperforms the latter. Last but not least, [18] combines two models-a Swin transformer and a CNN-to perform water body mapping in remote sensing images. This literature review successfully demonstrates that vision models, especially vision transformers, can be used efficiently for flood detection, segmentation, and mapping.…”

Section: Introductionmentioning

confidence: 99%

Vision Transformer for Flood Detection Using Satellite Images from Sentinel-1 and Sentinel-2

Chamatidis,

Istrati,

Lagaros

2024

Water

View full text Add to dashboard Cite

Floods are devastating phenomena that occur almost all around the world and are responsible for significant losses, in terms of both human lives and economic damages. When floods occur, one of the challenges that emergency response agencies face is the identification of the flooded area so that access points and safe routes can be determined quickly. This study presents a flood detection methodology that combines transfer learning with vision transformers and satellite images from open datasets. Transformers are powerful models that have been successfully applied in Natural Language Processing (NLP). A variation of this model is the vision transformer (ViT), which can be applied to image classification tasks. The methodology is applied and evaluated for two types of satellite images: Synthetic Aperture Radar (SAR) images from Sentinel-1 and Multispectral Instrument (MSI) images from Sentinel-2. By using a pre-trained vision transformer and transfer learning, the model is fine-tuned on these two datasets to train the models to determine whether the images contain floods. It is found that the proposed methodology achieves an accuracy of 84.84% on the Sentinel-1 dataset and 83.14% on the Sentinel-2 dataset, revealing its insensitivity to the image type and applicability to a wide range of available visual data for flood detection. Moreover, this study shows that the proposed approach outperforms state-of-the-art CNN models by up to 15% on the SAR images and 9% on the MSI images. Overall, it is shown that the combination of transfer learning, vision transformers, and satellite images is a promising tool for flood risk management experts and emergency response agencies.

show abstract

Section: Introductionmentioning

confidence: 99%

Vision Transformer for Flood Detection Using Satellite Images from Sentinel-1 and Sentinel-2

Chamatidis,

Istrati,

Lagaros

2024

Water

View full text Add to dashboard Cite

show abstract

Combining topography and reflectance indices for better surface water detection

Hu,

Lee,

Paik

2024

Journal of Hydro-environment Research

View full text Add to dashboard Cite

Anomaly Detection Using Normalizing Flow-Based Density Estimation and Synthetic Defect Classification

Oh,

Kim

2024

IEEE Access

View full text Add to dashboard Cite

We propose a novel deep learning-based anomaly detection (AD) system that combines a pixelwise classification network with conditional normalizing flow (CNF) networks by sharing feature extractors. We trained the pixelwise classification network using synthetic abnormal data to fine-tune a pretrained feature extractor of the CNF networks, thereby learning the discriminative features of the indomain data. After that, we trained the CNF networks using normal data with the fine-tuned feature extractor to estimate the density of normal data. During inference, we detect anomalies by calculating the weighted average of the anomaly scores from the pixelwise classification and CNF networks. Because the proposed system not only has learned the properties of in-domain data but also aggregated the anomaly scores of the classification and CNF networks, it showed significantly improved performance compared to existing methods in experiments using the MvTecAD and BTAD datasets. Moreover, the proposed system does not increase computations intensively since the classification and the density estimation systems share feature extractors.

show abstract

MST-UNet: a modified Swin Transformer for water bodies’ mapping using Sentinel-2 images

Cited by 3 publications

References 32 publications

Vision Transformer for Flood Detection Using Satellite Images from Sentinel-1 and Sentinel-2

Vision Transformer for Flood Detection Using Satellite Images from Sentinel-1 and Sentinel-2

Combining topography and reflectance indices for better surface water detection

Anomaly Detection Using Normalizing Flow-Based Density Estimation and Synthetic Defect Classification

Contact Info

Product

Resources

About