2023
DOI: 10.1109/tgrs.2022.3194732
|View full text |Cite
|
Sign up to set email alerts
|

RingMo: A Remote Sensing Foundation Model With Masked Image Modeling

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 78 publications
(57 citation statements)
references
References 113 publications
0
23
0
Order By: Relevance
“…In this work, we propose a methodology to significantly improve generalizability of DL natural hazards mappers based on pre-training on a suitable pre-task. Our approach supports the development of foundation models for earth monitoring, such as [9], with the objective of directly segmenting unseen natural hazards across unseen geographic regions. Our contributions are as follows: First, we demonstrate across four U-Net architectures that our approach significantly improves the generalizability of DL models for the segmentation of unseen natural hazards.…”
Section: Motivationmentioning
confidence: 99%
“…In this work, we propose a methodology to significantly improve generalizability of DL natural hazards mappers based on pre-training on a suitable pre-task. Our approach supports the development of foundation models for earth monitoring, such as [9], with the objective of directly segmenting unseen natural hazards across unseen geographic regions. Our contributions are as follows: First, we demonstrate across four U-Net architectures that our approach significantly improves the generalizability of DL models for the segmentation of unseen natural hazards.…”
Section: Motivationmentioning
confidence: 99%
“…SatMAE [23] leveraged temporal and multi-spectral information in RS images to improve self-supervised pre-training with MIM. RingMo [14] applied the MAE [29] method and designed a new mask strategy for self-supervised representation learning on a 3 million unlabeled RS images dataset. The fine-tuning results on various downstream tasks showed that the new mask strategy was more appropriate for RS images and the learned representations by RingMo were generalized well to various RS downstream tasks.…”
Section: Self-supervised Learning In Remote Sensingmentioning
confidence: 99%
“…2) Adaptation for RS Images: Although the simplicity and effectiveness of SimMIM, there are some limitations must be taken into account when applying SimMIM into RS images. One issue is that SimMIM replaces masked patches with the [MASK] token, but RS images are known for their multiobject characteristics [36] and the objects are usually densely distributed [14]. The masking operation may cause the dense and small objects in the image to be lost [14], leading to incomplete semantic meaning and making image reconstruction more difficult [14].…”
Section: B Masked Image Modeling Branchmentioning
confidence: 99%
See 2 more Smart Citations