Single remote sensing image super-resolution (SRSI) algorithms based on generative adversarial networks have recently achieved breakthroughs, effectively learning local details to generate more realistic high-resolution remote sensing images (RSIs). However, most of them ignore the large size of RSIs and the small targets they contain, so edge details are lost in the generated images and the results appear strongly blurred. To solve these problems, this paper proposes an improved architecture named DAE²GAN based on an attention mechanism and a transformer. First, to process large-size RSIs, a vision transformer is chosen as the discriminator to compensate for the convolutional generator's lack of attention to global information. At the same time, to make the generator handle the small objects in RSIs better, channel attention is introduced to focus on high-frequency local contours. Then, an edge loss is designed to constrain the training process so that the edge details of the generated images are preserved more completely. Experiments show that the proposed method improves the visual reconstruction quality of SRSI more effectively, presenting clearer and richer detail, and that the PSNR and structural similarity (SSIM) of images reconstructed by DAE²GAN improve by up to 1.68 dB / 0.078 over existing mainstream methods. Therefore, the proposed DAE²GAN can efficiently assist various remote sensing tasks, such as urban road identification, agricultural monitoring, and geological exploration.
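The abstract does not give the exact formulation of the edge loss; as a rough sketch of the general idea only, and assuming a Sobel-gradient edge map (a hypothetical choice, not necessarily the authors' definition), an L1 penalty between the edge maps of the super-resolved and reference images could look like:

```python
import numpy as np

def sobel_edges(img):
    """Approximate edge-magnitude map via 3x3 Sobel gradients (valid convolution).

    Hypothetical edge extractor; the paper's actual edge operator may differ.
    """
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
    ky = kx.T
    h, w = img.shape
    gx = np.zeros((h - 2, w - 2))
    gy = np.zeros((h - 2, w - 2))
    for i in range(h - 2):
        for j in range(w - 2):
            patch = img[i:i + 3, j:j + 3]
            gx[i, j] = np.sum(patch * kx)  # horizontal gradient
            gy[i, j] = np.sum(patch * ky)  # vertical gradient
    return np.hypot(gx, gy)  # gradient magnitude

def edge_loss(sr, hr):
    """L1 distance between edge maps of the super-resolved (sr) and
    high-resolution reference (hr) images; added to the GAN objective to
    constrain training toward sharper, more complete edges."""
    return np.mean(np.abs(sobel_edges(sr) - sobel_edges(hr)))
```

In practice such a term would be implemented with batched convolutions in the training framework and weighted against the adversarial and pixel losses; the sketch above only illustrates the constraint itself.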