Single-Image Super-Resolution (SISR) has always been an important topic in the field of image processing, which attempts to improve the image resolution and is of great significance in practice. Recently, SISR has made substantial progress aided by deep learning (DL), which has demonstrated impressive potential in many low-level tasks. In the current DL-based SISR approaches, most of them are based on supervised learning. However, in the real world, only low-resolution (LR) images with unknown degradation are provided, which limit the application of current supervised models. To mitigate this problem, in this paper, a two-stage semi-supervised SISR method called SRAttentionGAN, is proposed. First, an upsampling network SRResNet, which is pre-trained in a supervised manner, is employed to scale the LR image to the desired size. Then, the upsampled results are fed into our improved unsupervised CycleGAN framework, which does not need paired samples, to obtain sharper and more realistic super-resolution (SR) images. Specifically, in the improved CycleGAN part, an attention-guided generator is proposed to perceive the discriminative semantic parts between the source and target images, to avoid the impact from low-level information. It also prevents the overall color tone from being changed. A multiscale discriminator is also adopted to further rich texture details. The effectiveness of the proposed SRAttentionGAN experiments is validated using four benchmarks (Set5, Set14, Urban100, and BSDS100) in both quantitative and qualitative aspects. Compared with the state-of-the-arts, the results are visually promising and show competitive performance in perceptual metrics, Natural Image Quality Evaluator (NIQE) and Perception Index (PI), which have better agreement with the human visual perception. INTRODUCTIONRecently, due to the rapid development of network communication and high-end display devices, single image super-resolution (SISR), which is one of the most important low-level tasks, has drawn increasing attention in both academia and industry. The goal of SISR is to reconstruct a high-resolution (HR) image from its corresponding low-resolution (LR) image. In recent decade, many SISR methods based on deep learning (DL) have emerged and achieved significant improvements in Peak Signal to Noise Ratio (PSNR)-based evaluations [1-7], or in perceptual image quality evaluations [8-10] over traditional methods. However, most of these methods belong to supervised learning, which means that a large number of paired HR and LR images are required and it is difficult to collect such a dataset This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.