FusionGRAM: An Infrared and Visible Image Fusion Framework Based on Gradient Residual and Attention Mechanism

Wang, Jinxin; Xi, Xiaoli; Li, Dongmei; Li, Fang

doi:10.1109/tim.2023.3237814

Cited by 19 publications

(7 citation statements)

References 44 publications

“…Therefore, when the window w is certain, the original image with large covariance will have a larger weight. Moreover, in order to better extract the details as well as the texture in each original image, and at the same time to highlight the infrared thermal information of the target of interest, the detail loss and pixel loss from the literature [16] are also added to the loss function. The detail loss assumes that the texture information in the fused image is the one that corresponds to the largest difference in gradient and the intensity information is the pixel with the largest brightness in the original image.…”

Section: Loss Function (mentioning)

confidence: 99%
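The two loss terms described in the statement above can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions, not the implementation from the citing paper or from reference [16]: the function names, the forward-difference gradient operator, and the mean absolute error are all hypothetical choices made here for clarity.

```python
import numpy as np

def gradient_magnitude(img):
    """Approximate per-pixel gradient magnitude with forward differences.

    Assumption: an L1 combination of horizontal and vertical differences;
    the quoted text does not say which gradient operator is used.
    """
    gx = np.abs(np.diff(img, axis=1, append=img[:, -1:]))  # horizontal differences
    gy = np.abs(np.diff(img, axis=0, append=img[-1:, :]))  # vertical differences
    return gx + gy

def detail_loss(fused, src_a, src_b):
    """Texture target: the larger gradient magnitude of the two sources."""
    target = np.maximum(gradient_magnitude(src_a), gradient_magnitude(src_b))
    return float(np.mean(np.abs(gradient_magnitude(fused) - target)))

def pixel_loss(fused, src_a, src_b):
    """Intensity target: the brighter pixel of the two sources."""
    target = np.maximum(src_a, src_b)
    return float(np.mean(np.abs(fused - target)))
```

With two same-sized float arrays `a` and `b`, the pixel-wise maximum `np.maximum(a, b)` gives `pixel_loss(np.maximum(a, b), a, b) == 0.0`, since it matches the intensity target exactly. It does not in general drive the detail term to zero, which is why the two terms are used together.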

“…In conventional image fusion algorithms, various transformation methods are usually used to extract features. The process that generates a huge amount of redundant information and requires complex fusion model design [6, 16-18]. In recent years, with the development of artificial intelligence technology, deep learning techniques have also been widely used in the field of image fusion [19].…”

Section: Introduction (mentioning)

confidence: 99%

“…Unlike other neural networks that require high-quality original and fused images as datasets, neural networks based on unsupervised learning do not require labeled fused images for training. Unsupervised models extract features from different bands and perform feature fusion according to a designed specific fusion strategy, and finally recover the fused images using a decoder [16, 24-27]. In intensity-polarization image fusion, a neural network model extracts the texture information from the intensity image and the salient information from the polarization image for feature fusion, which is subsequently recovered and reconstructed into a fused image with both original image information [28-30].…”

Section: Introduction (mentioning)

confidence: 99%

TPFNet: a tri-band polarimetric image fusion neural network

Wu, Ma, Chang et al. 2024

Sixth Conference on Frontiers in Optical Imaging and Technology: Imaging Detection and Target Recognition

In current image fusion techniques, image fusion is usually performed on a dual-band image to obtain a fused image with significant target information, or on an intensity image and a polarization image to obtain an image with stronger visual perception. If more information is to be obtained in a single image, tri-band fusion and intensity/polarization image fusion techniques can be combined. In order to solve the above problems, in this paper, we have acquired some tri-band polarization images through a common aperture multispectral polarization photoelectric device, which contains intensity and polarization visible (VIS) images, as well as the intensity images of near-infrared (NIR) and long-wave infrared (LIR). Besides, in order to obtain good image fusion results, we built an end-to-end self-supervised image fusion network and designed an efficient loss function to train the network. We conducted experiments on TPFNet on the acquired dataset and compared it with other image fusion algorithms. The results show that TPFNet achieves excellent results in both subjective and objective evaluations.

“…Recently, it is a tendency to build performance-efficient deep neural networks for various image fusion tasks due to their strong nonlinear learning abilities. Learning-based fusion architectures, such as autoencoder (AE) [13, 14, 16, 19], convolutional neural network (CNN) [15, 18, 20] and generative adversarial network (GAN) [21, 22, 24, 27, 29] have witnessed obvious improvements in fusion performance, but their single-scale frameworks can hardly capture the full-scale features of the real-world targets and fail to make the fused images photorealistic. More importantly, most methods directly capitalize on the features extracted in the last layer to reconstruct fused images, whereas earlier features do not.…”

Section: Technical Backgrounds (mentioning)

confidence: 99%

“…Reference [14-17] employed convolution kernels of different sizes to extract common and unique features of source images. Reference [18-20] captured the multilevel features of the source images via residual learning. Moreover, modern GAN-based approaches [21-30] exploit multi-granularity convolution kernels of the same feature level, yielding different receptive fields and in turn improving fusion performance.…”

Section: Introduction (mentioning)

confidence: 99%