“…Recently, there has been a tendency to build performance-efficient deep neural networks for various image fusion tasks, owing to their strong nonlinear learning abilities. Learning-based fusion architectures, such as the autoencoder (AE) [13, 14, 16, 19], the convolutional neural network (CNN) [15, 18, 20], and the generative adversarial network (GAN) [21, 22, 24, 27, 29], have achieved notable improvements in fusion performance, but their single-scale frameworks can hardly capture the full-scale features of real-world targets and fail to make the fused images photorealistic. More importantly, most methods directly capitalize on the features extracted in the last layer to reconstruct the fused images, while the features from earlier layers are left unexploited.…”