DarkVisionNet: Low-Light Imaging via RGB-NIR Fusion with Deep Inconsistency Prior

Jin, Shuangping; Yu, Bo; Jing, Minhao; Zhou, Yi; Liang, Jiajun; Ji, Renhe

doi:10.1609/aaai.v36i1.19995

Cited by 17 publications

(1 citation statement)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The structural features are represented by binary edge features, obtained through the application of the Sobel operator for image filtering. The inconsistency between images is defined as follows: 24 F(edgeC,edgeN)=λ(1−edgeC)(1−edgeN)+edgeC·edgeN,where edgeC represents the edge feature map for each color channel of the RGB image and edgeN represents the edge feature map for the NIR image, with the dimensions of the feature map being the same as those of the original images. Figure 4 shows the representation of structural inconsistency between RGB and NIR images.…”

Section: Methodsmentioning

confidence: 99%

Enhancement of dark areas on the surface of scrap metals based on RGB-NIR image fusion

Ma,

Ye,

et al. 2024

J. Electron. Imag.

View full text Add to dashboard Cite

The application of machine vision in object identification and classification has significantly enhanced recognition efficiency. Nevertheless, for non-ferrous scrap metals with poor surface smoothness, the unevenness of reflected light results in the generation of dark regions in the images, obscuring a considerable amount of detailed information and reducing the recognition accuracy. Addressing these challenges, we propose a method for enhancing the details of dark regions based on the RGB-NIR image fusion theory, integrating detailed information from NIR images into RGB images. First, a robust deep residual denoising network is constructed to estimate and remove noise in images. Subsequently, to address the difficulty of extracting structural features in dark regions, a multi-scale spatial deep structure feature extraction module based on channel attention blocks is developed. This module effectively extracts the structural features of RGB and NIR image pairs, with the target image serving as the supervisory signal. Finally, guided by the theory of structural inconsistency, multi-scale feature maps are fused. The image fusion network adopts an encoder-decoder architecture embedded with residual channel attention blocks. The experimental results indicate that the approach proposed in this study demonstrates notable efficacy in image denoising and detail enhancement.

show abstract