Abstract:Thermal imaging has substantially evolved, during the recent years, to be established as a complement, or even occasionally as an alternative to conventional visible light imaging, particularly for face analysis applications. Facial landmark detection is a crucial prerequisite for facial image processing. Given the upswing of deep learning based approaches, the performance of facial landmark detection has been significantly improved. However, this uprise is merely limited to visible spectrum based face analysi… Show more
“…Examples include the favtGAN approach [13] where thermal faces are generated by modifying a PatchGAN discriminator [27] to learn auxiliary thermal sensor classes from a combination of different datasets. Mallat et al apply a Cascaded Refinement Network (CRN) [39] based on progressively upsampled feature maps [14,40]. Pavez et al generates a set of stylized thermal facial images using the GansNRoses [41] architecture for thermal facial recognition experiments [42], and [15] uses StyleGAN2 to generate random high resolution thermal faces that are not mapped to existing visible faces.…”
“…The GAN framework has been applied successfully in both directions of Visible-to-Thermal (VT) and Thermal-to-Visible (TV) for person re-identification [2,3,4,5,6,7,8,9,10,11,12]. For facial VT translation, paired methods are preferable, in order to preserve the physiological mapping which is considered a biometric [13,14,15]. Yet, few works have explored how to estimate temperature from thermogram pixels where some works use metadata supplied by the camera, that demonstrate high quality results [16,17].…”
Thermal facial imagery offers valuable insight into physiological states such as inflammation and stress by detecting emitted radiation in the infrared spectrum, which is unseen in the visible spectra. Telemedicine applications could benefit from thermal imagery, but conventional computers are reliant on RGB cameras and lack thermal sensors. As a result, we propose the Visible-to-Thermal Facial GAN (VTF-GAN) that is specifically designed to generate high-resolution thermal faces by learning both the spatial and frequency domains of facial regions, across spectra. We compare VTF-GAN against several popular GAN baselines and the first conditional Denoising Diffusion Probabilistic Model (DDPM) for VT face translation (VTF-Diff). Results show that VTF-GAN achieves high quality, crisp, and perceptually realistic thermal faces using a combined set of patch, temperature, perceptual, and Fourier Transform losses, compared to all baselines including diffusion.
“…Examples include the favtGAN approach [13] where thermal faces are generated by modifying a PatchGAN discriminator [27] to learn auxiliary thermal sensor classes from a combination of different datasets. Mallat et al apply a Cascaded Refinement Network (CRN) [39] based on progressively upsampled feature maps [14,40]. Pavez et al generates a set of stylized thermal facial images using the GansNRoses [41] architecture for thermal facial recognition experiments [42], and [15] uses StyleGAN2 to generate random high resolution thermal faces that are not mapped to existing visible faces.…”
“…The GAN framework has been applied successfully in both directions of Visible-to-Thermal (VT) and Thermal-to-Visible (TV) for person re-identification [2,3,4,5,6,7,8,9,10,11,12]. For facial VT translation, paired methods are preferable, in order to preserve the physiological mapping which is considered a biometric [13,14,15]. Yet, few works have explored how to estimate temperature from thermogram pixels where some works use metadata supplied by the camera, that demonstrate high quality results [16,17].…”
Thermal facial imagery offers valuable insight into physiological states such as inflammation and stress by detecting emitted radiation in the infrared spectrum, which is unseen in the visible spectra. Telemedicine applications could benefit from thermal imagery, but conventional computers are reliant on RGB cameras and lack thermal sensors. As a result, we propose the Visible-to-Thermal Facial GAN (VTF-GAN) that is specifically designed to generate high-resolution thermal faces by learning both the spatial and frequency domains of facial regions, across spectra. We compare VTF-GAN against several popular GAN baselines and the first conditional Denoising Diffusion Probabilistic Model (DDPM) for VT face translation (VTF-Diff). Results show that VTF-GAN achieves high quality, crisp, and perceptually realistic thermal faces using a combined set of patch, temperature, perceptual, and Fourier Transform losses, compared to all baselines including diffusion.
“…In addition, face and landmark detection in the thermal spectrum was addressed by methods based on generative adversarial networks (GANs), aiming at translating facial images to the visible spectrum, then extracting facial key points and transferring the key points to the image of the original spectrum. Mallat et al [11] proposed converting existing visible face databases to the thermal spectrum. Active appearance models (AAMs) and DAN are then trained using the synthetic thermal, along with the shared landmark annotations.…”
Section: Related Workmentioning
confidence: 99%
“…We note that existing face and landmark detection algorithms, that were trained with visible face images, fail to generalize onto thermal images [7,14] due to the cross-spectral modality gap. At the same time, lack of available annotated thermal datasets is the primary cause for the scarcity of work focused on detection of thermal facial landmarks [8,11,16].…”
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L'archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d'enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.