Random sampling for fast face sketch synthesis

Wang, Nannan; Gao, Xinbo; Li, Jie

doi:10.1016/j.patcog.2017.11.008

Cited by 113 publications

(65 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Considering that patch matching based on traditional image features (e.g., PCA and SIFT) is not robust, a recent method [4] used CNN feature to represent the training patches and computed more accurate combination coefficients. To accelerate the synthesis procedure, Song et al [1] formulated face sketch synthesis as a spatial sketch denoising (SSD) problem, and Wang et al [13] presented an offline random sampling strategy for nearest neighbor selection of patches.…”

Section: Exemplar Based Methodsmentioning

confidence: 99%

“…When evaluating on CUFS, the reference photo-sketch pairs only comes from CUFS, and the same applies to CUFSF. To demonstrate the effectiveness of our model, we compare our results both qualitatively and quantitatively with seven other methods, namely MWF [3], SSD [1], RSLCR [13], DGFL [4], FCN [14], Pix2Pix-GAN [28], and Cycle-GAN [7]. We also compare our results quantitatively with the latest GAN based sketch synthesis methods, i.e., PS 2 -MAN [29] and stack-CA-GAN [18].…”

Section: Evaluation On Public Benchmarksmentioning

confidence: 96%

“…Image Quality Assessment For datasets with ground truth sketches (e.g., CUFS and CUFSF), previous work [13,18,4] typically used structural similarity (SSIM) [31] as an image quality assessment metric to measure the similarity between a generated sketch and the ground truth sketch. However, many researchers (e.g., in super resolution [32] and face sketch synthesis [30,29]) pointed out that SSIM is not always consistent with the perceptual quality.…”

Section: Quantitative Comparisonmentioning

confidence: 99%

See 2 more Smart Citations

Semi-supervised Learning for Face Sketch Synthesis in the Wild

Chen

Liu

Tan

et al. 2019

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Face sketch synthesis has made great progress in the past few years. Recent methods based on deep neural networks are able to generate high quality sketches from face photos. However, due to the lack of training data (photo-sketch pairs), none of such deep learning based methods can be applied successfully to face photos in the wild. In this paper, we propose a semi-supervised deep learning architecture which extends face sketch synthesis to handle face photos in the wild by exploiting additional face photos in training. Instead of supervising the network with ground truth sketches, we first perform patch matching in feature space between the input photo and photos in a small reference set of photo-sketch pairs. We then compose a pseudo sketch feature representation using the corresponding sketch feature patches to supervise our network. With the proposed approach, we can train our networks using a small reference set of photo-sketch pairs together with a large face photo dataset without ground truth sketches. Experiments show that our method achieves state-of-the-art performance both on public benchmarks and face photos in the wild. Codes are available at https://github.com/chaofengc/Face-Sketch-Wild.

show abstract

Section: Exemplar Based Methodsmentioning

confidence: 99%

Section: Evaluation On Public Benchmarksmentioning

confidence: 96%

Section: Quantitative Comparisonmentioning

confidence: 99%

See 1 more Smart Citation

Semi-supervised Learning for Face Sketch Synthesis in the Wild

Chen

Liu

Tan

et al. 2019

Lecture Notes in Computer Science

View full text Add to dashboard Cite

show abstract

“…To minimize the ambiguity of human ranking, we follow the voting strategy [54] to conduct this experi-linear warping lightness shift noise structural damage shifting contrast change blur component lost ghosting checkerboard Figure 6: Our distortions. These distortions are generated by various real synthesis algorithms [27,31,45,53,55,59,64,75,92,93]. ment (∼152K judgments) through the following stages:…”

Section: Similarity Assessmentsmentioning

confidence: 99%

Scoot: A Perceptual Metric for Facial Sketches

Fan

Zhang

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

View full text Add to dashboard Cite

Human visual system has the strong ability to quick assess the perceptual similarity between two facial sketches. However, existing two widely-used facial sketch metrics, e.g., FSIM and SSIM fail to address this perceptual similarity in this field. Recent study in facial modeling area has verified that the inclusion of both structure and texture has a significant positive benefit for face sketch synthesis (FSS). But which statistics are more important, and are helpful for their success? In this paper, we design a perceptual metric, called Structure Co-Occurrence Texture (Scoot), which simultaneously considers the block-level spatial structure and co-occurrence texture statistics. To test the quality of metrics, we propose three novel meta-measures based on various reliable properties. Extensive experiments demonstrate that our Scoot metric exceeds the performance of prior work. Besides, we built the first large scale (152k judgments) human-perception-based sketch database that can evaluate how well a metric is consistent with human perception. Our results suggest that "spatial structure" and "co-occurrence texture" are two generally applicable perceptual features in face sketch synthesis.

show abstract

“…Given this fact, most of the recent researches have focused on developing SB-FPNR methods, such as neural networks (NN) [17], temporal high-pass filter (THPF) [18,19] and constant-statistics (CS) method [20][21]. As for SB-FPNR algorithms, the calibration parameters are iteratively updated by utilizing the information extracted from inter-frame motion, therefore, ghosting artifacts and over smooth effects resulted from the sudden deceleration of scene motion often seriously degrade the noise reduction performance, moreover, the relatively slow convergence process occurred in scene switching is unacceptable for most of the practical applications.In recent years, convolution neural network (CNN) [22] models were explored deeply and applied in various image processing tasks [23], such as image super resolution [24,25], image denoising [26], and sketch synthesis [27][28][29]. To the best of our knowledge, CNN based FPNR methods still have not been extensively investigated in the literature.…”

mentioning

confidence: 99%

Fixed pattern noise reduction for infrared images based on cascade residual attention CNN

et al. 2020

View full text Add to dashboard Cite

Existing fixed pattern noise reduction (FPNR) methods are easily affected by the motion state of the scene and working condition of the image sensor, which leads to over smooth effects, ghosting artifacts as well as slow convergence rate. To address these issues, we design an innovative cascade convolution neural network (CNN) model with residual skip connections to realize single frame blind FPNR operation without any parameter tuning. Moreover, a coarse-fine convolution (CF-Conv) unit is introduced to extract complementary features in various scales and fuse them to pick more spatial information. Inspired by the success of the visual attention mechanism, we further propose a particular spatial-channel noise attention unit (SCNAU) to separate the scene details from fixed pattern noise more thoroughly and recover the real scene more accurately.Experimental results on test data demonstrate that the proposed cascade CNN-FPNR method outperforms the existing FPNR methods in both of visual effect and quantitative assessment. affected by the fixed pattern noise (FPN), which is mainly caused by the spatial non-uniform response of individual detectors in the sensor [6][7]. More seriously, spatial FPN generally drifts with time, which makes the problem be more challenging [8][9][10][11]. As a result, the FPN causes a significant decline in imaging quality and decreases the precision for object detection and recognition. To meet this challenge, the cost-effective fixed pattern noise reduction (FPNR) techniques based on signal processing are continually investigated and applied in nearly all the infrared imaging systems.Existing FPNR algorithms are mainly divided into two primary categories: reference-based FPNR (RB-FPNR) and scene-based FPNR (SB-FPNR) [12][13][14]. The RB-FPNR methods remove the FPN according to fixed calibration parameters calculated from the response of blackbody radiation at different temperatures [15]. Unfortunately, such a calibration requires the camera to halt the normal operation and update the calibration parameters due to the inherent temporal drift of detector characteristics [16]. Given this fact, most of the recent researches have focused on developing SB-FPNR methods, such as neural networks (NN) [17], temporal high-pass filter (THPF) [18,19] and constant-statistics (CS) method [20][21]. As for SB-FPNR algorithms, the calibration parameters are iteratively updated by utilizing the information extracted from inter-frame motion, therefore, ghosting artifacts and over smooth effects resulted from the sudden deceleration of scene motion often seriously degrade the noise reduction performance, moreover, the relatively slow convergence process occurred in scene switching is unacceptable for most of the practical applications.In recent years, convolution neural network (CNN) [22] models were explored deeply and applied in various image processing tasks [23], such as image super resolution [24,25], image denoising [26], and sketch synthesis [27][28][29]. To the best of our knowledge, CNN based FPNR m...

show abstract

Random sampling for fast face sketch synthesis

Cited by 113 publications

References 30 publications

Semi-supervised Learning for Face Sketch Synthesis in the Wild

Semi-supervised Learning for Face Sketch Synthesis in the Wild

Scoot: A Perceptual Metric for Facial Sketches

Fixed pattern noise reduction for infrared images based on cascade residual attention CNN

Contact Info

Product

Resources

About