2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00265
|View full text |Cite
|
Sign up to set email alerts
|

Burst Denoising with Kernel Prediction Networks

Abstract: We present a technique for jointly denoising bursts of images taken from a handheld camera. In particular, we propose a convolutional neural network architecture for predicting spatially varying kernels that can both align and denoise frames, a synthetic data generation approach based on a realistic noise formation model, and an optimization guided by an annealed loss function to avoid undesirable local minima. Our model matches or outperforms the stateof-the-art across a wide range of noise levels on both rea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

3
439
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 397 publications
(455 citation statements)
references
References 22 publications
3
439
0
Order By: Relevance
“…In recent work, the use of deep convolutional neural networks (CNNs) has become a common theme to improve image processing algorithms for a better imaging pipeline. Examples include models that perform demosaicing [4], denoising [5,6], and many other types of image enhancement and transformation methods [7,8,9].…”
Section: Related Workmentioning
confidence: 99%
“…In recent work, the use of deep convolutional neural networks (CNNs) has become a common theme to improve image processing algorithms for a better imaging pipeline. Examples include models that perform demosaicing [4], denoising [5,6], and many other types of image enhancement and transformation methods [7,8,9].…”
Section: Related Workmentioning
confidence: 99%
“…This allows us to aggregate sparse but highly related samples only, enabling an efficient implementation in terms of speed and memory and achieving state-ofthe-art results even with kernels of size 3×3 on several tasks. For comparison, the adaptive convolution and kernel prediction networks require much larger neighbors (e.g., 21 × 21 in (Bako et al 2017;Vogels et al 2018), 41 × 41 in (Niklaus et al 2017), and 8×5×5 in (Mildenhall et al 2018)). As will be seen in our experiments, learning sampling locations of neighbors clearly boosts the performance significantly compared to learning kernel weights only.…”
Section: Variants Of the Spatial Transformermentioning
confidence: 99%
“…multi-modal images (e.g., RGB/D images in depth map upsampling and RGB/cost images in semantic segmentation), and thus is applicable to various tasks including depth and saliency map upsampling, cross-modality image restoration, texture removal, and semantic segmentation. In contrast, the adaptive convolution network is specialized to video frame interpolation, and kernel prediction networks are applicable to denoising Monte Carlo renderings (Bako et al 2017;Vogels et al 2018) or burst denoising (Mildenhall et al 2018) only. Finally, our model learns spatially-variant kernels to compute residual images, not a final output as in (Bako et al 2017;Jia et al 2016;Mildenhall et al 2018;Niklaus et al 2017;Vogels et al 2018), with constraints on weight regression.…”
Section: Variants Of the Spatial Transformermentioning
confidence: 99%
See 1 more Smart Citation
“…Several recent works proposed to leverage neural networks to infer locally optimal parameters for regression models [KBS15], reconstruct a noise‐free image using predicted kernels [BVM*17, MBC*17] or produce the image directly [CKS*17]. While deep learning will undoubtedly offset denoising performance in the future, the acquisition of sufficiently large training sets (there are currently none with deep images), the increased memory requirements due to the deep structure and their relatively poor generalization currently permit deployment only in big production houses.…”
Section: Related Workmentioning
confidence: 99%