Kai Zhang scite author profile

Abstract-Discriminative model learning for image denoising has been recently attracting considerable attentions due to its favorable denoising performance. In this paper, we take one step forward by investigating the construction of feed-forward denoising convolutional neural networks (DnCNNs) to embrace the progress in very deep architecture, learning algorithm, and regularization method into image denoising. Specifically, residual learning and batch normalization are utilized to speed up the training process as well as boost the denoising performance. Different from the existing discriminative denoising models which usually train a specific model for additive white Gaussian noise (AWGN) at a certain noise level, our DnCNN model is able to handle Gaussian denoising with unknown noise level (i.e., blind Gaussian denoising). With the residual learning strategy, DnCNN implicitly removes the latent clean image in the hidden layers. This property motivates us to train a single DnCNN model to tackle with several general image denoising tasks such as Gaussian denoising, single image super-resolution and JPEG image deblocking. Our extensive experiments demonstrate that our DnCNN model can not only exhibit high effectiveness in several general image denoising tasks, but also be efficiently implemented by benefiting from GPU computing.

show abstract

NTIRE 2017 Challenge on Single Image Super-Resolution: Methods and Results

Timofte

Agustsson

Gool³

et al. 2017

981

545

View full text Add to dashboard Cite

End-to-End Blind Image Quality Assessment Using Deep Neural Networks

Liu

Zhang

et al. 2018

IEEE Trans. on Image Process.

442

275

View full text Add to dashboard Cite

We propose a multi-task end-to-end optimized deep neural network (MEON) for blind image quality assessment (BIQA). MEON consists of two sub-networks-a distortion identification network and a quality prediction network-sharing the early layers. Unlike traditional methods used for training multi-task networks, our training process is performed in two steps. In the first step, we train a distortion type identification sub-network, for which large-scale training samples are readily available. In the second step, starting from the pre-trained early layers and the outputs of the first sub-network, we train a quality prediction sub-network using a variant of the stochastic gradient descent method. Different from most deep neural networks, we choose biologically inspired generalized divisive normalization (GDN) instead of rectified linear unit as the activation function. We empirically demonstrate that GDN is effective at reducing model parameters/layers while achieving similar quality prediction performance. With modest model complexity, the proposed MEON index achieves state-of-the-art performance on four publicly available benchmarks. Moreover, we demonstrate the strong competitiveness of MEON against state-of-the-art BIQA models using the group maximum differentiation competition methodology.

show abstract

Deep Unfolding Network for Image Super-Resolution

2020

View full text Add to dashboard Cite

Learning-based single image super-resolution (SISR) methods are continuously showing superior effectiveness and efficiency over traditional model-based methods, largely due to the end-to-end training. However, different from model-based methods that can handle the SISR problem with different scale factors, blur kernels and noise levels under a unified MAP (maximum a posteriori) framework, learning-based methods generally lack such flexibility. To address this issue, this paper proposes an end-to-end trainable unfolding network which leverages both learningbased methods and model-based methods. Specifically, by unfolding the MAP inference via a half-quadratic splitting algorithm, a fixed number of iterations consisting of alternately solving a data subproblem and a prior subproblem can be obtained. The two subproblems then can be solved with neural modules, resulting in an end-to-end trainable, iterative network. As a result, the proposed network inherits the flexibility of model-based methods to super-resolve blurry, noisy images for different scale factors via a single model, while maintaining the advantages of learning-based methods. Extensive experiments demonstrate the superiority of the proposed deep unfolding network in terms of flexibility, effectiveness and also generalizability.

show abstract

Designing a Practical Degradation Model for Deep Blind Image Super-Resolution

et al. 2021

View full text Add to dashboard Cite

Maximum Margin Clustering Made Practical

Zhang

Tsang

Kwok

2009

IEEE Trans. Neural Netw.

120

136

View full text Add to dashboard Cite

Motivated by the success of large margin methods in supervised learning, maximum margin clustering (MMC) is a recent approach that aims at extending large margin methods to unsupervised learning. However, its optimization problem is nonconvex and existing MMC methods all rely on reformulating and relaxing the nonconvex optimization problem as semidefinite programs (SDP). Though SDP is convex and standard solvers are available, they are computationally very expensive and only small data sets can be handled. To make MMC more practical, we avoid SDP relaxations and propose in this paper an efficient approach that performs alternating optimization directly on the original nonconvex problem. A key step to avoid premature convergence in the resultant iterative procedure is to change the loss function from the hinge loss to the Laplacian/square loss so that overconfident predictions are penalized. Experiments on a number of synthetic and real-world data sets demonstrate that the proposed approach is more accurate, much faster (hundreds to tens of thousands of times faster), and can handle data sets that are hundreds of times larger than the largest data set reported in the MMC literature.

show abstract

Selecting the reference image for registration of CEST series

Zhang

Heo

Lee

et al. 2015

Magnetic Resonance Imaging

View full text Add to dashboard Cite

Purpose To compare different reference images selected for registration among chemical exchange saturation transfer (CEST) series. Materials and Methods Five normal volunteers and eight brain tumor patients were studied on a 3 Tesla scanner. Image registration was performed by choosing each of the acquired CEST saturation or unsaturation dynamic images as the reference. CEST images at 3.5 ppm (amide proton transfer, APT) were computed for each motion-corrected data set after main magnetic field inhomogeneity correction. A uniformity index was defined to quantify the efficacy of image registration using different reference images. Joint histograms and the structural similarity index (SSIM) were used to analyze the intrinsic image similarity between various dynamic images. Results Image registration increased the average uniformity index by 18% if the 3.5 ppm saturated image was selected as the reference image. However, registering to the unsaturated dynamic image reduced the uniformity index by 13% on average. The joint histogram analysis showed that the saturated dynamic images were highly similar (SSIM = 0.89 ± 0.01), and were considerably different from the unsaturated dynamic image (SSIM = 0.58 ± 0.03). Conclusion The selection of the 3.5 ppm dynamic image as the reference image generated the highest uniformity index for APT imaging though other saturated images were equally suited as reference images.

show abstract

Outlier-robust extreme learning machine for regression problems

2015

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Kai Zhang

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising

NTIRE 2017 Challenge on Single Image Super-Resolution: Methods and Results

End-to-End Blind Image Quality Assessment Using Deep Neural Networks

Deep Unfolding Network for Image Super-Resolution

Designing a Practical Degradation Model for Deep Blind Image Super-Resolution

Maximum Margin Clustering Made Practical

Selecting the reference image for registration of CEST series

Outlier-robust extreme learning machine for regression problems

Contact Info

Product

Resources

About