2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
DOI: 10.1109/cvpr.2017.577
Full Resolution Image Compression with Recurrent Neural Networks

Abstract: This paper presents a set of full-resolution lossy image compression methods based on neural networks. Each of the architectures we describe can provide variable compression rates during deployment without requiring retraining of the network: each network need only be trained once. All of our architectures consist of a recurrent neural network (RNN)-based encoder and decoder, a binarizer, and a neural network for entropy coding. We compare RNN types (LSTM, associative LSTM) and introduce a new hybrid of GRU an…
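The variable-rate behavior the abstract describes can be sketched in a few lines. The loop below is a minimal illustration, not the authors' exact architecture: it re-encodes the residual left by previous iterations, so running more iterations spends more bits and improves fidelity, while the paper's actual models additionally carry recurrent state across iterations. `encoder`, `binarizer`, and `decoder` are assumed placeholder modules.

```python
# Minimal sketch of progressive (variable-rate) compression by residual
# re-encoding, in the spirit of the paper. All module names are
# placeholders; the real models are RNNs that keep hidden state.
import torch

def progressive_compress(x, encoder, binarizer, decoder, n_iters=8):
    bits = []
    recon = torch.zeros_like(x)
    residual = x
    for _ in range(n_iters):
        b = binarizer(encoder(residual))  # one small packet of binary codes
        recon = recon + decoder(b)        # additive refinement of the image
        residual = x - recon              # what is still left to encode
        bits.append(b)
    # Truncating `bits` after any iteration yields a valid, lower-rate code.
    return bits, recon
```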

Cited by 814 publications (835 citation statements)
References 18 publications (24 reference statements)
“…Like [29], soft relaxation of quantization is used to resolve the non-differentiability problem of the quantization function [27]. Different from [27,29], the thumbnail images are compressed by a recurrent neural network architecture, in which a stochastic rounding operation binarizes the feature maps [30]. Recently, a virtual codec network has been learned to imitate the projection from the represented vectors to the decoded images, making the image compression framework trainable in an end-to-end way [11].…”
Section: B. Deep Image Compression Framework
confidence: 99%
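The stochastic rounding mentioned for [30] can be made concrete with a short sketch. Assuming encoder activations in (-1, 1) (e.g. after tanh), each value is rounded to ±1 with a probability that makes the rounding unbiased, and the gradient is passed straight through; this is an illustrative PyTorch version, not the authors' code.

```python
import torch

class StochasticBinarizer(torch.autograd.Function):
    """Unbiased stochastic rounding of values in (-1, 1) to {-1, +1}."""

    @staticmethod
    def forward(ctx, x):
        # P(b = +1) = (1 + x) / 2, so E[b] = x: the rounding is unbiased.
        prob_plus = (1.0 + x) / 2.0
        return torch.where(torch.rand_like(x) < prob_plus,
                           torch.ones_like(x), -torch.ones_like(x))

    @staticmethod
    def backward(ctx, grad_output):
        # Straight-through estimator: treat the binarizer as the identity.
        return grad_output

# Example: binarize the (tanh-squashed) output of a hypothetical encoder.
codes = StochasticBinarizer.apply(torch.tanh(torch.randn(1, 32, 8, 8)))
```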
“…In deep image compression [1], [7], [8], the handcrafted analysis and synthesis transforms are replaced by the encoder z = f(x; θ) and decoder x̂ = g(ẑ; φ) of a convolutional autoencoder, parametrized by θ and φ. The fundamental difference is that the transforms are not designed but learned from training data.…”
Section: Introduction
confidence: 99%
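A minimal version of the learned transforms in that statement, with illustrative layer sizes (the channel counts and strides below are assumptions, not taken from [1], [7], or [8]):

```python
import torch.nn as nn

class Encoder(nn.Module):
    """Analysis transform z = f(x; θ): three stride-2 convolutions."""
    def __init__(self, ch=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, ch, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(ch, ch, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(ch, ch, 5, stride=2, padding=2),
        )
    def forward(self, x):
        return self.net(x)

class Decoder(nn.Module):
    """Synthesis transform x̂ = g(ẑ; φ): mirror image of the encoder."""
    def __init__(self, ch=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(ch, ch, 5, 2, 2, output_padding=1), nn.ReLU(),
            nn.ConvTranspose2d(ch, ch, 5, 2, 2, output_padding=1), nn.ReLU(),
            nn.ConvTranspose2d(ch, 3, 5, 2, 2, output_padding=1),
        )
    def forward(self, z_hat):
        return self.net(z_hat)
```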
“…The problem is solved using gradient descent and backpropagation [18]. To make the model differentiable, as backpropagation requires, the quantizer is replaced during training by a differentiable proxy function [1], [7], [8]. Entropy coding, in turn, is lossless and invertible, but the length of the bitstream b must still be computed.…”
Section: Introduction
confidence: 99%
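Two common stand-ins for the non-differentiable quantizer are additive uniform noise and a straight-through round; both are sketched below. Which proxy a given paper uses varies, so treat this as an illustration rather than the method of [1], [7], or [8]:

```python
import torch

def quantize(z, training):
    """Hard rounding at test time; uniform-noise proxy during training."""
    if training:
        # Noise in [-0.5, 0.5) mimics the rounding error while keeping
        # gradients well defined everywhere.
        return z + torch.empty_like(z).uniform_(-0.5, 0.5)
    return torch.round(z)

def quantize_ste(z):
    """Straight-through variant: forward rounds, backward is identity."""
    return z + (torch.round(z) - z).detach()
```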
“…For example, some image compression approaches use generative models to learn the distribution of images through adversarial training [6,7,8], achieving impressive subjective quality at extremely low bit rates. Some works use recurrent neural networks to compress the residual information recursively, such as [9,10,11], to realize scalable coding. Other approaches propose hyperprior-based and context-adaptive context models to compress the latent codes effectively [12,13,14].…”
Section: Introduction
confidence: 99%
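The hyperprior idea in the last sentence can also be sketched briefly: a hyper-encoder summarizes the latents y into side information z, and a hyper-decoder maps the noisily quantized ẑ to per-element scales of a Gaussian entropy model for y, from which an estimated bit count follows. Layer shapes and names below are illustrative assumptions, not the architecture of [12], [13], or [14]:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Hyperprior(nn.Module):
    """Gaussian entropy model for latents y with learned side information."""
    def __init__(self, ch=192):
        super().__init__()
        # Hyper-encoder: y -> z (side information).
        self.h_enc = nn.Sequential(
            nn.Conv2d(ch, ch, 3, 1, 1), nn.ReLU(),
            nn.Conv2d(ch, ch, 5, 2, 2), nn.ReLU(),
            nn.Conv2d(ch, ch, 5, 2, 2),
        )
        # Hyper-decoder: ẑ -> per-element scales σ for y's Gaussian model.
        self.h_dec = nn.Sequential(
            nn.ConvTranspose2d(ch, ch, 5, 2, 2, output_padding=1), nn.ReLU(),
            nn.ConvTranspose2d(ch, ch, 5, 2, 2, output_padding=1), nn.ReLU(),
            nn.Conv2d(ch, ch, 3, 1, 1),
        )

    def forward(self, y):
        noise = lambda t: t + torch.empty_like(t).uniform_(-0.5, 0.5)
        z_hat = noise(self.h_enc(y))              # quantization proxy for z
        sigma = F.softplus(self.h_dec(z_hat)) + 1e-6
        y_hat = noise(y)                          # quantization proxy for y
        # Probability mass of each ŷ under N(0, σ), integrated over its bin.
        gauss = torch.distributions.Normal(0.0, sigma)
        p_y = gauss.cdf(y_hat + 0.5) - gauss.cdf(y_hat - 0.5)
        rate_bits = -torch.log2(p_y.clamp(min=1e-9)).sum()
        return y_hat, rate_bits
```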