2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00355
|View full text |Cite
|
Sign up to set email alerts
|

Learned Video Compression

Abstract: We present a new algorithm for video coding, learned end-to-end for the low-latency mode. In this setting, our approach outperforms all existing video codecs across nearly the entire bitrate range. To our knowledge, this is the first ML-based method to do so.We evaluate our approach on standard video compression test sets of varying resolutions, and benchmark against all mainstream commercial codecs, in the low-latency mode. On standard-definition videos, relative to our algorithm, HEVC/H.265, AVC/H.264 and VP… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
117
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 199 publications
(124 citation statements)
references
References 50 publications
0
117
0
Order By: Relevance
“…Learned Video Compression Video compression shares many similarities with image compression, but the large size of video data, and the very high degree of redundancy create new challenges [15,30,33,40]. One of the first deep learning-based approaches proposes to model video autoregressively with a RNN-conditioned PixelCNN [23].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Learned Video Compression Video compression shares many similarities with image compression, but the large size of video data, and the very high degree of redundancy create new challenges [15,30,33,40]. One of the first deep learning-based approaches proposes to model video autoregressively with a RNN-conditioned PixelCNN [23].…”
Section: Related Workmentioning
confidence: 99%
“…Very recently the video compression problem was attacked by considering flow compression and residual compression [27,33]. The additional components for flow and residual modeling allow to improve distortion in general, however, for low bit rates the proposed method is still outperformed by HEVC/H.265 on benchmark datasets.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Adaptive spatio-temporal decomposition prior to encoding, followed by CNN-based spatio-temporal upscaling after decoding has been proposed by Afonso et al and was validated with H.265/HEVC encoding [31]. Finally, Wave One recently proposed video encoding with deep neural networks [32] and demonstrated quality gains against a conventional video encoder without B frames, and focusing on very-high bitrate encoding (20mbps or higher for FHD). While these are important achievements, most of these proposals are still outperformed by post-2013 video encoders, like HEVC and VP9, when utilized with their most advanced video buffering verifier (VBV) encoding configurations and appropriate constant rate factor tuning [33].…”
Section: Related Workmentioning
confidence: 99%
“…While we get to utilize shared information between two images, we are not able to exploit the spatial-temporal redundancies within a tightly coupled image sequence. There has been an abundance of work on traditional multi-view and stereo compression [12,14] as well as deep-learning based image and video compression [5,33,36,49]. However, the space of deep multi-view compression is relatively unexplored.…”
Section: Introductionmentioning
confidence: 99%