2021
DOI: 10.48550/arxiv.2104.14335
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ELF-VC: Efficient Learned Flexible-Rate Video Coding

Abstract: While learned video codecs have demonstrated great promise, they have yet to achieve sufficient efficiency for practical deployment. In this work, we propose several novel ideas for learned video compression which allow for improved performance for the low-latency mode (I-and Pframes only) along with a considerable increase in computational efficiency. In this setting, for natural videos our approach compares favorably across the entire R-D curve under metrics PSNR, MS-SSIM and VMAF against all mainstream vide… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(10 citation statements)
references
References 29 publications
(67 reference statements)
0
10
0
Order By: Relevance
“…[24,49], which use 3D convolution architectures, and Refs. [3,9,11,12,16,23,26,33,36,37,48,50,52,53,69], which model P-frames as an optical flow field applied to the previous frame plus a residual model.…”
Section: Related Workmentioning
confidence: 99%
“…[24,49], which use 3D convolution architectures, and Refs. [3,9,11,12,16,23,26,33,36,37,48,50,52,53,69], which model P-frames as an optical flow field applied to the previous frame plus a residual model.…”
Section: Related Workmentioning
confidence: 99%
“…Neural video compression approaches are on the way catching up with traditional standards. Existing works in this field can be classified into two categories, designed for low delay setting [2,10,21,23,28,31,32,35,42] and random access setting [15,41,48,50]. The low delay setting is suitable for applications such as live streaming, which only uses the past frame(s) to predict the current frame.…”
Section: Video Compressionmentioning
confidence: 99%
“…The aforementioned DVC [35], SSF [2] and FVC [23] are typical works improving single-reference prediction. Some multi-frame fusion modules are also designed for unidirectional prediction with multiple reference frames [23,28,42]. Obviously, fusing more reference frames benefits the RD performance, but also brings a significant increase of memory cost.…”
Section: Video Compressionmentioning
confidence: 99%
See 2 more Smart Citations