2021
DOI: 10.48550/arxiv.2112.08011
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Generalized Difference Coder: A Novel Conditional Autoencoder Structure for Video Compression

Abstract: Motion compensated inter prediction is a common component of all video coders. The concept was established in traditional hybrid coding and successfully transferred to learning-based video compression. To compress the residual signal after prediction, usually the difference of the two signals is compressed using a standard autoencoder. However, information theory tells us that a general conditional coder is more efficient. In this paper, we provide a solid foundation based on information theory and Shannon ent… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 12 publications
0
2
0
Order By: Relevance
“…The idea was extended in [22] for conditional motion coding, which encodes motion latents in an implicit, one-stage manner. However, Fabian et al [6] show that these conditional VAE-based approaches [21,22] may suffer from the bottleneck issue; that is, the latent representation of x c produced by a neural network for conditional decoding may not capture all the information of x c , which serves as a condition for encoding x t . Such information loss and asymmetry can harm the efficiency of conditional coding.…”
Section: Conditional Codingmentioning
confidence: 99%
See 1 more Smart Citation
“…The idea was extended in [22] for conditional motion coding, which encodes motion latents in an implicit, one-stage manner. However, Fabian et al [6] show that these conditional VAE-based approaches [21,22] may suffer from the bottleneck issue; that is, the latent representation of x c produced by a neural network for conditional decoding may not capture all the information of x c , which serves as a condition for encoding x t . Such information loss and asymmetry can harm the efficiency of conditional coding.…”
Section: Conditional Codingmentioning
confidence: 99%
“…As compared with DCVC [23], CANF-VC additionally features conditional motion coding. Although conditional motion coding also appears in [22], their VAE-based approach does not explicitly estimate a flow map prior to conditional coding, and may suffer from the bottleneck issue [6] (Section 2.2). In contrast, CANF-VC takes an explicit approach and avoids the bottleneck issue by using the same x c symmetrically in the encoder and the decoder due to its invertible property.…”
Section: Comparison With Anfic and Other Vae-based Schemesmentioning
confidence: 99%