Generalized Difference Coder: A Novel Conditional Autoencoder Structure for Video Compression

Brand, Fabian; Seiler, Jurgen; Schober, Robert

doi:10.48550/arxiv.2112.08011

Cited by 1 publication

(2 citation statements)

References 12 publications


“…The idea was extended in [22] for conditional motion coding, which encodes motion latents in an implicit, one-stage manner. However, Fabian et al [6] show that these conditional VAE-based approaches [21,22] may suffer from the bottleneck issue; that is, the latent representation of x_c produced by a neural network for conditional decoding may not capture all the information of x_c, which serves as a condition for encoding x_t. Such information loss and asymmetry can harm the efficiency of conditional coding.…”

Section: Conditional Coding (mentioning)

confidence: 99%

“…As compared with DCVC [23], CANF-VC additionally features conditional motion coding. Although conditional motion coding also appears in [22], their VAE-based approach does not explicitly estimate a flow map prior to conditional coding, and may suffer from the bottleneck issue [6] (Section 2.2). In contrast, CANF-VC takes an explicit approach and avoids the bottleneck issue by using the same x_c symmetrically in the encoder and the decoder due to its invertible property.…”

Section: Comparison With ANFIC and Other VAE-based Schemes (mentioning)

confidence: 99%


CANF-VC: Conditional Augmented Normalizing Flows for Video Compression

Ho, Chang, Chen et al. 2022

Preprint


This paper presents an end-to-end learning-based video compression system, termed CANF-VC, based on conditional augmented normalizing flows (CANF). Most learned video compression systems adopt the same hybrid-based coding architecture as the traditional codecs. Recent research on conditional coding has shown the sub-optimality of the hybrid-based coding and opens up opportunities for deep generative models to take a key role in creating new coding frameworks. CANF-VC represents a new attempt that leverages the conditional ANF to learn a video generative model for conditional inter-frame coding. We choose ANF because it is a special type of generative model, which includes variational autoencoder as a special case and is able to achieve better expressiveness. CANF-VC also extends the idea of conditional coding to motion coding, forming a purely conditional coding framework. Extensive experimental results on commonly used datasets confirm the superiority of CANF-VC to the state-of-the-art methods. The source code of CANF-VC is available at https://github.com/NYCU-MAPL/CANF-VC.

