Region-of-Interest Based Neural Video Compression

Perugachi-Diaz, Yura; Sautiere, Guillaume; Abati, Davide; Yang, Yang; Habibian, Amirhossein; Cohen, Taco

doi:10.48550/arxiv.2203.01978

Search citation statements

Order By: Relevance

Paper Sections

Select...

Introduction Conventional Video Compression Has Been Challen...1

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2023

Publication Types

Select...

Other1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

(1 citation statement)

References 38 publications

(115 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The problem of encoding-time spatial rate allocation by LVC has been sparsely studied in the literature [6]- [10]. RDOnet imitates the RDO process of the conventional codecs in a generic manner which might possibly allow spatial rate allocation.…”

Section: Introduction Conventional Video Compression Has Been Challen...mentioning

confidence: 99%

Spatial Rate Allocation for Learning-based Video Coding

Abdoli,

Henry,

Clare

et al. 2023

2023 31st European Signal Processing Conference (EUSIPCO)

View full text Add to dashboard Cite

This paper presents a method that enables arbitrary end-to-end Learning-based image/video codecs to apply spatial rate allocation. At the frame-level, the forward pass of the underlying encoder network is followed by a latent refinement step, in which a customized loss function is minimized. This loss function takes as input an arbitrary pixel-wise map that defines the interest of each pixel and computes a weighted distortion with respect to the given interest map. Back-propagation of the customized loss function using the gradient descent gives a refined version of the frame latent in which the quality of regions of interest (ROI) is improved at the cost of quality of regions of disinterest. The proposed method is implemented on top of an existing end-to-end LVC, called AIVC 1 , using saliencebased interest maps. Experiments show that the proposed method can effectively improve the quality of regions of interest frames. Notably, BD-BR performance using Weighted PSNR (WPSNR) shows an improvement of up to 21% by the proposed method.

show abstract

Section: Introduction Conventional Video Compression Has Been Challen...mentioning

confidence: 99%