Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3413536
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Adaptation of Neural Network Filter for Video Compression

Abstract: We present an efficient finetuning methodology for neural-network filters which are applied as a postprocessing artifact-removal step in video coding pipelines. The fine-tuning is performed at encoder side to adapt the neural network to the specific content that is being encoded. In order to maximize the PSNR gain and minimize the bitrate overhead, we propose to finetune only the convolutional layers' biases. The proposed method achieves convergence much faster than conventional finetuning approaches, making i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 24 publications
(16 citation statements)
references
References 11 publications
0
16
0
Order By: Relevance
“…• QP-specific training: dedicating one model for each QP or a range of QPs [8,66]. • QP-map training: providing QP as an input to the network [12,27,33,50,57]. Each approach has benefits and drawbacks.…”
Section: Methods Based On Coding Informationmentioning
confidence: 99%
“…• QP-specific training: dedicating one model for each QP or a range of QPs [8,66]. • QP-map training: providing QP as an input to the network [12,27,33,50,57]. Each approach has benefits and drawbacks.…”
Section: Methods Based On Coding Informationmentioning
confidence: 99%
“…This operation is performed on the encoder side, and the subject of optimization may be the encoder itself, the output of the encoder, or the decoder. In [9,10], the post-processing filter is finetuned at decoder side by using weight-updates signaled from the encoder to the decoder at inference time. Techniques of latent tensor overfitting for better human consumption are presented in [11,12], aiming at reducing the distortion in the pixel domain.…”
Section: Related Workmentioning
confidence: 99%
“…However, the single-path model is hard to utilize spatially-precise representations and large receptive field simultaneous. With respect to the inference of neural network for video filtering, frame-level on/off control was investigated in [6] and an efficient finetuning methodology was proposed to adapt the neural network to the specific content [14]. Attention mechanism.…”
Section: Related Workmentioning
confidence: 99%