2022
DOI: 10.48550/arxiv.2201.01586
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning True Rate-Distortion-Optimization for End-To-End Image Compression

Abstract: Even though rate-distortion optimization is a crucial part of traditional image and video compression, not many approaches exist which transfer this concept to end-to-end-trained image compression. Most frameworks contain static compression and decompression models which are fixed after training, so efficient rate-distortion optimization is not possible. In a previous work, we proposed RDONet, which enables an RDO approach comparable to adaptive block partitioning in HEVC. In this paper, we enhance the trainin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 10 publications
0
2
0
Order By: Relevance
“…RDOnet deploys masking layers to zero-out certain coefficients. By training models with such layers, unimportant regions of the image are identified during inference and do not have their information transmitted [11]- [13]. In [14], an ROI-based multi-rate codec is proposed that can dynamically control local and global rate allocation at the frame-level.…”
Section: Introduction Conventional Video Compression Has Been Challen...mentioning
confidence: 99%
“…RDOnet deploys masking layers to zero-out certain coefficients. By training models with such layers, unimportant regions of the image are identified during inference and do not have their information transmitted [11]- [13]. In [14], an ROI-based multi-rate codec is proposed that can dynamically control local and global rate allocation at the frame-level.…”
Section: Introduction Conventional Video Compression Has Been Challen...mentioning
confidence: 99%
“…For instance, RDOnet deploys masking layers to zero-out certain coefficients. By training models with such layers, unimportant regions of the image are identified during inference and do not have their information transmitted [7]- [9]. The drawback of this approach is the lack of inference-time signal adaptation.…”
Section: Introductionmentioning
confidence: 99%