2020
DOI: 10.48550/arxiv.2007.02711
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Perceptually Optimizing Deep Image Compression

Li-Heng Chen,
Christos G. Bampis,
Zhi Li
et al.

Abstract: Mean squared error (MSE) and ℓ p norms have largely dominated the measurement of loss in neural networks due to their simplicity and analytical properties. However, when used to assess visual information loss, these simple norms are not highly consistent with human perception. Here, we propose a different proxy approach to optimize image analysis networks against quantitative perceptual models. Specifically, we construct a proxy network, which mimics the perceptual model while serving as a loss layer of the ne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 37 publications
(41 reference statements)
0
4
0
Order By: Relevance
“…in which the frequency index k is dropped for simplicity. As can be observed, the statistics of A in (12) are the same as A (sub) in (34). Thus, without any loss of generality we find the distribution of A (sub) .…”
Section: Appendix a Proof Of Theoremmentioning
confidence: 51%
See 1 more Smart Citation
“…in which the frequency index k is dropped for simplicity. As can be observed, the statistics of A in (12) are the same as A (sub) in (34). Thus, without any loss of generality we find the distribution of A (sub) .…”
Section: Appendix a Proof Of Theoremmentioning
confidence: 51%
“…Another difference is that a good performance metric for channel estimation is the Euclidean distance or SNR, due to Gaussian noise and the dispassionate nature of symbol demodulation in the presence of such noise. On the other hand, image quality is perceptual and qualitative, and Euclidean distance and SNR are known to be poor measures of image quality [33], [34]. Indeed, a major feature of a GAN is that it can produce an image that is far from the target image under a quantitative measure like Euclidean distance, but very close in a perceptual sense.…”
Section: Channel Estimation and Image Reconstruction Differences For ...mentioning
confidence: 99%
“…Recent developments, such as the JPEG-AI competition, 1 focus on image compression methods that are learning-based and suitable for higher-resolution images. Those learningbased methods can be implemented using DNNs [10,62,116] or use hybrid approaches that rely on traditional methods combined with neural networks for image enhancement [56]. An example of such a hybrid variant is proposed by Lee et al [56].…”
Section: High Resolution Image Quality Assessmentmentioning
confidence: 99%
“…Moreover, it is well known that 1 and 2 losses do not correlate well with subjective video quality [67][68][69], and the combined loss functions employed in these GANbased training strategies use artificially configured combining weights, which have never been fully evaluated in terms of their correlation with subjective video quality. These issues inevitably lead to sub-optimal training performance when the networks are utilised for compression application.…”
Section: Training Strategymentioning
confidence: 99%