2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.00796
|View full text |Cite
|
Sign up to set email alerts
|

Learned Image Compression With Discretized Gaussian Mixture Likelihoods and Attention Modules

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
657
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 660 publications
(658 citation statements)
references
References 13 publications
1
657
0
Order By: Relevance
“…Previous methods accumulated all the priors to estimate the probability based on a single GMM assumption for each element. Recent studies in [151] and [152] have shown that weighted GMMs can further improve coding efficiency.…”
Section: ) R-d Optimizationmentioning
confidence: 99%
“…Previous methods accumulated all the priors to estimate the probability based on a single GMM assumption for each element. Recent studies in [151] and [152] have shown that weighted GMMs can further improve coding efficiency.…”
Section: ) R-d Optimizationmentioning
confidence: 99%
“…This section introduces two classes of solutions to support YUV 4:2:0 format. The first class of solutions are based on input-output channel alignment, which aim to support YUV 4:2:0 without introducing any major changes to the existing network architectures in the literature [9], [10], [11], [12], [13], [14]. On the other hand, second class of solutions proposes a new transform network architecture where the main goal is to compress YUV 4:2:0 input data more efficiently.…”
Section: Transform Network Architectures For Image/video Codingmentioning
confidence: 99%
“…However, in the literature, there is very little or no work on DLEC designs specialized for YUV sources. Although existing architectures designed for coding RGB data [9], [10], [11], [12], [13], [14] (such as the one shown in Fig. 4) can be employed to support non-subsampled YUV 4:4:4 format by simply retraining network parameters on a YUV 4:4:4 dataset, effective solutions for chroma subsampled formats, such as YUV 4:2:0, are non-trivial and require new neural network architectures.…”
Section: Introductionmentioning
confidence: 99%
“…al [31] customized an architecture based on neural network and wavelet trans-form capable to support both lossy and lossless compression schemes. Cheng et.al developed a flexible entropy model based on discretized Gaussian mixture likelihoods by taking the advantage of recent attention modules and is proved its efficiency in reducing the latency [32].…”
Section: Related Workmentioning
confidence: 99%
“…This shows that the suggested method can be a good replacement for the traditional algorithms in an application that requires a large amount of storage space such as Picture Archiving and Communication System(PACS). Part d, e,f of the Figure 6 shows the comparison between the Machine Learning based Compression algorithms -GMM & Attention [32], iWave++ [31], Non-Local 3D-Context [30]. The proposed method is compared to the existing machine learning method on the basis of PSNR, SSIM and Space Saving.…”
Section: Evaluation Of System Performancementioning
confidence: 99%