2018
DOI: 10.1109/tcsvt.2017.2727682
|View full text |Cite
|
Sign up to set email alerts
|

Convolutional Neural Network-Based Block Up-Sampling for Intra Frame Coding

Abstract: The past decade has witnessed the huge success of deep learning in well-known artificial intelligence applications such as face recognition, autonomous driving, and large language model like ChatGPT. Recently, the application of deep learning has been extended to a much wider range, with neural networkbased video coding being one of them. Neural network-based video coding can be performed at two different levels: embedding neural network-based (NN-based) coding tools into a classical video compression framewor… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
55
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 118 publications
(56 citation statements)
references
References 41 publications
(34 reference statements)
0
55
0
Order By: Relevance
“…In addition, if resolution downsampling is applied, the base QP used to encode the low resolution video is reduced by a default value of 6, following the analysis in [7] and [14], in order to achieve a similar bitrate as would be achieved if no resampling was performed. 1…”
Section: B Spatial Decisionsmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition, if resolution downsampling is applied, the base QP used to encode the low resolution video is reduced by a default value of 6, following the analysis in [7] and [14], in order to achieve a similar bitrate as would be achieved if no resampling was performed. 1…”
Section: B Spatial Decisionsmentioning
confidence: 99%
“…In this context, several authors have proposed reducing spatial resolution for low bitrate encoding [3], [4], but lack a reliable adaptation technique. Others have developed prediction models [5], [6] or have introduced the resolution adaptation as one of the rate-distortion optimized modes at a block level (CTU) [7] but apply them for H.264 or intra coding only. Regarding temporal adaptation, a few methods for frame rate selection have been proposed in [8] and [9].…”
Section: Introductionmentioning
confidence: 99%
“…Thus, Afonso et al propose a spatio-temporal resolution adaptation where a CNN-based super-resolution model is used to reconstruct full-resolution content [19]. Li et al [20] introduce the block adaptive resolution coding framework for intra frame coding, where each block within a frame is either downscaled or coded at original resolution and then upscaled with a trained CNN at the decoder side. This concept was later extended to include P and B frames as well [21].…”
Section: Related Workmentioning
confidence: 99%
“…The final version of record is available at http://dx.doi.org/10.1109/TCSVT.2019.2954474 AV1 [141] ---3.0 --Note: 1 LL/RL/LH/RH denote low-delay low complexity, random-access low complexity, low-delay high-efficiency, and random-access high-efficiency configurations in early HM test conditions. [190][191][192][193], block up-sampling for intra frame coding [194], intra mode decision [195][196][197][198][199], transform [200], rate control [201,202], in-loop filtering/post-processing [203][204][205], arithmetic coding [206], or decoder-end artifact-removal and quality enhancement [207,208].…”
Section: Finer Precision Motion Estimation and Compensationmentioning
confidence: 99%