Convolutional Neural Network-Based Block Up-Sampling for Intra Frame Coding

Li, Yue; Liu, Dong; Li, Houqiang; Li, Li; Wu, Feng; Zhang, Hong; Yang, Haitao

doi:10.1109/tcsvt.2017.2727682

Cited by 118 publications

(56 citation statements)

References 41 publications

(34 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In addition, if resolution downsampling is applied, the base QP used to encode the low resolution video is reduced by a default value of 6, following the analysis in [7] and [14], in order to achieve a similar bitrate as would be achieved if no resampling was performed. 1…”

Section: B Spatial Decisionsmentioning

confidence: 99%

“…In this context, several authors have proposed reducing spatial resolution for low bitrate encoding [3], [4], but lack a reliable adaptation technique. Others have developed prediction models [5], [6] or have introduced the resolution adaptation as one of the rate-distortion optimized modes at a block level (CTU) [7] but apply them for H.264 or intra coding only. Regarding temporal adaptation, a few methods for frame rate selection have been proposed in [8] and [9].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Video Compression Based on Spatio-Temporal Resolution Adaptation

Afonso

Zhang

Bull

2019

IEEE Trans. Circuits Syst. Video Technol.

View full text Add to dashboard Cite

Section: B Spatial Decisionsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Video Compression Based on Spatio-Temporal Resolution Adaptation

Afonso

Zhang

Bull

2019

IEEE Trans. Circuits Syst. Video Technol.

View full text Add to dashboard Cite

“…Thus, Afonso et al propose a spatio-temporal resolution adaptation where a CNN-based super-resolution model is used to reconstruct full-resolution content [19]. Li et al [20] introduce the block adaptive resolution coding framework for intra frame coding, where each block within a frame is either downscaled or coded at original resolution and then upscaled with a trained CNN at the decoder side. This concept was later extended to include P and B frames as well [21].…”

Section: Related Workmentioning

confidence: 99%

Deep Video Precoding

Bourtsoulatze¹,

Chadha²,

Fadeev³

et al. 2020

IEEE Trans. Circuits Syst. Video Technol.

View full text Add to dashboard Cite

Several groups worldwide are currently investigating how deep learning may advance the state-of-the-art in image and video coding. An open question is how to make deep neural networks work in conjunction with existing (and upcoming) video codecs, such as MPEG H.264/AVC, H.265/HEVC, VVC, Google VP9 and AOMedia AV1, as well as existing container and transport formats, without imposing any changes at the client side. Such compatibility is a crucial aspect when it comes to practical deployment, especially when considering the fact that the video content industry and hardware manufacturers are expected to remain committed to supporting these standards for the foreseeable future.We propose to use deep neural networks as precoders for current and future video codecs and adaptive video streaming systems. In our current design, the core precoding component comprises a cascaded structure of downscaling neural networks that operates during video encoding, prior to transmission. This is coupled with a precoding mode selection algorithm for each independently-decodable stream segment, which adjusts the downscaling factor according to scene characteristics, the utilized encoder, and the desired bitrate and encoding configuration. Our framework is compatible with all current and future codec and transport standards, as our deep precoding network structure is trained in conjunction with linear upscaling filters (e.g., the bilinear filter), which are supported by all web video players. Results with FHD (1080p) and UHD (2160p) content and widelyused H.264/AVC, H.265/HEVC and VP9 encoders show that coupling such standards with the proposed deep video precoding allows for 15% to 45% rate reduction under encoding configurations and bitrates suitable for video-on-demand adaptive streaming systems. The use of precoding can also lead to encoding complexity reduction, which is essential for cost-effective cloud deployment of complex encoders like H.265/HEVC and VP9, especially when considering the prominence of high-resolution adaptive video streaming.

show abstract

“…The final version of record is available at http://dx.doi.org/10.1109/TCSVT.2019.2954474 AV1 [141] ---3.0 --Note: 1 LL/RL/LH/RH denote low-delay low complexity, random-access low complexity, low-delay high-efficiency, and random-access high-efficiency configurations in early HM test conditions. [190][191][192][193], block up-sampling for intra frame coding [194], intra mode decision [195][196][197][198][199], transform [200], rate control [201,202], in-loop filtering/post-processing [203][204][205], arithmetic coding [206], or decoder-end artifact-removal and quality enhancement [207,208].…”

Section: Finer Precision Motion Estimation and Compensationmentioning

confidence: 99%

Recent Advances on HEVC Inter-Frame Coding: From Optimization to Implementation and Beyond

Zhang

Fan

et al. 2020

IEEE Trans. Circuits Syst. Video Technol.

View full text Add to dashboard Cite

High Efficiency Video Coding (HEVC) has doubled the video compression ratio with equivalent subjective quality as compared to its predecessor H.264/AVC. The significant coding efficiency improvement is attributed to many new techniques. Interframe coding is one of the most powerful yet complicated techniques therein and has posed high computational burden thus main obstacle in HEVC-based real-time applications. Recently, plenty of research has been done to optimize the inter-frame coding, either to reduce the complexity for real-time applications, or to further enhance the encoding efficiency. In this paper, we provide a comprehensive review of the state-of-the-art techniques for HEVC inter-frame coding from three aspects, namely fast inter coding solutions, implementation on different hardware platforms as well as advanced inter coding techniques. More specifically, different algorithms in each aspect are further subdivided into sub-categories and compared in terms of pros, cons, coding efficiency and coding complexity. To the best of our knowledge, this is the first such comprehensive review of the recent advances of the inter-frame coding for HEVC and hopefully it would help the improvement, implementation and applications of HEVC as well as the ongoing development of the next generation video coding standard.

show abstract

Convolutional Neural Network-Based Block Up-Sampling for Intra Frame Coding

Cited by 118 publications

References 41 publications

Video Compression Based on Spatio-Temporal Resolution Adaptation

Video Compression Based on Spatio-Temporal Resolution Adaptation

Deep Video Precoding

Recent Advances on HEVC Inter-Frame Coding: From Optimization to Implementation and Beyond

Contact Info

Product

Resources

About