Recently, neural video compression networks have obtained impressive results. However, previous neural video compression models mostly focus on low-delay configuration with the order of display being the same as the order of coding. In this paper, we propose a hierarchical random access coding approach that exploits bidirectionally temporal redundancy to improve the coding efficiency of existing deep neural video compression models. The proposed framework applies a video frame interpolation network to improve inter-frame prediction. In addition, a hierarchical coding structure is also proposed in this paper. Experimental results show the proposed framework improves the coding efficiency of the base deep neural model by 48.01% with the UVG dataset, 50.96% with the HEVC-class B dataset, and outperforms the previous deep neural video compression networks.INDEX TERMS Neural video compression, hierarchical random access coding, video frame interpolation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.