2021
DOI: 10.48550/arxiv.2104.00531
Preprint

Extending Neural P-frame Codecs for B-frame Coding

Abstract: While most neural video codecs address P-frame coding (predicting each frame from past ones), in this paper we address B-frame compression (predicting frames using both past and future reference frames). Our B-frame solution is based on the existing P-frame methods. As a result, B-frame coding capability can easily be added to an existing neural codec. The basic idea of our B-frame coding method is to interpolate the two reference frames to generate a single reference frame and then use it together with an exi…
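The idea stated in the abstract (interpolate the two references into a single reference, then hand that reference to a P-frame codec) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the paper's method: a plain average stands in for the learned frame interpolator, a uniform residual quantizer stands in for the neural P-frame codec, and the names `interpolate_references`, `toy_p_encode`, `toy_p_decode`, and `code_b_frame` are hypothetical.

```python
import numpy as np


def interpolate_references(past, future):
    """Stand-in interpolator (assumption): average the two reference frames."""
    return 0.5 * (past.astype(np.float64) + future.astype(np.float64))


def toy_p_encode(frame, reference, step=8.0):
    """Hypothetical P-frame encoder: quantize the residual against ONE reference."""
    residual = frame.astype(np.float64) - reference
    return np.round(residual / step).astype(np.int32)


def toy_p_decode(symbols, reference, step=8.0):
    """Hypothetical P-frame decoder: add the dequantized residual to the reference."""
    return reference + symbols.astype(np.float64) * step


def code_b_frame(frame, past, future, step=8.0):
    """B-frame coding via a P-frame codec: build one reference, then code as usual."""
    reference = interpolate_references(past, future)
    symbols = toy_p_encode(frame, reference, step)
    reconstruction = toy_p_decode(symbols, reference, step)
    return symbols, reconstruction


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    past = rng.integers(0, 256, size=(4, 4))
    future = rng.integers(0, 256, size=(4, 4))
    # A frame that lies roughly between the two references.
    frame = np.round(0.5 * (past + future)) + rng.integers(-3, 4, size=(4, 4))
    symbols, reconstruction = code_b_frame(frame, past, future)
    print("nonzero residual symbols:", int(np.count_nonzero(symbols)))
    print("max abs error:", float(np.max(np.abs(reconstruction - frame))))
```

In this toy setting the P-frame encoder and decoder are unchanged; only the construction of the reference differs, which is the sense in which B-frame capability is layered on an existing P-frame codec.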

Cited by 1 publication (1 citation statement)
References 29 publications (59 reference statements)
“…Video frame interpolation (VFI) (also known as Video temporal super-resolution) is a significant video enhancement problem which aims to synthesize one or more visually coherent frames between two consecutive frames in a video, i.e., to up-scale the number of video frames. Such an up-scaling method finds its usage in numerous video-based applications such as slow-motion video generation (e.g., in sports and TV commercials), video compression-decompression framework [43], generating short videos from GIF images [48], novel view synthesis [12] and medical imaging [27,55].…”
Section: Introduction (citation type: mentioning)
confidence: 99%