2020
DOI: 10.1007/978-3-030-58595-2_29
|View full text |Cite
|
Sign up to set email alerts
|

A Flexible Recurrent Residual Pyramid Network for Video Frame Interpolation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 37 publications
(20 citation statements)
references
References 34 publications
0
20
0
Order By: Relevance
“…Partial cost volume is used to represent matching cost associated with different disparities. Inspired by classical pyramid energy minimization in optical flow algorithms, RRPN [30] designed a recurrent residual pyramid architecture for video frame interpolation to refine optical flow using a shared network for every pyramid level. Following above methods, we also exploit the advantages of classical principles of optical flow -the pyramid structure, multi-scale warping, and cost volume.…”
Section: B Pyramid Structure and The Cost Volumementioning
confidence: 99%
See 1 more Smart Citation
“…Partial cost volume is used to represent matching cost associated with different disparities. Inspired by classical pyramid energy minimization in optical flow algorithms, RRPN [30] designed a recurrent residual pyramid architecture for video frame interpolation to refine optical flow using a shared network for every pyramid level. Following above methods, we also exploit the advantages of classical principles of optical flow -the pyramid structure, multi-scale warping, and cost volume.…”
Section: B Pyramid Structure and The Cost Volumementioning
confidence: 99%
“…al. [30] designed a recurrent residual pyramid architecture to refine optical flow using a shared network across pyramid levels. Other methods, despite the usage of multi-scale features, only generate one-stage optical flow [3,8,16].…”
Section: Introductionmentioning
confidence: 99%
“…Various efforts have been made to alleviate this issue, including the use of depth information [5], higher order motion models [33,60], and adaptive forward warping [41]. A second group of methods [25,34,44,45,61,65] have been developed to improve approximation by directly predicting intermediate flows. These approaches typically employ a coarse-to-fine architecture, which supports a larger receptive field for capturing large motions.…”
Section: Video Frame Interpolationmentioning
confidence: 99%
“…While flowbased methods use the estimated optical flow maps to warp input frames, kernel-based methods learn local or shared convolution kernels for synthesizing the output. To handle challenging scenarios encountered in VFI applications, various techniques have been employed to enhance these methods, including non-linear motion models [45,50,60], coarse-to-fine architectures [10,44,50,65], attention mechanisms [12,29], and deformable convolutions [21,32].…”
Section: Introductionmentioning
confidence: 99%
“…However, VFI is significantly challenging, which is attributed to diverse factors such as occlusions, large motions and change of light. Recent deeplearning-based VFI has been actively studied, showing remarkable performances [48,4,7,37,25,13,31,51,6,33]. However, they are often optimized for existing LFR benchmark datasets of low resolution (LR), which may lead to poor VFI performance, especially for videos of 4K resolution (4096×2160) or higher with very large motion [1,21].…”
Section: Introductionmentioning
confidence: 99%