2020
DOI: 10.1609/aaai.v34i07.6693
|View full text |Cite
|
Sign up to set email alerts
|

Channel Attention Is All You Need for Video Frame Interpolation

Abstract: Prevailing video frame interpolation techniques rely heavily on optical flow estimation and require additional model complexity and computational cost; it is also susceptible to error propagation in challenging scenarios with large motion and heavy occlusion. To alleviate the limitation, we propose a simple but effective deep neural network for video frame interpolation, which is end-to-end trainable and is free from a motion estimation network component. Our algorithm employs a special feature reshaping opera… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
131
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 239 publications
(150 citation statements)
references
References 31 publications
0
131
0
Order By: Relevance
“…Regarding the frame interpolation as a local convolution over the two input frames, Niklaus et al [40], [41] utilized a CNN to learn a spatially-adaptive convolution kernel for each pixel. Choi et al [6] introduced a feature reshaping operation with Pixelshuffle [53] and a channel attention module for motion estimation. Lee et al [24] proposed adaptive collaboration of flows as a new warping module to deal with complex motions.…”
Section: Video Frame Interpolationmentioning
confidence: 99%
“…Regarding the frame interpolation as a local convolution over the two input frames, Niklaus et al [40], [41] utilized a CNN to learn a spatially-adaptive convolution kernel for each pixel. Choi et al [6] introduced a feature reshaping operation with Pixelshuffle [53] and a channel attention module for motion estimation. Lee et al [24] proposed adaptive collaboration of flows as a new warping module to deal with complex motions.…”
Section: Video Frame Interpolationmentioning
confidence: 99%
“…The attention mechanism in CNN understands and perceives images in a way that simulates humans and differentially weights global features to highlight key local features. In recent years, many channel attention [4], [64], [6], [65], [66], [67] mechanisms have been successfully applied to different computer vision tasks such as image classification, semantic segmentation, object detection, and image translation. Hu et al [2].…”
Section: Related Work (I)mentioning
confidence: 99%
“…We compare our method with 7 existing methods, including SepConv [17], CtxSyn [31], SoftSplat [32], DAIN [13], BMBC [33], CAIN [34] and RRIN [6] in Table 5. The values marked in red mean best performance, while blue represents second best.…”
Section: Quantitative Evaluationmentioning
confidence: 99%