2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00632
|View full text |Cite
|
Sign up to set email alerts
|

Temporal Modulation Network for Controllable Space-Time Video Super-Resolution

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
61
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 77 publications
(75 citation statements)
references
References 39 publications
0
61
0
Order By: Relevance
“…However, these VSR models would only produce pre-defined intermediate frames, causing them constrained to highly-controlled scenarios with fixed frame-rate videos. Consequently, exploiting controllable spatio-temporal VSR approaches, which with the deformable convolution network, for smooth motion synthesizing it is necessary (Xu et al 2021).…”
Section: Discussionmentioning
confidence: 99%
“…However, these VSR models would only produce pre-defined intermediate frames, causing them constrained to highly-controlled scenarios with fixed frame-rate videos. Consequently, exploiting controllable spatio-temporal VSR approaches, which with the deformable convolution network, for smooth motion synthesizing it is necessary (Xu et al 2021).…”
Section: Discussionmentioning
confidence: 99%
“…Zooming Slow-Mo [5] developed a unified framework with deformable ConvLSTM to align and aggregate temporal information and then synthesize the intermediate features by a bidirectional recurrent network before performing feature fusion for STVSR. Based on Zooming Slow-Mo, xu et al [7] proposed a temporal modulation network via locallytemporal feature comparison module and deformable convolution kernels for controllable feature interpolation, which can interpolate arbitrary intermediate frames.…”
Section: Video Time-space Super-resolutionmentioning
confidence: 99%
“…For the two-stage methods, we perform video frame interpolation (VFI) by SuperSloMo [15], [46] or SepConv [24], and perform video super-resolution (VSR) by Bicubic Interpolation (BI), RCAN [47], RBPN [9] or EDVR [8]. For one-stage STVSR models, we compare our network with recently state-of-the-art methods Zooming SlowMo [5] ,STARnet [6] or TMnet [7]. When training, we use Vimeo-90K trainset [42] and feed odd LR frames into the model and reconstruct HR frames corresponding to the frames of the entire sequence.…”
Section: B Comparison With State-of-the-artsmentioning
confidence: 99%
See 2 more Smart Citations