2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.01077
|View full text |Cite
|
Sign up to set email alerts
|

Fast Spatio-Temporal Residual Network for Video Super-Resolution

Abstract: Recently, deep learning based video super-resolution (SR) methods have achieved promising performance. To simultaneously exploit the spatial and temporal information of videos, employing 3-dimensional (3D) convolutions is a natural approach. However, straight utilizing 3D convolutions may lead to an excessively high computational complexity which restricts the depth of video SR models and thus undermine the performance. In this paper, we present a novel fast spatio-temporal residual network (FSTRN) to adopt 3D… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
72
0
1

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 134 publications
(73 citation statements)
references
References 37 publications
0
72
0
1
Order By: Relevance
“…These methods generally have good time efficiency, but because they do not explicitly align images, their alignment effect is difficult to quantitatively analyze. Long short-term memory (LSTM) networks [66] and 3D convolutions [67], [68] are also widely used in the field of video reconstruction. Considering that they do not register images or features, we do not compare these algorithms in this paper.…”
Section: ) Image Superresolutionmentioning
confidence: 99%
“…These methods generally have good time efficiency, but because they do not explicitly align images, their alignment effect is difficult to quantitatively analyze. Long short-term memory (LSTM) networks [66] and 3D convolutions [67], [68] are also widely used in the field of video reconstruction. Considering that they do not register images or features, we do not compare these algorithms in this paper.…”
Section: ) Image Superresolutionmentioning
confidence: 99%
“…The pipeline architecture follows feature extraction, feature alignment, fusion and frame reconstruction. Recently, Wang et al [23] reported that the feature alignment and fusion are the crucial steps when the video frames have more occlusion, motion and blurriness. Hence, in order to generate the high-quality HR output, we also focus on the feature alignment and fusion.…”
Section: Align-filter and Learn Video Super Resolution Using Deep Learnmentioning
confidence: 99%
“…3D-Convolution is known as a promising technique for this purpose but due to increased computational complexity the performance of 3D convolutions may degrade. In order to solve this issue, Li et al [23] presented spatio-temporal residual network. In this process, spatio-temporal residual block (STRB) is proposed which divides the 3D filters to reduce the dimension.…”
Section: Literature Surveymentioning
confidence: 99%
“…This method enhances performance while maintains a low computational load. 15 Another similar study has suggested spatio-temporal subpixel convolution networks. These networks effectively exploit temporal redundancies and therefore improve reconstruction accuracy with preserving real-time speed.…”
Section: Introductionmentioning
confidence: 96%
“…12 In recent years, the methods based on sparse representation have been used for the echocardiography temporal SR based on the post-processing techniques. 13,14 In recent times, some studies have used a combination of temporal and spatial information in a variety of ways for video SR. [15][16][17][18][19] Deep learning-based video SR methods are the topics of interest in this field, which have promising performances. For example, a novel fast spatio-temporal residual network (FSTRN) is offered to adopt 3D convolutions for the temporal SR task on natural images.…”
Section: Introductionmentioning
confidence: 99%