2017
DOI: 10.1609/aaai.v31i1.11228
|View full text |Cite
|
Sign up to set email alerts
|

Building an End-to-End Spatial-Temporal Convolutional Network for Video Super-Resolution

Abstract: We propose an end-to-end deep network for video super-resolution. Our network is composed of a spatial component that encodes intra-frame visual patterns, a temporal component that discovers inter-frame relations, and a reconstruction component that aggregates information to predict details. We make the spatial component deep, so that it can better leverage spatial redundancies for rebuilding high-frequency structures. We organize the temporal component in a bidirectional and multi-scale fashion, to better cap… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(5 citation statements)
references
References 33 publications
0
4
0
Order By: Relevance
“…Recently, many Transformer variants [67]- [69] have also emerged in the video deblurring domain. It should be pointed out that ConvLSTM is widely used in video deblurring networks [70], [71] but other ConvRNNs (ConvLSTM variants) have not been adopted for video deblurring.…”
Section: B Video Deblurring Modelsmentioning
confidence: 99%
“…Recently, many Transformer variants [67]- [69] have also emerged in the video deblurring domain. It should be pointed out that ConvLSTM is widely used in video deblurring networks [70], [71] but other ConvRNNs (ConvLSTM variants) have not been adopted for video deblurring.…”
Section: B Video Deblurring Modelsmentioning
confidence: 99%
“…Thus, the VSR technique is divided into two categories depending on the ways of the utilization of inter-frame information: 5 (1) method without alignment, such as Refs. 6 and 7. The non-local residual block is applied to capture long-term spatio-temporal correlations in Ref.…”
Section: Related Workmentioning
confidence: 99%
“…The non-local residual block is applied to capture long-term spatio-temporal correlations in Ref. 6, and Guo and Chao 7 extract inter-frame temporal information by long short term memory (LSTM). (2) Method with alignment, such as Refs.…”
Section: Related Workmentioning
confidence: 99%
“…The recurrent framework is popular for many video processing tasks including super-resolution [7,8,9,10,11,12,13]. The recurrent framework could either be unidirectional [8], bidirectional [13] , or omnidirectional [14].…”
Section: Recurrence Structurementioning
confidence: 99%