Deep learning-based methods have achieved significant breakthroughs in real-time video super-resolution in recent years. However, when handling complex environments and large-motion scenes, these methods are prone to blurring, unnatural textures, and other distortions in the reconstructed video, which severely degrades reconstruction quality. Moreover, real-time video super-resolution networks trained with generative adversarial networks rely only on simple feature modeling: they can attain excellent subjective perceptual quality, but their objective metrics are lower and their outputs contain artifacts. To address these problems, we propose a dual-contrast adaptive network for real-time video super-resolution, called DCANet, which fully captures the motion offsets between neighboring frames through an adaptive optical flow network to enable accurate alignment. A dual-channel feature extraction module is proposed to acquire contextual features from neighboring frames and achieve deep feature modeling. To further enhance reconstruction quality, the training strategy combines contrastive learning with the adversarial mechanism, and a dual-contrast loss function is proposed to guide network training. Extensive experiments on multiple benchmark test sets of complex video scenes show that our method achieves an inference latency of 14.91 ms while generating reconstructed videos with high fidelity and perceptual quality, making it suitable for real-time inference in practical deployments. The code is available at https://github.com/Swaggyp1sz/DCANet.
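For intuition only, below is a minimal PyTorch sketch of what a dual-contrast objective of this kind could look like: a contrastive term that pulls the super-resolved frame toward the high-resolution target and pushes it away from a degraded negative, combined with an adversarial term. The abstract does not define the actual loss, so the class name `DualContrastLoss`, the bicubic negative sample, and the weights `alpha`/`beta` are all illustrative assumptions, not the paper's formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DualContrastLoss(nn.Module):
    """Illustrative dual-contrast training objective (NOT the paper's exact
    loss): a contrastive ratio that draws the SR frame toward the HR ground
    truth and away from a degraded negative (e.g., a bicubic upsample),
    plus a non-saturating adversarial term from a discriminator.
    The weights `alpha` and `beta` are hypothetical."""

    def __init__(self, alpha: float = 1.0, beta: float = 5e-3):
        super().__init__()
        self.alpha = alpha
        self.beta = beta

    def contrastive_term(self, sr, hr, neg):
        # Distance to the positive (HR) divided by distance to the negative:
        # minimizing this ratio pulls SR toward HR and pushes it from neg.
        d_pos = F.l1_loss(sr, hr)
        d_neg = F.l1_loss(sr, neg)
        return d_pos / (d_neg + 1e-7)

    def forward(self, sr, hr, neg, fake_logits):
        # Non-saturating generator loss: the discriminator should
        # classify the super-resolved frame as real.
        adv = F.binary_cross_entropy_with_logits(
            fake_logits, torch.ones_like(fake_logits))
        return self.alpha * self.contrastive_term(sr, hr, neg) + self.beta * adv


# Usage sketch with random tensors standing in for DCANet's actual
# generator/discriminator outputs, which are not specified here.
if __name__ == "__main__":
    sr = torch.rand(1, 3, 128, 128)    # super-resolved frame
    hr = torch.rand(1, 3, 128, 128)    # ground-truth HR frame
    neg = F.interpolate(               # bicubic down/up-sampled negative
        F.interpolate(hr, scale_factor=0.25, mode="bicubic"),
        size=hr.shape[-2:], mode="bicubic")
    fake_logits = torch.rand(1, 1)     # discriminator logits on the SR frame
    loss = DualContrastLoss()(sr, hr, neg, fake_logits)
    print(loss.item())
```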