With the prosperity of the digital video industry, video frame interpolation has attracted continuous attention in the computer vision community and become a new focus in industry. Many learning-based methods have been proposed and achieved progressive results. Among them, a recent algorithm named quadratic video interpolation (QVI) achieves appealing performance. It exploits higher-order motion information (e.g., acceleration) and successfully models the estimation of interpolated flow. However, its intermediate frames still contain unsatisfactory ghosting, artifacts, and inaccurate motion, especially when large and complex motion occurs. In this work, we further improve the performance of QVI from three facets and propose an enhanced quadratic video interpolation (EQVI) model. In particular, we adopt a rectified quadratic flow prediction (RQFP) formulation with a least-squares method to estimate the motion more accurately. Complementary to image pixel-level blending, we introduce a residual contextual synthesis network (RCSN) that employs contextual information in a high-dimensional feature space, helping the model handle more complicated scenes and motion patterns. Moreover, to further boost the performance, we devise a novel multi-scale fusion network (MS-Fusion), which can be regarded as a learnable augmentation process. The proposed EQVI model won first place in the AIM 2020 Video Temporal Super-Resolution Challenge. Code is available at https://github.com/lyh-18/EQVI.
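The quadratic motion model underlying QVI-style interpolation can be fitted with ordinary least squares: given flows from the reference frame to several neighboring frames, one solves per pixel for a velocity and an acceleration term. The sketch below illustrates this idea under assumed reference times (t = -1, 1, 2 for four input frames); the function names and setup are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def fit_quadratic_flow(flows, times):
    """Fit per-pixel displacement d(t) = v*t + 0.5*a*t**2 by least squares.

    flows: (N, H, W, 2) flows from frame 0 to the frames at `times`
           (e.g. t = -1, 1, 2 when four consecutive frames are given).
    times: (N,) time offsets of those frames.
    Returns per-pixel velocity v and acceleration a, each of shape (H, W, 2).
    """
    t = np.asarray(times, dtype=np.float64)
    A = np.stack([t, 0.5 * t**2], axis=1)           # (N, 2) design matrix
    B = flows.reshape(len(t), -1)                   # (N, H*W*2) observations
    coeffs, *_ = np.linalg.lstsq(A, B, rcond=None)  # solve A @ coeffs ≈ B
    v, a = coeffs.reshape(2, *flows.shape[1:])
    return v, a

def predict_flow(v, a, t):
    """Predicted flow from frame 0 to an arbitrary intermediate time t."""
    return v * t + 0.5 * a * t**2
```

With three or more observed flows, the system is overdetermined and the least-squares fit rectifies noisy flow estimates, which is the intuition behind the RQFP formulation.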
Construction sites are dangerous due to the complex interaction of workers with equipment, building materials, vehicles, etc. As a kind of protective gear, hardhats are crucial for the safety of people on construction sites. Therefore, it is necessary for administrators to identify people who are not wearing hardhats and send out alarms to them. As manual inspection is labor-intensive and expensive, it is preferable to handle this issue with a real-time automatic detector. To that end, in this paper, we present an end-to-end convolutional neural network to detect whether workers are wearing hardhats. The proposed method focuses on localizing a person’s head and deciding whether a hardhat is worn. The MobileNet model is employed as the backbone network, which allows the detector to run in real time. A top-down module is leveraged to enhance the feature-extraction process. Finally, heads with and without hardhats are detected on multi-scale features using a residual-block-based prediction module. Experimental results on a dataset that we have established show that the proposed method produces an average precision of 87.4%/89.4% at 62 frames per second for detecting people without/with a hardhat worn on the head.
Figure 1: Comparisons of bicubic-upsampled, ESRGAN [44], RealSR [17], and our Real-ESRGAN results on real-life images. The Real-ESRGAN model trained with pure synthetic data is capable of enhancing details while removing annoying artifacts for common real-world images. Code: https://github.com/xinntao/Real-ESRGAN