Bei Liu scite author profile

Bei Liu

3Publications

7Citation Statements Received

67Citation Statements Given

How they've been cited

How they cite others

Affiliations

Microsoft Research Asia (China)

Publications

Order By: Most citations

Reference-Based Defect Detection Network

Zeng

Liu

et al. 2021

IEEE Trans. on Image Process.

View full text Add to dashboard Cite

The defect detection task can be regarded as a realistic scenario of object detection in the computer vision field and it is widely used in the industrial field. Directly applying vanilla object detector to defect detection task can achieve promising results, while there still exists challenging issues that have not been solved. The first issue is the texture shift which means a trained defect detector model will be easily affected by unseen texture, and the second issue is partial visual confusion which indicates that a partial defect box is visually similar with a complete box. To tackle these two problems, we propose a Reference-based Defect Detection Network (RDDN). Specifically, we introduce template reference and context reference to against those two problems, respectively. Template reference can reduce the texture shift from image, feature or region levels, and encourage the detectors to focus more on the defective area as a result. We can use either well-aligned template images or the outputs of a pseudo template generator as template references in this work, and they are jointly trained with detectors by the supervision of normal samples. To solve the partial visual confusion issue, we propose to leverage the carried context information of context reference, which is the concentric bigger box of each region proposal, to perform more accurate region classification and regression. Experiments on two defect detection datasets demonstrate the effectiveness of our proposed approach.

show abstract

Language-Guided Face Animation by Recurrent StyleGAN-Based Generator

Hang

Yang

Liu

et al. 2023

IEEE Trans. Multimedia

View full text Add to dashboard Cite

MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding

Liu

Chen

et al. 2021

View full text Add to dashboard Cite

Pre-training has been an emerging topic that provides a way to learn strong representation in many fields (e.g., natural language processing, computing vision). In the last few years, we have witnessed many research works on multi-modal pre-training which have achieved state-of-the-art performances on many multimedia tasks (e.g., image-text retrieval, video localization, speech recognition). In this workshop, we aim to gather peer researchers on related topics for more insightful discussion. We also intend to attract more researchers to explore and investigate more opportunities of designing and using innovative pre-training models for multimedia tasks. CCS CONCEPTS• Information systems → Multimedia content creation; Multimedia and multimodal retrieval.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Bei Liu

Reference-Based Defect Detection Network

Language-Guided Face Animation by Recurrent StyleGAN-Based Generator

MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding

Contact Info

Product

Resources

About