Xuepeng Shi scite author profile

Rotation-invariant face detection, i.e. detecting faces with arbitrary rotation-in-plane (RIP) angles, is widely required in unconstrained applications but still remains as a challenging task, due to the large variations of face appearances. Most existing methods compromise with speed or accuracy to handle the large RIP variations. To address this problem more efficiently, we propose Progressive Calibration Networks (PCN) to perform rotation-invariant face detection in a coarse-to-fine manner. PCN consists of three stages, each of which not only distinguishes the faces from non-faces, but also calibrates the RIP orientation of each face candidate to upright progressively. By dividing the calibration process into several progressive steps and only predicting coarse orientations in early stages, PCN can achieve precise and fast calibration. By performing binary classification of face vs. non-face with gradually decreasing RIP ranges, PCN can accurately detect faces with full 360 • RIP angles. Such designs lead to a real-time rotationinvariant face detector. The experiments on multi-oriented FDDB and a challenging subset of WIDER FACE containing rotated faces in the wild show that our PCN achieves quite promising performance. A demo of PCN can be available at https://github.com/Jack-CV/PCN .

show abstract

Geometry-based Distance Decomposition for Monocular 3D Object Detection

Shi

Chen

et al. 2021

View full text Add to dashboard Cite

Distance-Normalized Unified Representation for Monocular 3D Object Detection

Shi

Chen

Kim

2020

View full text Add to dashboard Cite

Geometry-based Distance Decomposition for Monocular 3D Object Detection

Shi

Chen³

et al. 2021

Preprint

View full text Add to dashboard Cite

Monocular 3D object detection is of great significance for autonomous driving but remains challenging. The core challenge is to predict the distance of objects in the absence of explicit depth information. Unlike regressing the distance as a single variable in most existing methods, we propose a novel geometry-based distance decomposition to recover the distance by its factors. The decomposition factors the distance of objects into the most representative and stable variables, i.e. the physical height and the projected visual height in the image plane. Moreover, the decomposition maintains the self-consistency between the two heights, leading to the robust distance prediction when both predicted heights are inaccurate. The decomposition also enables us to trace the cause of the distance uncertainty for different scenarios. Such decomposition makes the distance prediction interpretable, accurate, and robust. Our method directly predicts 3D bounding boxes from RGB images with a compact architecture, making the training and inference simple and efficient. The experimental results show that our method achieves the state-of-the-art performance on the monocular 3D Object Detection and Bird's Eye View tasks on the KITTI dataset, and can generalize to images with different camera intrinsics.

show abstract

Multivariate Probabilistic Monocular 3D Object Detection

Shi

Chen

Kim

2023

View full text Add to dashboard Cite

Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks

Shi¹,

Shan²,

Kan³

et al. 2018

Preprint

View full text Add to dashboard Cite

Semi-Supervised Object Detection with Object-wise Contrastive Learning and Regression Uncertainty

Choi¹,

Chen²,

Shi³

et al. 2022

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xuepeng Shi

Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks

Geometry-based Distance Decomposition for Monocular 3D Object Detection

Distance-Normalized Unified Representation for Monocular 3D Object Detection

Geometry-based Distance Decomposition for Monocular 3D Object Detection

Multivariate Probabilistic Monocular 3D Object Detection

Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks

Semi-Supervised Object Detection with Object-wise Contrastive Learning and Regression Uncertainty

Contact Info

Product

Resources

About