Chenhongyi Yang scite author profile

Deep learning-based dense object detectors have achieved great success in the past few years and have been applied to numerous multimedia applications such as video understanding. However, the current training pipeline for dense detectors is compromised to lots of conjunctions that may not hold. In this paper, we investigate three such important conjunctions: 1) only samples assigned as positive in classification head are used to train the regression head; 2) classification and regression share the same input feature and computational fields defined by the parallel head architecture; and 3) samples distributed in different feature pyramid layers are treated equally when computing the loss. We first carry out a series of pilot experiments to show disentangling such conjunctions can lead to persistent performance improvement. Then, based on these findings, we propose Disentangled Dense Object Detector (DDOD), in which simple and effective disentanglement mechanisms are designed and integrated into the current state-of-the-art dense object detectors. Extensive experiments on MS COCO benchmark show that our approach can lead to 2.0 mAP, 2.4 mAP and 2.2 mAP absolute improvements on RetinaNet, FCOS, and ATSS baselines with negligible extra overhead. Notably, our best model reaches 55.0 mAP on the COCO test-dev set and 93.5 AP on the hard subset of WIDER FACE, achieving new state-of-the-art performance on these two competitive benchmarks. Code is available at https://github.com/zehuichen123/DDOD. CCS CONCEPTS• Computing methodologies → Object detection.

show abstract

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection

Yang

Huang²,

Wang³

2022

183

View full text Add to dashboard Cite

Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes

Yang

Ablavsky

Wang

et al. 2020

View full text Add to dashboard Cite

Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes

Yang

Ablavsky

Wang

et al. 2019

Preprint

View full text Add to dashboard Cite

In the past decade, deep learning based visual object detection has received a significant amount of attention, but cases when heavy intra-class occlusions occur are not studied thoroughly. In this work, we propose a novel Non-Maximum-Suppression (NMS) algorithm that dramatically improves the detection recall while maintaining high precision in scenes with heavy occlusions. Our NMS algorithm is derived from a novel embedding mechanism, in which the semantic and geometric features of the detected boxes are jointly exploited. The embedding makes it possible to determine whether two heavily-overlapping boxes belong to the same object in the physical world. Our approach is particularly useful for car detection and pedestrian detection in urban scenes where occlusions tend to happen. We validate our approach on two widely-adopted datasets, KITTI and CityPersons, and achieve state-of-the-art performance.

show abstract

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection

Yang¹,

Huang²,

Wang³

2021

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Chenhongyi Yang

Disentangle Your Dense Object Detector

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection

Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes

Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection

Contact Info

Product

Resources

About