Cascade R-CNN: Delving Into High Quality Object Detection

Cai, Zhaowei; Vasconcelos, Nuno

doi:10.1109/cvpr.2018.00644

Cited by 4,703 publications

(2,933 citation statements)

References 35 publications

Supporting

Mentioning

2,558

Contrasting

Unclassified

Order By: Relevance

“…Object detection has attracted a great deal of attention in recent years [4,13,14,16,19,20,27,28,30,38,39,43,47,48,56]. One popular direction for recent object detection is proposal-based object detectors (a.k.a.…”

Section: Related Workmentioning

confidence: 99%

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection

Tang

Wang

Bai

et al. 2020

IEEE Trans. Pattern Anal. Mach. Intell.

333

310

View full text Add to dashboard Cite

In this paper, we focus on semi-supervised object detection to boost accuracies of proposal-based object detectors (a.k.a. two-stage object detectors) by training on both labeled and unlabeled data. However, it is non-trivial to train object detectors on unlabeled data due to the unavailability of ground truth labels. To address this problem, we present a proposal learning approach to learn proposal features and predictions from both labeled and unlabeled data. The approach consists of a self-supervised proposal learning module and a consistency-based proposal learning module. In the self-supervised proposal learning module, we present a proposal location loss and a contrastive loss to learn context-aware and noise-robust proposal features respectively. In the consistency-based proposal learning module, we apply consistency losses to both bounding box classification and regression predictions of proposals to learn noise-robust proposal features and predictions. Experiments are conducted on the COCO dataset with all available labeled and unlabeled data. Results show that our approach consistently improves the accuracies of fullysupervised baselines. In particular, after combining with data distillation [37], our approach improves AP by about 2.0% and 0.9% on average compared with fully-supervised baselines and data distillation baselines respectively.

show abstract

Section: Related Workmentioning

confidence: 99%

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection

Tang

Wang

Bai

et al. 2020

IEEE Trans. Pattern Anal. Mach. Intell.

333

310

View full text Add to dashboard Cite

show abstract

“…Compared with baseline, our model outputs more accurate boxes and detects pedestrians with heavy occlusion. equation (5), the performance reaches 12.96%. Though the sign prediction loss indeed helps improving the performance, one can argue that it is because the loss involved with box prediction is increased and the sign predictor structure is not necessary.…”

Section: Methodsmentioning

confidence: 95%

“…It only selects the proper samples which fall in the desired scale range under different pyramids for training. Cascade R-CNN [5] adopts cascaded classifiers where training samples with increasingly higher overlap with ground truths are fed. Online hard example mining (OHEM) [6] dynamically chooses the samples with the highest loss in a batch to achieve better convergence and performance.…”

Section: Introductionmentioning

confidence: 99%

Occluded Pedestrian Detection with Visible IoU and Box Sign Predictor

2019

2019 IEEE International Conference on Image Processing (ICIP)

View full text Add to dashboard Cite

Training a robust classifier and an accurate box regressor are difficult for occluded pedestrian detection. Traditionally adopted Intersection over Union (IoU) measurement does not consider the occluded region of the object and leads to improper training samples. To address such issue, a modification called visible IoU is proposed in this paper to explicitly incorporate the visible ratio in selecting samples. Then a newly designed box sign predictor is placed in parallel with box regressor to separately predict the moving direction of training samples. It leads to higher localization accuracy by introducing sign prediction loss during training and sign refining in testing. Following these novelties, we obtain state-of-the-art performance on CityPersons benchmark for occluded pedestrian detection.

show abstract

“…Our design strategy is to select the model of the highest accuracy from the existing state-of-the-art ones at first and then improve the efficiency of the model. Among the existing models, Cascade R-CNN [6] with ResNeXt-101 [10] backbone has the best accuracy on MS COCO dataset [11]. To further boost the performance, we add Feature Pyramid Network (FPN) [5] to the backbone of the Cascade R-CNN model so that features at different scales can be extracted better.…”

Section: A Design Of High-accuracy Modelmentioning

confidence: 99%

IMMVP: An Efficient Daytime and Nighttime On-Road Object Detector

Chan

Chen

et al. 2019

2019 IEEE 21st International Workshop on Multimedia Signal Processing (MMSP)

View full text Add to dashboard Cite

It is hard to detect on-road objects under various lighting conditions. To improve the quality of the classifier, three techniques are used. We define subclasses to separate daytime and nighttime samples. Then we skip similar samples in the training set to prevent overfitting. With the help of the outside training samples, the detection accuracy is also improved. To detect objects in an edge device, Nvidia Jetson TX2 platform, we exert the lightweight model ResNet-18 FPN as the backbone feature extractor. The FPN (Feature Pyramid Network) generates good features for detecting objects over various scales. With Cascade R-CNN technique, the bounding boxes are iteratively refined for better results.

show abstract

Cascade R-CNN: Delving Into High Quality Object Detection

Cited by 4,703 publications

References 35 publications

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection

Occluded Pedestrian Detection with Visible IoU and Box Sign Predictor

IMMVP: An Efficient Daytime and Nighttime On-Road Object Detector

Contact Info

Product

Resources

About