2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00644
|View full text |Cite
|
Sign up to set email alerts
|

Cascade R-CNN: Delving Into High Quality Object Detection

Abstract: In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An object detector, trained with low IoU threshold, e.g. 0.5, usually produces noisy detections. However, detection performance tends to degrade with increasing the IoU thresholds. Two main factors are responsible for this: 1) overfitting during training, due to exponentially vanishing positive samples, and 2) inference-time mismatch between the IoUs for which the detector is optimal and those of th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

7
2,558
0
6

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 4,703 publications
(2,933 citation statements)
references
References 35 publications
7
2,558
0
6
Order By: Relevance
“…Object detection has attracted a great deal of attention in recent years [4,13,14,16,19,20,27,28,30,38,39,43,47,48,56]. One popular direction for recent object detection is proposal-based object detectors (a.k.a.…”
Section: Related Workmentioning
confidence: 99%
“…Object detection has attracted a great deal of attention in recent years [4,13,14,16,19,20,27,28,30,38,39,43,47,48,56]. One popular direction for recent object detection is proposal-based object detectors (a.k.a.…”
Section: Related Workmentioning
confidence: 99%
“…Compared with baseline, our model outputs more accurate boxes and detects pedestrians with heavy occlusion. equation (5), the performance reaches 12.96%. Though the sign prediction loss indeed helps improving the performance, one can argue that it is because the loss involved with box prediction is increased and the sign predictor structure is not necessary.…”
Section: Methodsmentioning
confidence: 95%
“…It only selects the proper samples which fall in the desired scale range under different pyramids for training. Cascade R-CNN [5] adopts cascaded classifiers where training samples with increasingly higher overlap with ground truths are fed. Online hard example mining (OHEM) [6] dynamically chooses the samples with the highest loss in a batch to achieve better convergence and performance.…”
Section: Introductionmentioning
confidence: 99%
“…Our design strategy is to select the model of the highest accuracy from the existing state-of-the-art ones at first and then improve the efficiency of the model. Among the existing models, Cascade R-CNN [6] with ResNeXt-101 [10] backbone has the best accuracy on MS COCO dataset [11]. To further boost the performance, we add Feature Pyramid Network (FPN) [5] to the backbone of the Cascade R-CNN model so that features at different scales can be extracted better.…”
Section: A Design Of High-accuracy Modelmentioning
confidence: 99%