2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2015
DOI: 10.1109/cvpr.2015.7298621
|View full text |Cite
|
Sign up to set email alerts
|

Improving object detection with deep convolutional networks via Bayesian optimization and structured prediction

Abstract: Object detection systems based on the deep convolutional neural network (CNN) have recently made groundbreaking advances on several object detection benchmarks. While the features learned by these high-capacity neural networks are discriminative for categorization, inaccurate localization is still a major source of error for detection. Building upon high-capacity CNN architectures, we address the localization problem by 1) using a search algorithm based on Bayesian optimization that sequentially proposes candi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
85
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 162 publications
(85 citation statements)
references
References 39 publications
0
85
0
Order By: Relevance
“…Under 07 + 12 train val, VGG16 has achieved up to 2.1% mAP improvement. Moreover, compared to other typical region-based detectors, such as AC-CNN [9], Yuting [15], MR-CNN [1], the proposed approach yields competitive performance as well. OHEM [12] is the state-of-the-art object detection approach, which has introduced online bootstrapping to the design of network structure based on the FastRCNN framework.…”
Section: Methodsmentioning
confidence: 92%
“…Under 07 + 12 train val, VGG16 has achieved up to 2.1% mAP improvement. Moreover, compared to other typical region-based detectors, such as AC-CNN [9], Yuting [15], MR-CNN [1], the proposed approach yields competitive performance as well. OHEM [12] is the state-of-the-art object detection approach, which has introduced online bootstrapping to the design of network structure based on the FastRCNN framework.…”
Section: Methodsmentioning
confidence: 92%
“…For example, using our method can enable training directly for other rank-based metrics used in information retrieval, such as discounted cumulative gain [17]. Moreover, we do not require a potentially expensive max-oracle to find the most-violating inputs with respect to the model and loss, as required by [18,19,2].…”
Section: Introductionmentioning
confidence: 99%
“…Like our method, this is a structured loss involving IoU of detections and ground-truth objects; however, it does not correspond to maximising AP, and only a single detection is returned in each image, so there is no NMS. More recently, [2] uses the same structured SVM loss, but with a CNN in place of a kernelised linear model over SURF features [26]. This work directly optimises the structured SVM loss via gradient descent, allowing backpropagation to update the nonlinear CNN layers.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…These feature vectors are then compared to a fine-tuned pre-trained model to score regions and find the best class for each object (Girshick, et al, 2015). Zhang, et al (2015) proposed two search algorithms to localize objects with high accuracy based on Bayesian optimization and also a deep learning framework based on a structured SVM objective function and CNN classifier. The results on PASCAL VOC 2007 and 2012 benchmarks highlight the significant improvement on detection performance .…”
Section: Introductionmentioning
confidence: 99%