2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00448
|View full text |Cite
|
Sign up to set email alerts
|

Zigzag Learning for Weakly Supervised Object Detection

Abstract: This paper addresses weakly supervised object detection with only image-level supervision at training stage. Previous approaches train detection models with entire images all at once, making the models prone to being trapped in sub-optimums due to the introduced false positive examples. Unlike them, we propose a zigzag learning strategy to simultaneously discover reliable object instances and prevent the model from overfitting initial seeds. Towards this goal, we first develop a criterion named mean Energy Acc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
76
0
2

Year Published

2019
2019
2020
2020

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 130 publications
(80 citation statements)
references
References 26 publications
0
76
0
2
Order By: Relevance
“…We randomly initialize the weights of all new layers using Gaussian distributions with 0-mean and standard deviations 0.01 (except 0.001 for bounding box regressor), and initialize all new biases to 0. We follow a widely-used setting [2,29,30,37] to use Selective Search [33] to generate about 2, 000 proposals for each image. The whole network is end-to-end optimized using SGD with an initial learning rate of 10 −3 , weight decay of 0.0005 and momentum of 0.9.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We randomly initialize the weights of all new layers using Gaussian distributions with 0-mean and standard deviations 0.01 (except 0.001 for bounding box regressor), and initialize all new biases to 0. We follow a widely-used setting [2,29,30,37] to use Selective Search [33] to generate about 2, 000 proposals for each image. The whole network is end-to-end optimized using SGD with an initial learning rate of 10 −3 , weight decay of 0.0005 and momentum of 0.9.…”
Section: Methodsmentioning
confidence: 99%
“…We can observe the partial, correct, and oversized detection results for an object instance in the first, second, and third row, respectively. relieve the heavy labeling effort and reduce cost, weaklysupervised object detection paradigm has been proposed by leveraging only image-level annotations [2,30,37,38].…”
Section: Introductionmentioning
confidence: 99%
“…The main task of MIL-based detectors is to learn the discriminative representation of the object instances and then select them from positive images to train a detector. Previous works on applying MIL to WSOD can be roughly categorized into multi-phase learning approach [18,4,22,38,30,42,43,41] and end-to-end learning approach [1,39,34,19,33]. End-to-end learning approaches combine CNNs and MIL into a unified network to address weakly supervised object detection task.…”
Section: Weakly Supervised Object Detectionmentioning
confidence: 99%
“…Using these methods, they generated pseudo ground truth boxes from localization result of OICR [20] and trained a faster R-CNN [15] model. Zhang et al [26] proposed a zigzag learning strategy, in which they developed a criterion (the Energy Accumulation Score) to automatically measure and rank localization difficulty. As the localization result of WSOD is unreliable, at first they used easy images to localize and added difficult images progressively.…”
Section: Transferring Approachmentioning
confidence: 99%