2016
DOI: 10.1007/978-3-319-46448-0_43
|View full text |Cite
|
Sign up to set email alerts
|

Weakly Supervised Localization Using Deep Feature Maps

Abstract: Object localization is an important computer vision problem with a variety of applications. The lack of large scale object-level annotations and the relative abundance of image-level labels makes a compelling case for weak supervision in the object localization task. Deep Convolutional Neural Networks are a class of state-of-the-art methods for the related problem of object recognition. In this paper, we describe a novel object localization algorithm which uses classification networks trained on only image lab… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
59
0
1

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 63 publications
(60 citation statements)
references
References 45 publications
0
59
0
1
Order By: Relevance
“…In spite of its simple and multipurpose architecture, our model outperforms by a large margin the complex cascaded architecture of ProNet [58]. It also outperforms the recent weakly supervised model [5] by 3.2 pt (resp. 4.2 pt) on VOC 2012 (resp.…”
Section: Weakly Supervised Pointwise Localizationmentioning
confidence: 79%
See 2 more Smart Citations
“…In spite of its simple and multipurpose architecture, our model outperforms by a large margin the complex cascaded architecture of ProNet [58]. It also outperforms the recent weakly supervised model [5] by 3.2 pt (resp. 4.2 pt) on VOC 2012 (resp.…”
Section: Weakly Supervised Pointwise Localizationmentioning
confidence: 79%
“…Concerning the WSL localization task, [5] uses label co-occurrence information and a coarse-to-fine strategy based on deep feature maps to predict object locations. ProNet [58] uses a cascade of two networks: the first generates bounding boxes and the second classifies them.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…2. The first module estimates image difficulty automatically via a backbone network [18] trained with only image-level labels. The second module progressively adds samples to network training in an ascending order based on image difficulty.…”
Section: Methodsmentioning
confidence: 99%
“…Once we obtain the image difficulty, the remaining task is to mine object instances from the images. A natural way is to directly choose the top scored region as the target object, which is used for localization evaluation in [18]. However, since the whole network is trained with classification loss, which makes high scored regions tend to focus on object parts rather than the whole objects.…”
Section: Estimating Image Difficultymentioning
confidence: 99%