2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2015
DOI: 10.1109/cvpr.2015.7298780
|View full text |Cite
|
Sign up to set email alerts
|

From image-level to pixel-level labeling with Convolutional Networks

Abstract: We are interested in inferring object segmentation by leveraging only object class information, and by considering only minimal priors on the object segmentation task. This problem could be viewed as a kind of weakly supervised segmentation task, and naturally fits the Multiple Instance Learning (MIL) framework: every training image is known to have (or not) at least one pixel corresponding to the image class label, and the segmentation task can be rewritten as inferring the pixels belonging to the class of th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
504
0
2

Year Published

2017
2017
2019
2019

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 568 publications
(509 citation statements)
references
References 25 publications
3
504
0
2
Order By: Relevance
“…We can see that WILDCAT without CRF outperforms existing weakly supervised models by a large margin. We note a large gain with respect to MIL models based on (soft-)max pooling [49,50], which validates the relevance of our pooling for segmentation. The improvement between WILD-CAT with CRF and the best model is 7.1 pt.…”
Section: Weakly Supervised Segmentationmentioning
confidence: 99%
See 2 more Smart Citations
“…We can see that WILDCAT without CRF outperforms existing weakly supervised models by a large margin. We note a large gain with respect to MIL models based on (soft-)max pooling [49,50], which validates the relevance of our pooling for segmentation. The improvement between WILD-CAT with CRF and the best model is 7.1 pt.…”
Section: Weakly Supervised Segmentationmentioning
confidence: 99%
“…Mean IoU MIL-FCN [49] 24.9 MIL-Base+ILP+SP-sppxl [50] 36.6 EM-Adapt +FC-CRF [45] 33.8 CCNN + FC-CRF [48] 35.3 WILDCAT 39.2 WILDCAT + FC-CRF 43.7 With a quite more complex strategy, the very recent paper [31] presents impressive results (50.7 MIoU). The training scheme in [31] incorporates different terms, which are specifically tailored to segmentation: one enforces the segmentation mask to match low-level image boundaries, another one incorporates prior knowledge to support predicted classes to occupy a certain image proportion.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Another approach considers modifying the classifier training procedure so as to have it generate object masks as byproduct of a forward-pass. This can be achieved by adding a global a max-pooling [33] or mean-pooling layer [54] in the last stages of the classifier. In this work we provide an empirical comparison of existing seeders, and explore variants of the mean-pooling approach [54] …”
Section: Related Workmentioning
confidence: 99%
“…Strongly co-occurring categories (such as train and rails, sculls and oars, snowbikes and snow) cannot be separated without additional information. Because additional information is needed to solve the task, previous work have explored different avenues, including class-specific size priors [31], crawling additional images [33,46], or requesting corrections from a human judge [17,37].…”
Section: Introductionmentioning
confidence: 99%