2013 IEEE Conference on Computer Vision and Pattern Recognition 2013
DOI: 10.1109/cvpr.2013.386
|View full text |Cite
|
Sign up to set email alerts
|

Finding Things: Image Parsing with Regions and Per-Exemplar Detectors

Abstract: This paper presents a system for image parsing, or labeling each pixel in an image with its semantic category, aimed at achieving broad coverage across hundreds of object categories, many of them sparsely sampled. The system combines region-level features with per-exemplar sliding window detectors. Per-exemplar detectors are better suited for our parsing task than traditional bounding box detectors: they perform well on classes with little training data and high intra-class variation, and they allow object mas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
182
1

Year Published

2014
2014
2022
2022

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 194 publications
(186 citation statements)
references
References 27 publications
(52 reference statements)
2
182
1
Order By: Relevance
“…Compared with Tighe and Lazebnik's extended SuperParsing algorithm with per-exemplar detectors and a combination SVM [33], CollageParsing obtains 1.9% higher per-class accuracy and 1.5% lower per-pixel accuracy. Compared with Farabet et al's system [9] with "natural" training, at a tradeoff of 1.4% lower per-pixel accuracy an 11.5% per-class accuracy improvement is obtained.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…Compared with Tighe and Lazebnik's extended SuperParsing algorithm with per-exemplar detectors and a combination SVM [33], CollageParsing obtains 1.9% higher per-class accuracy and 1.5% lower per-pixel accuracy. Compared with Farabet et al's system [9] with "natural" training, at a tradeoff of 1.4% lower per-pixel accuracy an 11.5% per-class accuracy improvement is obtained.…”
Section: Methodsmentioning
confidence: 99%
“…As discussed in Section 1, this characteristic makes CollageParsing (and other nonparametric approaches) particularly well suited for open-universe datasets since no model retraining is required as the dataset expands. On a dataset of 45,000 images with 232 semantic labels, just the per-exemplar detector component of Tighe and Lazebnik [33] requires four days on a 512-node cluster to train. On a dataset of 715 images with eight semantic labels [13], Farabet et al [9] requires "48h on a regular server" to train.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations