2008 IEEE Conference on Computer Vision and Pattern Recognition 2008
DOI: 10.1109/cvpr.2008.4587799
|View full text |Cite
|
Sign up to set email alerts
|

Object categorization using co-occurrence, location and appearance

Abstract: In this work we introduce a novel approach to object categorization that incorporates two types of context-cooccurrence and relative location-with local appearancebased features. Our approach, named CoLA (for Cooccurrence, Location and Appearance), uses a conditional random field (CRF) to maximize object label agreement according to both semantic and spatial relevance. We model relative location between objects using simple pairwise features. By vector quantizing this feature space, we learn a small set of pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

2
297
0
2

Year Published

2010
2010
2019
2019

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 419 publications
(301 citation statements)
references
References 25 publications
(42 reference statements)
2
297
0
2
Order By: Relevance
“…Our best method (merging multiple segmentation with mean operator) ranks second with for global pixel-wise accuracy with 76.1 behind Gould (76.5). The third one is our method using multiple segmentation and the max operator (75.4), then comes Yang et al (75.1) and our method using edison segmentation (75.1).The main advantage of the method of Gould et al is the introduction of the relative location priors between the object classes (a full 3-D spatial relationships between objects is inferred), which clearly helps the recognition as was also highlighted by Galleguillos et al [12]. We model only the co-occurences between adjacent regions in the relaxation labelling.…”
Section: Impact Of the Relaxation Labellingmentioning
confidence: 93%
See 2 more Smart Citations
“…Our best method (merging multiple segmentation with mean operator) ranks second with for global pixel-wise accuracy with 76.1 behind Gould (76.5). The third one is our method using multiple segmentation and the max operator (75.4), then comes Yang et al (75.1) and our method using edison segmentation (75.1).The main advantage of the method of Gould et al is the introduction of the relative location priors between the object classes (a full 3-D spatial relationships between objects is inferred), which clearly helps the recognition as was also highlighted by Galleguillos et al [12]. We model only the co-occurences between adjacent regions in the relaxation labelling.…”
Section: Impact Of the Relaxation Labellingmentioning
confidence: 93%
“…A mapping between the keywords and the visual blobs is performed using a method based on Expectation Maximization. The rest of the literature [15,26,32,12,2] noticeably differs from the original work by Duygulu et al in the sense that the models built try to exploit the maximum of information that can be extracted from the image: not only low level features (color, texture, etc. ), but also local contextual relationships between pixels or image segments, location and even global relevance estimates.…”
mentioning
confidence: 93%
See 1 more Smart Citation
“…However, identifying the abnormality of a person hanging in the air becomes difficult because the "person" is still above the "road". Third, contextual models become more informative when the more context types, such as co-occurrence and relative position/scale among objects, are used [7]. Finally, the models should not restrict the interpretation of scenes to find abnormal object properly.…”
Section: Introductionmentioning
confidence: 99%
“…Computer vision approaches have demonstrated that the use of context improves recognition performance [4]- [8]. While the term context is frequently used in the literature as one of important keywords, it is difficult to give its clear definition.…”
Section: Introductionmentioning
confidence: 99%