Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 2005
DOI: 10.1109/iccv.2005.152
|View full text |Cite
|
Sign up to set email alerts
|

Modeling scenes with local descriptors and latent aspects

Abstract: We present a new approach to model visual scenes in image collections, based on local invariant features and probabilistic latent space models. Our formulation provides answers to three open questions:(1) whether the invariant local features are suitable for scene (rather than object) classification; (2) whether unsupervised latent space models can be used for feature extraction in the classification task; and (3) whether the latent space formulation can discover visual cooccurrence patterns, motivating novel … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

4
234
1
3

Year Published

2006
2006
2010
2010

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 301 publications
(242 citation statements)
references
References 17 publications
4
234
1
3
Order By: Relevance
“…For image representation, there is still no such approach that would be adequate for a wide variety of image processing problems. However, among the proposed representations, a consensus is emerging on using local descriptors for various tasks, for example (Lowe, 2004, Quelhas et al, 2005. This type of representation segments the image into regions of interest, and extracts visual features from each region.…”
Section: Image Representationmentioning
confidence: 99%
See 1 more Smart Citation
“…For image representation, there is still no such approach that would be adequate for a wide variety of image processing problems. However, among the proposed representations, a consensus is emerging on using local descriptors for various tasks, for example (Lowe, 2004, Quelhas et al, 2005. This type of representation segments the image into regions of interest, and extracts visual features from each region.…”
Section: Image Representationmentioning
confidence: 99%
“…The mapping of the descriptors to discrete indexes is performed according to a codebook C, which is typically learned from the local descriptors of the training images through kmeans clustering (Duygulu et al, 2002, Jeon and Manmatha, 2004, Quelhas et al, 2005. The assignment of the weight p i of visterm i in image p is as follows:…”
Section: Image Representationmentioning
confidence: 99%
“…Another similar part-based image represenations that are proposed recentlty are visterms [15,23,24], SIFT-bags [39] blobs [7], and VLAD [14] vector representation of an image which aggregates descriptors based on a locality criterion in the feature space. The different approach is the one proposed by Morand et al [21].…”
Section: Analogy Between Information Retrieval and Cbirmentioning
confidence: 99%
“…While most segmentation approaches segment image pixels or blocks based on their luminance, color or texture, in this work we consider local image regions characterized by viewpoint invariant descriptors [10]. This region representation, robust with respect to partial occlusion, clutter, and changes in viewpoint and illumination, has shown its applicability in a number of vision tasks [2,16,8,20,3,14,15]. Although local invariant regions do not provide a full segmentation of an image, they often occupy a considerable part of the scene and thus can define a "sparse" segmentation ( Fig.…”
Section: Introductionmentioning
confidence: 99%