Proceedings of the International Workshop on Workshop on Multimedia Information Retrieval 2007
DOI: 10.1145/1290082.1290111
|View full text |Cite
|
Sign up to set email alerts
|

Evaluating bag-of-visual-words representations in scene classification

Abstract: Based on keypoints extracted as salient image patches, an image can be described as a "bag of visual words" and this representation has been used in scene classification. The choice of dimension, selection, and weighting of visual words in this representation is crucial to the classification performance but has not been thoroughly studied in previous work. Given the analogy between this representation and the bag-of-words representation of text documents, we apply techniques used in text categorization, includ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

13
397
0
8

Year Published

2009
2009
2018
2018

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 670 publications
(418 citation statements)
references
References 22 publications
13
397
0
8
Order By: Relevance
“…Its instances present 30 dimensions and are classified into two distinct classes, the malignant and benign cancer [39]. The images dataset contains 2117 pictures [40] in three different classes with 32 features extracted using the bag-of-visual features (BoVF) [41] technique. The two_norm and simplex are artificial datasets generated using the mlbench package of R [42].…”
Section: Evaluation and Applicationmentioning
confidence: 99%
“…Its instances present 30 dimensions and are classified into two distinct classes, the malignant and benign cancer [39]. The images dataset contains 2117 pictures [40] in three different classes with 32 features extracted using the bag-of-visual features (BoVF) [41] technique. The two_norm and simplex are artificial datasets generated using the mlbench package of R [42].…”
Section: Evaluation and Applicationmentioning
confidence: 99%
“…For the sake of illustration, in Fig. 2 we show the layout resulting from the combination of five multidimensional projection techniques, namely, LSP, LAMP, PLMP, SMS, and Hybrid Model (see [26] for a detailed description of those projection methods) when projecting the caltech data set which contains pictures [27] with features extracted using the bag-of-visual features (BoVF) [28] method. Fig.…”
Section: Combining Multiple Projectionsmentioning
confidence: 99%
“…Yang et al in [34] evaluated many frequency weighting schemes which are based on these factors, such as tf-idf weighting, stop word removal, and feature selection. The best weighting scheme in information retrieval does not guarantee good performance in CBIR since the count information can be noisy.…”
Section: Weighting Schemementioning
confidence: 99%