Spatial orientations of visual word pairs to improve Bag-of-Visual-Words model

Khan, Rahat; Barat, Cécile; Muselet, Damien; Ducottet, Christophe

doi:10.5244/c.26.89

Cited by 36 publications

(44 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Our method also outperformed Khan et al [31,32] on all three datasets. However, it is worth mentioning that even with a relatively small feature dimension (smaller codebook) and less dense low-level features, [32] achieved a highly competitive result on 15 Scenes.…”

Section: Comparison With Sp-based Methodsmentioning

confidence: 61%

“…Moreover, it would be interesting, in the future, to test how adopting either (or both) of them affects the classification performance of the proposed method. Lazebnik et al [35] 64.6 ± 0.8 81.4 ± 0.5 -4200, 8400 Yang et al [53] 73.2 ± 0.5 80.2 ± 0.9 40.1 ± 0.9 21504 Wang et al [51] 73.4 -47.7 43008, 86016 Boureau et al [7] a 71.8 ± 1.0 84.1 ± 0.5 -21504 Boureau et al [8] 77.3 ± 0.6 83.3 ± 1.0 41.6 ± 0.6 b 1397760, 365568, 344064 Chatfield et al [10] 76.1 ± 0.6 --84000 Khan et al [31] 67.1 82.5 -5000 Koniusz et al [33] 81.3 ± 0.6 --86016 Wang et al [52] -84.3 ± 0.2 -43008 Fanello et al [19] --47.9 1134592 Khan et al [32] 68 Chatfield et al [11] 77.6 ± 0.1…”

Section: Comparison With Sp-based Methodsmentioning

confidence: 99%

“…Thus, in order to avoid comparing apples to oranges, we break the listed works into four main groups based on what component, of the SP model, each improves. Thus, works are grouped into (1) those that improve the coding step, including works by Yang et al [53], Wang et al [51] and Wang et al [52], (2) those that improve the pooling operator, including works by Yang et al [53] and Koniusz et al [33], (3) those that enrich the spatial information captured by the model, the works by Khan et al [31,32], and finally (4) those that locally pool in the feature space, including works by Boureau et al [8], Fanello et al [19], and ours. Table 1 also includes studies by Boureau et al [7] and Chatfield et al [10], which are two widely cited benchmarking studies that extensively evaluated the model using different combinations of components and parameters.…”

Section: Comparison With Sp-based Methodsmentioning

confidence: 99%

See 2 more Smart Citations

Bregman pooling: feature-space local pooling for image classification

Najjar

Ogawa

Haseyama

2015

Int J Multimed Info Retr

View full text Add to dashboard Cite

In this paper, we propose a novel feature-space local pooling method for the commonly adopted architecture of image classification. While existing methods partition the feature space based on visual appearance to obtain pooling bins, learning more accurate space partitioning that takes semantics into account boosts performance even for a smaller number of bins. To this end, we propose partitioning the feature space over clusters of visual prototypes common to semantically similar images (i.e., images belonging to the same category). The clusters are obtained by Bregman co-clustering applied offline on a subset of training data. Therefore, being aware of the semantic context of the input image, our features have higher discriminative power than do those pooled from appearance-based partitioning. Testing on four datasets 15 Scenes, and 17 Flowers) belonging to three different classification tasks showed that the proposed method outperforms methods in previous works on local pooling in the feature space for less feature dimensionality. Moreover, when implemented within a spatial pyramid, our method achieves comparable results on three of the datasets used.

show abstract

Section: Comparison With Sp-based Methodsmentioning

confidence: 61%

Section: Comparison With Sp-based Methodsmentioning

confidence: 99%

Section: Comparison With Sp-based Methodsmentioning

confidence: 99%

See 1 more Smart Citation

Bregman pooling: feature-space local pooling for image classification

Najjar

Ogawa

Haseyama

2015

Int J Multimed Info Retr

View full text Add to dashboard Cite

show abstract

“…Khan et al [7] proposed to use the angles made by pair-wise identical visual words (PIWs) to add spatial information to the BoVWs model. An image representation is then constructed on these angles by aggregating them in a pair-wise identical words angles histogram (PIWAH).…”

Section: Methodsmentioning

confidence: 99%

“…The Multiscale SIFT with best performing number of scales is then evaluated along with the other three extraction methods on predefined vocabulary sizes. The best performing variant of SIFT is then used to compare the performances of PIWAH [7] and TIWAH. Each experiment is performed 10 times where in each run the datasets are randomly split into training and test sets.…”

Section: Methodsmentioning

confidence: 99%

Invariant Image-Based Species Classification of Butterflies and Reef Fish

Anwar¹,

Zambanini²,

Kampel³

2015

Procedings of the Machine Vision of Animals and Their Behaviour Workshop 2015

View full text Add to dashboard Cite

We propose a framework for species-based image classification of butterflies and reef fish. To support such image-based classification, we use an image representation which enriches the famous bag-of-visual words (BoVWs) model with spatial information. This image representation is developed by encoding the global geometric relationships of visual words in the 2D image plane in a scale-and rotation-invariant manner. In this way, invariance is achieved to the most common variations found in the images of these animals as they can be imaged at different image locations, exhibit various in-plane orientations and have various scales in the images. The images in our butterfly and reef fish datasets belong to 30 species of each animal. We achieve better classification rates on both the datasets than the ordinary BoVWs model while still being invariant to the mentioned image variations. Our proposed image-based classification framework for butterfly and reef fish species can be considered as a helpful tool for scientific research, conversation and education.

show abstract

Adding Color Information to Spatially-Enhanced, Bag-of-Visual-Words Models

Laurenson

Olson

2021

Advances in Visual Computing

View full text Add to dashboard Cite

Spatial orientations of visual word pairs to improve Bag-of-Visual-Words model

Cited by 36 publications

References 21 publications

Bregman pooling: feature-space local pooling for image classification

Bregman pooling: feature-space local pooling for image classification

Invariant Image-Based Species Classification of Butterflies and Reef Fish

Adding Color Information to Spatially-Enhanced, Bag-of-Visual-Words Models

Contact Info

Product

Resources

About