2014
DOI: 10.1049/iet-ipr.2013.0449
|View full text |Cite
|
Sign up to set email alerts
|

Informative visual words construction to improve bag of words image representation

Abstract: Bag of visual words model has recently attracted much attention from computer vision society because of its notable success in analysing images and exploring their content. This study improves this model by utilising the adjacency information between words. To explore this information, a binary tree structure is constructed from the visual words in order to model the isa relationships in the vocabulary. Informative nodes of this tree are extracted by using the χ 2 criterion and are used to capture the adjacenc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0
3

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 30 publications
0
5
0
3
Order By: Relevance
“…The optimization variable represents the cropped position in the image, and the fitness function is the similarity between a cropped image and a training dataset image. Usually, image similarity adopts a feature-based method such as the bag-of-words model (32,33) for extracting and searching feature points in an image. However, an effective image similarity measure cannot be performed when the feature points are difficult to match if the images suffer from motion blur and are taken at different viewpoints.…”
Section: Input Image Crop Based On Genetic Algorithmmentioning
confidence: 99%
“…The optimization variable represents the cropped position in the image, and the fitness function is the similarity between a cropped image and a training dataset image. Usually, image similarity adopts a feature-based method such as the bag-of-words model (32,33) for extracting and searching feature points in an image. However, an effective image similarity measure cannot be performed when the feature points are difficult to match if the images suffer from motion blur and are taken at different viewpoints.…”
Section: Input Image Crop Based On Genetic Algorithmmentioning
confidence: 99%
“…Recently, it has developed rapidly and become distinguished in the field of image processing such as image recognition, classification, annotation, and so on [33,34]. Its fundamental conception, bag-of-words model (BoW) [35,36], was originally used in distinguishing hidden information in a large collection of corpus [37,38] and conversing the information of the pixels to non-ordered visual words. As an unsupervised generative probabilistic model, its documents are viewed as a mixture of topics, sharing a common Dirichlet priori.…”
Section: Lda (Latent Dirichlet Allocation)mentioning
confidence: 99%
“…Bag of visual words merupakan suatu skema untuk mengklasifikasikan citra berdasarkan nilai-nilai pixel pada citra [4] Dengan menggunakan deteksi interest point dan ekstraksi interest point, bag of visual words mengambil ciri unik pada citra sehingga dapat membedakan pola-pola yang terdapat pada suatu citra. Wajah manusia memainkan peran sentral dalam interaksi sosial, oleh karena itu tidak mengherankan bahwa pemrosesan informasi wajah otomatis merupakan subfield penting dan sangat aktif dalam penelitian pengenalan pola [5].…”
Section: Pendahuluanunclassified