Informative visual words construction to improve bag of words image representation

Farhangi, M. Mehdi; Soryani, Mohsen

doi:10.1049/iet-ipr.2013.0449

Cited by 9 publications

(8 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The optimization variable represents the cropped position in the image, and the fitness function is the similarity between a cropped image and a training dataset image. Usually, image similarity adopts a feature-based method such as the bag-of-words model (32,33) for extracting and searching feature points in an image. However, an effective image similarity measure cannot be performed when the feature points are difficult to match if the images suffer from motion blur and are taken at different viewpoints.…”

Section: Input Image Crop Based On Genetic Algorithmmentioning

confidence: 99%

Image-similarity-based Convolutional Neural Network for Robot Visual Relocalization

Wang¹,

Li²,

Sun³

et al. 2020

Sensors and Materials

View full text Add to dashboard Cite

Convolutional neural network (CNN)-based methods, which train an end-to-end model to regress a six degree of freedom (DoF) pose of a robot from a single red-green-blue (RGB) image, have been developed to overcome the poor robustness of robot visual relocalization recently. However, the pose precision becomes low when the test image is dissimilar to training images. In this paper, we propose a novel method, named image-similarity-based CNN, which considers the image similarity of an input image during the CNN training. The higher the similarity of the input image, the higher precision we can achieve. Therefore, we crop the input image into several small image blocks, and the similarity between each cropped image block and training dataset images is measured by employing a feature vector in a fully connected CNN layer. Finally, the most similar image is selected to regress the pose. A genetic algorithm is utilized to determine the cropped position. Experiments on both open-source dataset 7-Scenes and two actual indoor environments are conducted. The results show that the proposed algorithm leads to better results and reduces large regression errors effectively compared with existing solutions.

show abstract

Section: Input Image Crop Based On Genetic Algorithmmentioning

confidence: 99%

Image-similarity-based Convolutional Neural Network for Robot Visual Relocalization

Wang¹,

Li²,

Sun³

et al. 2020

Sensors and Materials

View full text Add to dashboard Cite

show abstract

“…Recently, it has developed rapidly and become distinguished in the field of image processing such as image recognition, classification, annotation, and so on [33,34]. Its fundamental conception, bag-of-words model (BoW) [35,36], was originally used in distinguishing hidden information in a large collection of corpus [37,38] and conversing the information of the pixels to non-ordered visual words. As an unsupervised generative probabilistic model, its documents are viewed as a mixture of topics, sharing a common Dirichlet priori.…”

Section: Lda (Latent Dirichlet Allocation)mentioning

confidence: 99%

Unsupervised Greenhouse Tomato Plant Segmentation Based on Self-Adaptive Iterative Latent Dirichlet Allocation from Surveillance Camera

Cao

2019

Agronomy

View full text Add to dashboard Cite

It has long been a great concern in deep learning that we lack massive data for high-precision training sets, especially in the agriculture field. Plants in images captured in greenhouses, from a distance or up close, not only have various morphological structures but also can have a busy background, leading to huge challenges in labeling and segmentation. This article proposes an unsupervised statistical algorithm SAI-LDA (self-adaptive iterative latent Dirichlet allocation) to segment greenhouse tomato images from a field surveillance camera automatically, borrowing the language model LDA. Hierarchical wavelet features with an overlapping grid word document design and a modified density-based method quick-shift are adopted, respectively, according to different kinds of images, which are classified by specific proportions between fruits, leaves, and the background. We also utilize the feature correlation between several layers of the image to make further optimization through three rounds of iteration of LDA, with updated documents to achieve finer segmentation. Experiment results show that our method can automatically label the organs of the greenhouse plant under complex circumstances, fast and precisely, overcoming the difficulty of inferior real-time image quality caused by a surveillance camera, and thus obtain large amounts of valuable training sets.

show abstract

“…Bag of visual words merupakan suatu skema untuk mengklasifikasikan citra berdasarkan nilai-nilai pixel pada citra [4] Dengan menggunakan deteksi interest point dan ekstraksi interest point, bag of visual words mengambil ciri unik pada citra sehingga dapat membedakan pola-pola yang terdapat pada suatu citra. Wajah manusia memainkan peran sentral dalam interaksi sosial, oleh karena itu tidak mengherankan bahwa pemrosesan informasi wajah otomatis merupakan subfield penting dan sangat aktif dalam penelitian pengenalan pola [5].…”

Section: Pendahuluanunclassified

Klasifikasi Ekspresi Wajah Menggunakan Bag of Visual Words

Muhathir

2018

InfoTekJar

View full text Add to dashboard Cite

Pada hakikatnya, manusia dapat membedakan pola terhadap suatu objek berdasarkan bentuk visual yang mengandung keadaan emosional. Seperti membedakan ekspresi wajah seseorang pada suatu citra. Manusia dapat membedakan ekspresi pada citra tersebut secara kasat mata. Namun komputer yang tidak dapat mengenali ekspresi wajah tersebut. Bag of visual words merupakan suatu skema untuk mengklasifikasikan citra berdasarkan nilai-nilai pixel pada citra. Dengan menggunakan deteksi interest point dan ekstraksi interest point, bag of visual words mengambil ciri unik pada citra sehingga dapat membedakan pola-pola yang terdapat pada suatu citra. Bag of visual word dengan nilai K 500 mampu mengklasifikasi pola ekspresi wajah dengan tingkat akurasi 69%,Kata kunci: Wajah, Klasifikasi, Speed-up Robust Feature, Bag of visual words, Ekspresi

show abstract

Informative visual words construction to improve bag of words image representation

Cited by 9 publications

References 30 publications

Image-similarity-based Convolutional Neural Network for Robot Visual Relocalization

Image-similarity-based Convolutional Neural Network for Robot Visual Relocalization

Unsupervised Greenhouse Tomato Plant Segmentation Based on Self-Adaptive Iterative Latent Dirichlet Allocation from Surveillance Camera

Klasifikasi Ekspresi Wajah Menggunakan Bag of Visual Words

Contact Info

Product

Resources

About