Proceedings of the 6th ACM International Conference on Image and Video Retrieval 2007
DOI: 10.1145/1282280.1282368
|View full text |Cite
|
Sign up to set email alerts
|

Information-theoretic semantic multimedia indexing

Abstract: To solve the problem of indexing collections with diverse text documents, image documents, or documents with both text and images, one needs to develop a model that supports heterogeneous types of documents. In this paper, we show how information theory supplies us with the tools necessary to develop a unique model for text, image, and text/image retrieval. In our approach, for each possible query keyword we estimate a maximum entropy model based on exclusively continuous features that were preprocessed. The u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0
1

Year Published

2008
2008
2015
2015

Publication Types

Select...
4
4
1

Relationship

3
6

Authors

Journals

citations
Cited by 34 publications
(27 citation statements)
references
References 38 publications
(43 reference statements)
0
26
0
1
Order By: Relevance
“…Magalhaes&Rüger [21] 0.28* Npde Yavlinsky et al [34] 0.29* MBRM Feng et al [10] 0.30 SML Carneiro et al [3] 0.31 JEC Makadia et al [22] 0.35…”
Section: Logregl2mentioning
confidence: 99%
See 1 more Smart Citation
“…Magalhaes&Rüger [21] 0.28* Npde Yavlinsky et al [34] 0.29* MBRM Feng et al [10] 0.30 SML Carneiro et al [3] 0.31 JEC Makadia et al [22] 0.35…”
Section: Logregl2mentioning
confidence: 99%
“…However, many algorithms do not explicitly exploit the correlation between words. With respect to the deployed machine learning method, we can consider: co-occurrence models of low-level image features and words [26]; machine translation methods that translate image regions into words in the same way as words from French might be translated into English [6]; relevance models CRM [16], CMRM [12], and MBRM [10]; inference networks that connect image segments with words [24]; nonparametric density estimation [34]; supervised learning models [3,4]; information-theoretic semantic indexing [21]; and [22] show that a proper selection of features could lead to very good results for a k-nearest neighbours algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…In [Magalhães and Rüger, 2007] we introduced an information-theoretic framework for Equation (5). The current paper proposes and presents a definitive and thorough account of our framework, [Magalhães, 2008].…”
Section: Organizationmentioning
confidence: 99%
“…Several techniques to model a keyword with different types of probability density distributions have been used: Feng and Manmatha [4] proposed a Bernoulli model with a vocabulary of visual terms for each keyword, Yavlinsky et al [28] deployed nonparametric density estimation, Carneiro and Vasconcelos [1] a semi-parametric density estimation. Automatic multimedia keyword annotation has also been an active area of research: Snoek et al [24] explore temporal synchronization to combine the multi-modal patterns, Monay and Gatica-Perez explore dependencies across different media [17], while Magalhães and Rüger [15] developed a multimodal maximum entropy framework. The above methods extract features from the multimedia itself, but other, heuristic techniques rely on metadata attached to the multimedia: for example, Lu et al [13] analyse HTML text surrounding an image and assign the most relevant keywords to it.…”
Section: Systems Based On Automatic Abstractmentioning
confidence: 99%