2007
DOI: 10.1109/tmm.2007.900138

Bridging the Gap: Query by Semantic Example

Abstract: A combination of query-by-visual-example (QBVE) and semantic retrieval (SR), denoted query-by-semantic-example (QBSE), is proposed. Images are labeled with respect to a vocabulary of visual concepts, as is usual in SR. Each image is then represented by a vector of posterior concept probabilities, referred to as a semantic multinomial. Retrieval is based on the query-by-example paradigm: the user provides a query image, for which 1) a semantic multinomial is computed and 2) matched to those in the …
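The abstract describes the core QBSE pipeline: map each image to a semantic multinomial of posterior concept probabilities, then rank database images by how closely their multinomials match the query's. A minimal sketch of that matching step follows, assuming the per-concept posteriors are already produced by a bank of pre-trained concept classifiers; the function names are illustrative, and the Kullback-Leibler divergence is used here as a standard choice for comparing multinomials.

```python
import numpy as np

def semantic_multinomial(concept_posteriors, eps=1e-10):
    """Normalize per-concept posterior scores into a semantic multinomial.

    `concept_posteriors` holds one posterior score per visual concept,
    as produced by pre-trained concept classifiers (hypothetical input;
    the paper learns these from labeled training images).
    """
    p = np.asarray(concept_posteriors, dtype=float) + eps  # smooth zeros
    return p / p.sum()

def kl_divergence(q, p):
    """KL(q || p) between two multinomials over the concept vocabulary."""
    return float(np.sum(q * np.log(q / p)))

def qbse_rank(query_scores, database_scores):
    """Rank database images by ascending KL divergence from the query."""
    q = semantic_multinomial(query_scores)
    dists = [kl_divergence(q, semantic_multinomial(d)) for d in database_scores]
    return sorted(range(len(dists)), key=dists.__getitem__)

# Toy usage over a 4-concept vocabulary, e.g. {sky, water, people, buildings}
query = np.array([0.70, 0.60, 0.10, 0.05])
db = [np.array([0.65, 0.55, 0.20, 0.10]),   # semantically similar scene
      np.array([0.05, 0.10, 0.80, 0.70])]   # semantically dissimilar scene
print(qbse_rank(query, db))                 # -> [0, 1]
```

Because matching happens in the space of concept probabilities rather than raw visual features, two images with different low-level appearance but similar semantics can still score as close matches.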


Cited by 211 publications (104 citation statements).
References 44 publications.
“…By exploiting the statistical structure of these ambiguities, a QBSE system is able to perform inferences at a higher level of abstraction, and significantly outperforms QBVE systems. This has been confirmed by various recent studies, which have shown that QBSE systems can generalize much better than their QBVE counterparts [13].…”
Section: Introduction (supporting)
confidence: 68%
“…For brevity, we limit the discussion to the implementation details of the contextual level. The visual and semantic levels are those proposed in [19] and [13], where they were shown to achieve better performance than a number of other state-of-the-art image retrieval systems.…”
Section: Implementation Details (mentioning)
confidence: 99%
“…While it is not possible to learn the links between the visual appearance of keyframes and speech-transcript words from a single video story, the concurrent occurrences in a large number of available video stories provide that information. Once learned, these links can be used to predict labels for regions (region labeling) as an alternative to large-scale object recognition. They can also be used in a setting similar to the query-by-semantic-example method proposed by [67].…”
Section: Linking Regions To Abstract In Annotated Keyframes (mentioning)
confidence: 99%