Proceedings of the 22nd ACM International Conference on Multimedia 2014
DOI: 10.1145/2647868.2654901
|View full text |Cite
|
Sign up to set email alerts
|

Multi-modal Mutual Topic Reinforce Modeling for Cross-media Retrieval

Abstract: As an important and challenging problem in the multimedia area, multi-modal data understanding aims to explore the intrinsic semantic information across different modalities in a collaborative manner. To address this problem, a possible solution is to effectively and adaptively capture the common cross-modal semantic information by modeling the inherent correlations between the latent topics from different modalities. Motivated by this task, we propose a supervised multi-modal mutual topic reinforce modeling (… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
33
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 45 publications
(33 citation statements)
references
References 21 publications
0
33
0
Order By: Relevance
“…For example, Roller and Schulte im Walde (2013) integrated visual features into latent Dirichlet allocation (LDA) and proposed a multimodal LDA model to learn representations for textual and visual data. Wang Y et al (2014) proposed a scheme called the multimodal mutual topic reinforce model (M 3 R), which seeks to discover mutually consistent semantic topics via appropriate interactions between model factors. These schemes represent data as topic distributions, and similarities are measured by the likelihood of observed data in terms of latent topics.…”
Section: Theory and Model For Cross-media Uniform Representationmentioning
confidence: 99%
“…For example, Roller and Schulte im Walde (2013) integrated visual features into latent Dirichlet allocation (LDA) and proposed a multimodal LDA model to learn representations for textual and visual data. Wang Y et al (2014) proposed a scheme called the multimodal mutual topic reinforce model (M 3 R), which seeks to discover mutually consistent semantic topics via appropriate interactions between model factors. These schemes represent data as topic distributions, and similarities are measured by the likelihood of observed data in terms of latent topics.…”
Section: Theory and Model For Cross-media Uniform Representationmentioning
confidence: 99%
“…An example is the well-known Latent Dirichlet Allocation (LDA). In [32], a supervised multimodal mutual topic reinforce modeling approach for cross-media retrieval, called M3R, is proposed. Some other methodologies are Partial Least Squares (PLS) and correlation matching.…”
Section: Related Workmentioning
confidence: 99%
“…However, CCA-based methods lack probabilistic interpretation on the intra-modal similarities. Topic models [3,12,25] learn latent topics to describe the intrinsic semantic correlations in multi-modal data. Based on Latent Dirichlet Allocation (LDA) [3], a variety of constraints are imposed.…”
Section: Introductionmentioning
confidence: 99%
“…For example, CCA-based models [1,11,19,20] assume that the inter-modal relation is expressed by co-occurrence of multi-modal data objects. The inter-modal relation is also encoded as binary observation matrix to be fit by the correlation models [3,12,25,27]. By contrast, we directly impose two kinds of inter-modal relations (i.e., semantic similarity and dissimilarity) as smooth priors on the output of multi-modal GPLVM.…”
Section: Introductionmentioning
confidence: 99%