2021
DOI: 10.1145/3412847
|View full text |Cite
|
Sign up to set email alerts
|

HCMSL: Hybrid Cross-modal Similarity Learning for Cross-modal Retrieval

Abstract: The purpose of cross-modal retrieval is to find the relationship between different modal samples and to retrieve other modal samples with similar semantics by using a certain modal sample. As the data of different modalities presents heterogeneous low-level feature and semantic-related high-level features, the main problem of cross-modal retrieval is how to measure the similarity between different modalities. In this article, we present a novel cross-modal retrieval method, named Hybrid Cross-Modal Similarity … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
15
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
2

Relationship

4
6

Authors

Journals

citations
Cited by 29 publications
(15 citation statements)
references
References 60 publications
0
15
0
Order By: Relevance
“…The purpose of cross-modal retrieval is to enable flexible retrieval across different modalities, e.g., retrieve semantically matching images for a given text query. Existing methods can be roughly divided into two categories, i.e., subspace learning methods 29 31 based on CCA 4 and supervised learning methods 2 , 6 , 6 , 32 based on DNNs.…”
Section: Related Workmentioning
confidence: 99%
“…The purpose of cross-modal retrieval is to enable flexible retrieval across different modalities, e.g., retrieve semantically matching images for a given text query. Existing methods can be roughly divided into two categories, i.e., subspace learning methods 29 31 based on CCA 4 and supervised learning methods 2 , 6 , 6 , 32 based on DNNs.…”
Section: Related Workmentioning
confidence: 99%
“…As a hot issue widely concerned, cross-modal retrieval problem is studied by a growing number of researchers [4,5,25,29,35,40,50]. According to the representation type of multimedia instances, cross-modal retrieval can be divided into two groups: real-valued representation based retrieval and binary representation (hash code) based retrieval.…”
Section: Related Workmentioning
confidence: 99%
“…Multi-modal learning means that there are more than one source and form of data, and the process of learning in these forms is called multi-modal learning. Multi-modal learning can be divided into five categories: multi-modal representation learning (Zhang C. et al, 2021 ), modal transformation, alignment (Zhu et al, 2022 ), multi-modal fusion, and collaborative learning (Li et al, 2019 ). In this paper, because we use multi-modal feature selection algorithm, we focus on multi-modal feature selection in multi-modal representation learning.…”
Section: Related Workmentioning
confidence: 99%