Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3413971
|View full text |Cite
|
Sign up to set email alerts
|

Label Embedding Online Hashing for Cross-Modal Retrieval

Abstract: Supervised cross-modal hashing has gained a lot of attention recently. However, most existing methods learn binary codes or hash functions in a batch-based scheme, which is inefficient in an online scenario, i.e., data points come in a streaming fashion. Online hashing is a promising solution; however, there still exist several challenges, e.g., how to effectively exploit semantic information, how to discretely solve the binary optimization problem, how to efficiently update hash codes and hash functions. To a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 40 publications
(9 citation statements)
references
References 44 publications
0
8
0
Order By: Relevance
“…Mingbao Lin et.al [42] proposes Fast Class-wise Updating for Online Hashing (FCOH), class-based update method is brought up to decompose binary code learning, and a semirelaxation strategy is adopted in the optimization process, which can well solve the burden of a large number of training batches. Label EMbedding ONline hashing (LEMON) [43] unifies the label similarity and label semantic embedding into a unified framework, and uses a two-step learning method to efficiently update the hash function and binary code, aiming to generate a binary code with strong resolution and effectively reduce the quantization error. Flexible Online Multimodal Hashing (FOMH) [44] proposes to deal with the multimodal image retrieval problem in the form of streaming data, using the multi-modal data weighting method, combined with asymmetric semantic supervision, to generate a binary code with strong versatility between modalities.…”
Section: B Supervised Online Hashingmentioning
confidence: 99%
“…Mingbao Lin et.al [42] proposes Fast Class-wise Updating for Online Hashing (FCOH), class-based update method is brought up to decompose binary code learning, and a semirelaxation strategy is adopted in the optimization process, which can well solve the burden of a large number of training batches. Label EMbedding ONline hashing (LEMON) [43] unifies the label similarity and label semantic embedding into a unified framework, and uses a two-step learning method to efficiently update the hash function and binary code, aiming to generate a binary code with strong resolution and effectively reduce the quantization error. Flexible Online Multimodal Hashing (FOMH) [44] proposes to deal with the multimodal image retrieval problem in the form of streaming data, using the multi-modal data weighting method, combined with asymmetric semantic supervision, to generate a binary code with strong versatility between modalities.…”
Section: B Supervised Online Hashingmentioning
confidence: 99%
“…Such strategy may lose the information of old data. The state-of-the-art online cross-modal hashing (Wang, Luo, and Xu 2020) learns from the above four parts and is endowed with impressive results. However, it is impossible to handle class incremental problem because the incorrect dimensions for matrix multiplication error will happen when performing ⃗ P (t)T P (t) operation if new classes come.…”
Section: Formulationmentioning
confidence: 99%
“…To overcome the limitation, many efforts have been devoted to online hashing. Similarly, we could roughly classify the literature into online uni-modal hashing (Huang, Yang, and Zheng 2013;Cakir and Sclaroff 2015;Chen, King, and Lyu 2017;Weng and Zhu 2020;Tian, Ng, and Wang 2019;Chen et al 2021a), online cross-modal hashing (Xie, Shen, and Zhu 2016;Qi, Wang, and Li 2017;Wang, Luo, and Xu 2020;Yi et al 2021;Zhan et al 2022), and online multi-modal hashing (Xie et al 2017;Lu et al 2019a).…”
Section: Introductionmentioning
confidence: 99%
“…Recent developments in massive multimedia data [11,30,43] have heightened the need for multi-modal hashing technology [19,44], which can support large-scale multimedia retrieval with its extremely low storage cost and high retrieval efficiency. Different from uni-modal hashing [5,18,22] which trains and searches data from a single source, and cross-modal hashing [1,29,37] which explores a shared subspace for two heterogeneous modalities and achieves mutual retrieval across them, multi-modal hashing [23,28,35,41] is a real-world application that data are collected from diverse sources or represented by heterogeneous features from different modalities [39]. It focuses on developing collaborative relationships of multiple modalities and supporting multimedia retrieval task.…”
Section: Introductionmentioning
confidence: 99%