Label Embedding Online Hashing for Cross-Modal Retrieval

Wang, Yongxin; Luo, Xin; Xu, Xin-Shun

doi:10.1145/3394171.3413971

Cited by 40 publications

(9 citation statements)

References 44 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Mingbao Lin et.al [42] proposes Fast Class-wise Updating for Online Hashing (FCOH), class-based update method is brought up to decompose binary code learning, and a semirelaxation strategy is adopted in the optimization process, which can well solve the burden of a large number of training batches. Label EMbedding ONline hashing (LEMON) [43] unifies the label similarity and label semantic embedding into a unified framework, and uses a two-step learning method to efficiently update the hash function and binary code, aiming to generate a binary code with strong resolution and effectively reduce the quantization error. Flexible Online Multimodal Hashing (FOMH) [44] proposes to deal with the multimodal image retrieval problem in the form of streaming data, using the multi-modal data weighting method, combined with asymmetric semantic supervision, to generate a binary code with strong versatility between modalities.…”

Section: B Supervised Online Hashingmentioning

confidence: 99%

Angular Quantization Online Hashing for Image Retrieval

Fang

Liu

2022

IEEE Access

View full text Add to dashboard Cite

Online hash method with fast search mechanism and compact index structure plays a pivotal role. The inner product between label data has become one of the important means to measure the similarity between existing data and new data streams in online hashing methods. However, due to its discrete attributes and semantic gap, it often leads to a large amount of information loss. In this article, we propose a new method called Angular Quantization Online Hashing (AQOH) to focus on learning compact binary codes with the help of cosine distance. Specifically, we propose an online hashing method for angular quantization, by minimizing the quantization error between the cosine similarity calculated from the original data and the generated binary code between the existing data and the new data stream. Further, within this framework, two effective algorithms to complete the optimization of the objective function to be designed, including continuous and discrete methods, respectively. Extensive experiments on various benchmark databases for online retrieval verify that our method outperforms many state-of-the art learning to hash methods.

show abstract

Section: B Supervised Online Hashingmentioning

confidence: 99%

Angular Quantization Online Hashing for Image Retrieval

Fang

Liu

2022

IEEE Access

View full text Add to dashboard Cite

show abstract

“…Such strategy may lose the information of old data. The state-of-the-art online cross-modal hashing (Wang, Luo, and Xu 2020) learns from the above four parts and is endowed with impressive results. However, it is impossible to handle class incremental problem because the incorrect dimensions for matrix multiplication error will happen when performing ⃗ P (t)T P (t) operation if new classes come.…”

Section: Formulationmentioning

confidence: 99%

“…To overcome the limitation, many efforts have been devoted to online hashing. Similarly, we could roughly classify the literature into online uni-modal hashing (Huang, Yang, and Zheng 2013;Cakir and Sclaroff 2015;Chen, King, and Lyu 2017;Weng and Zhu 2020;Tian, Ng, and Wang 2019;Chen et al 2021a), online cross-modal hashing (Xie, Shen, and Zhu 2016;Qi, Wang, and Li 2017;Wang, Luo, and Xu 2020;Yi et al 2021;Zhan et al 2022), and online multi-modal hashing (Xie et al 2017;Lu et al 2019a).…”

Section: Introductionmentioning

confidence: 99%

Online Enhanced Semantic Hashing: Towards Effective and Efficient Retrieval for Streaming Multi-Modal Data

Luo

Zhan

et al. 2022

AAAI

Self Cite

View full text Add to dashboard Cite

With the vigorous development of multimedia equipments and applications, efficient retrieval of large-scale multi-modal data has become a trendy research topic. Thereinto, hashing has become a prevalent choice due to its retrieval efficiency and low storage cost. Although multi-modal hashing has drawn lots of attention in recent years, there still remain some problems. The first point is that existing methods are mainly designed in batch mode and not able to efficiently handle streaming multi-modal data. The second point is that all existing online multi-modal hashing methods fail to effectively handle unseen new classes which come continuously with streaming data chunks. In this paper, we propose a new model, termed Online enhAnced SemantIc haShing (OASIS). We design novel semantic-enhanced representation for data, which could help handle the new coming classes, and thereby construct the enhanced semantic objective function. An efficient and effective discrete online optimization algorithm is further proposed for OASIS. Extensive experiments show that our method can exceed the state-of-the-art models. For good reproducibility and benefiting the community, our code and data are already publicly available.

show abstract

“…Recent developments in massive multimedia data [11,30,43] have heightened the need for multi-modal hashing technology [19,44], which can support large-scale multimedia retrieval with its extremely low storage cost and high retrieval efficiency. Different from uni-modal hashing [5,18,22] which trains and searches data from a single source, and cross-modal hashing [1,29,37] which explores a shared subspace for two heterogeneous modalities and achieves mutual retrieval across them, multi-modal hashing [23,28,35,41] is a real-world application that data are collected from diverse sources or represented by heterogeneous features from different modalities [39]. It focuses on developing collaborative relationships of multiple modalities and supporting multimedia retrieval task.…”

Section: Introductionmentioning

confidence: 99%

Graph Convolutional Multi-modal Hashing for Flexible Multimedia Retrieval

Zhu

Liu

et al. 2021

Proceedings of the 29th ACM International Conference on Multimedia

View full text Add to dashboard Cite

Multi-modal hashing makes an important contribution to multimedia retrieval, where a key challenge is to encode heterogeneous modalities into compact hash codes. To solve this dilemma, graphbased multi-modal hashing methods generally define individual affinity matrix of each independent modality and apply linear algorithm for heterogeneous modalities fusion and compact hash learning. Several other methods construct graph Laplacian matrix based on semantic information to help learn discriminative hash code. However, these conventional methods roughly ignore the structural similarity of training set and the complex relations among multi-modal samples, which leads to unsatisfactory complementarity of fused hash codes. More notably, they are faced with two other important problems: huge computing and storage costs caused by graph construction and partial modality feature lost problem when incomplete query sample comes. In this paper, we propose a Flexible Graph Convolutional Multi-modal Hashing (FGCMH) method that adopts GCNs with linear complexity to preserve both the modality-individual and modality-fused structural similarity for discriminative hash learning. Necessarily, accurate multimedia retrieval can be performed on complete and incomplete datasets with our method. Specifically, multiple modality-individual GCNs under semantic guidance are proposed to act on each individual modality independently for intra-modality similarity preserving, then the output representations are fused into a fusion graph with adaptive weighting scheme. Hash GCN and semantic GCN, which share parameters in the first two layers, propagate fusion information and generate hash codes under high-level label space supervision. In the query stage, our method adaptively captures various multi-modal contents in a flexible and robust way, even if partial modality features are lost. Experimental results on three publicly datasets show the flexibility and effectiveness of our proposed method.

show abstract

Label Embedding Online Hashing for Cross-Modal Retrieval

Cited by 40 publications

References 44 publications

Angular Quantization Online Hashing for Image Retrieval

Angular Quantization Online Hashing for Image Retrieval

Online Enhanced Semantic Hashing: Towards Effective and Efficient Retrieval for Streaming Multi-Modal Data

Graph Convolutional Multi-modal Hashing for Flexible Multimedia Retrieval

Contact Info

Product

Resources

About