Coupled Binary Embedding for Large-Scale Image Retrieval

Zheng, Liang; Wang, Shengjin; Tian, Qi

doi:10.1109/tip.2014.2330763

Cited by 133 publications

(6 citation statements)

References 61 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Binary coding aims to present feature vectors by a compact binary code. For example, in [3], the authors present dual binary embedding applied lo large scale images retrieval. The system uses multiples binary features extracted from SIFT (Scale Invariant Feature Transform) feature and a multi-IDF (Inverse Document Frequency) scheme allowing the association of binary features to the inverted file.…”

Section: Large Scale Multimedia Management Method: Overviewmentioning

confidence: 99%

Large Scale Multimedia Management: Recent Challenges

Mahmoudi

Belarbi

2022

Information

View full text Add to dashboard Cite

Multimedia applications deal, in most cases, with an extremely high volume of multimedia data (2D and 3D images, sounds, videos). That is why efficient algorithms should be developed to analyze and process these large datasets. On the other hand, multimedia management is based on efficient representation of knowledge which allows efficient data processing and retrieval. The main challenge in this era is to achieve clever and quick access to these huge datasets to allow easy access to the data and in a reasonable time. In this context, large-scale image retrieval is a fundamental task. Many methods have been developed in the literature to achieve fast and efficient navigating in large databases by using the famous content-based image retrieval (CBIR) methods associated with these methods allowing a decrease in the computing time, such as dimensional reduction and hashing methods. More recently, these methods based on convolutional neural networks (CNNs) for feature extraction and image classification are widely used. In this paper, we present a comprehensive review of recent multimedia retrieval methods and algorithms applied to large datasets of 2D/3D images and videos. This editorial paper discusses the mains challenges of multimedia retrieval in a context of large databases.

show abstract

Section: Large Scale Multimedia Management Method: Overviewmentioning

confidence: 99%

Large Scale Multimedia Management: Recent Challenges

Mahmoudi

Belarbi

2022

Information

View full text Add to dashboard Cite

show abstract

“…In practice, additional data, such as other feature descriptors or cues, can be embedded in the inverted index (together with image IDs) for refined re-ranking, such as Hamming embedding [61], binary embedding [21], semantic-aware coindexing [57], and IR embedding [29]. The embedded information is typically compact (e.g.…”

Section: A Utilizing the Inverted Indexmentioning

confidence: 99%

“…The inverted index can be extended to multidimensional cases, aka inverted multi-index (IMI) [20]. If each index dimension corresponds to a different feature, then IMIlike structures can be used for feature fusion, aka coupled indexing [21]. For a two-dimensional IMI with dimensions M 1 and M 2 , the time complexity is reduced to O(N/M 1 /M 2 ), but the space complexity is increased to O(M 1 M 2 ).…”

Section: Introductionmentioning

confidence: 99%

Fusion of Global and Local Features with Multi-Inverted Indices for Efficient Image Retrieval

Weng

2023

Preprint

View full text Add to dashboard Cite

<p>Feature fusion is an effective solution for improving image retrieval performance. Although the more feature types, the better accuracy, complexity also increases. Applications in practice typically afford a limited number of feature types. Due to the strong complementarity, global and local features form an ideal combination for many fusion applications. However, the two kinds of features are intrinsically different in nature, thus cannot be fused in a straightforward way. In this work, we propose an integrated image retrieval and feature fusion framework for global and local features. It is based on inverted index fusion, a technique for efficient image retrieval. The core idea is to rank candidates by weighted voting during candidate selection, which is named pre-ranking. This procedure takes place before re-ranking, and is potentially superior to conventional late fusion. Extensive experiments on three public datasets show that the light-weight pre-ranking stage significantly contributes to accuracy, and brings substantial improvement when used together with re-ranking. Our method is robust and versatile, and can be applied to any scenario where inverted indexing is used. It is a promising technique for multimedia retrieval in the big data era.</p>

show abstract

“…In practice, SIFT and SURF are trade-off options because SIFT yields better accuracy results but slower inference. In 8,9 , they implemented a new approach when creating bag-of-words (BOW) for a large database; despite choosing only SIFT, they decided to add binary descriptions and achieved 79.6% accuracy on a holiday dataset. Deep Learning visual descriptor: Along with lowlevel features, the features extracted by these approaches above seem to exploit only a small amount of image information.…”

Section: Introductionmentioning

confidence: 99%

Image Retrieval Based on ConvNets and Hashing Algorithm

Tran,

Nguyen,

2023

Sci. Tech. Dev. J.

View full text Add to dashboard Cite

Image retrieval is a prominent subject of study in the fields of image processing and computer vision. With its application in various domains, such as logo search, product search or general image search in Google, Bing, etc., image retrieval has received significant attention for many years. In this work, we study and investigate a framework that leverages visual transferring fea-tures and hashing algorithms for the purpose of finding similar images in a dataset. The key idea of our solution is to find the answer to the following question: “How can we convert an image into binary code and search it more efficiently in a large-scale dataset?”. To achieve this pur-pose, we use pretrained CNN models from ImageNet for image representation and then convert them into binary code by using hashing algorithms. These images in the dataset are represented by binary codes, and the Hamming distance is used to find the images in the dataset that are in-dexed. To demonstrate the robustness of the system, we systematically tested the performance of the system based on speed with raw indexing and hashing indexing on 4 datasets: CIFAR-10, Caltech-101, Oxford-102-Flowers, and MS-COCO 2017. The experimental results show that lo-cal sensitive hashing (LSH) algorithms with 2,048 bits in binary code demonstrate the same or greater precision than raw indexing. Furthermore, the findings show that the MobileNet architec-ture consistently outperforms other architectures across these datasets, effectively balancing speed and precision.

show abstract

Coupled Binary Embedding for Large-Scale Image Retrieval

Cited by 133 publications

References 61 publications

Large Scale Multimedia Management: Recent Challenges

Large Scale Multimedia Management: Recent Challenges

Fusion of Global and Local Features with Multi-Inverted Indices for Efficient Image Retrieval

Image Retrieval Based on ConvNets and Hashing Algorithm

Contact Info

Product

Resources

About