2018
DOI: 10.1109/tpami.2017.2709749
|View full text |Cite
|
Sign up to set email alerts
|

SIFT Meets CNN: A Decade Survey of Instance Retrieval

Abstract: Abstract-In the early days, content-based image retrieval (CBIR) was studied with global features. Since 2003, image retrieval based on local descriptors (de facto SIFT) has been extensively studied for over a decade due to the advantage of SIFT in dealing with image transformations. Recently, image representations based on the convolutional neural network (CNN) have attracted increasing interest in the community and demonstrated impressive performance. Given this time of rapid evolution, this article provides… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
314
0
6

Year Published

2018
2018
2022
2022

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 615 publications
(323 citation statements)
references
References 149 publications
3
314
0
6
Order By: Relevance
“…Instead of designing visual features manually, Convolutional Neural Network (CNN) can automatically learn deep representations of images [12]. Several researchers have also applied CNN to image sentiment classification [13]- [16] and demostrated the superior performance of the deep features against hand-tuned features for sentiment classification.…”
Section: Introductionmentioning
confidence: 99%
“…Instead of designing visual features manually, Convolutional Neural Network (CNN) can automatically learn deep representations of images [12]. Several researchers have also applied CNN to image sentiment classification [13]- [16] and demostrated the superior performance of the deep features against hand-tuned features for sentiment classification.…”
Section: Introductionmentioning
confidence: 99%
“…ImageNet, are used as feature extractors by feedforwarding the image of interest, and gathering the activations at different layers of the network [13], [44], [53], [48], [39], [55]. The penultimate activations before softmax classifier have been reported as good baselines for transferring knowledge in several vision tasks [13], [44].…”
Section: B Methodologymentioning
confidence: 99%
“…Oxford 5k dataset is used for image retrieval [38]. This dataset contains 5062 images, denoted as I = {a 1 , a 2 , .…”
Section: A Datasetsmentioning
confidence: 99%