2019
DOI: 10.1109/tpami.2018.2846566
|View full text |Cite
|
Sign up to set email alerts
|

Fine-Tuning CNN Image Retrieval with No Human Annotation

Abstract: Image descriptors based on activations of Convolutional Neural Networks (CNNs) have become dominant in image retrieval due to their discriminative power, compactness of representation, and search efficiency. Training of CNNs, either from scratch or fine-tuning, requires a large amount of annotated data, where a high quality of annotation is often crucial. In this work, we propose to fine-tune CNNs for image retrieval on a large collection of unordered images in a fully automated manner. Reconstructed 3D models… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

14
1,090
1
3

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 1,040 publications
(1,177 citation statements)
references
References 65 publications
14
1,090
1
3
Order By: Relevance
“…They also propose the regional version RMAC, by sampling windows at different scales and describing them separately. Radenović et al [19] generalize the preceding approaches with a generalized mean pooling (GeM) including a learnable parameter.…”
Section: Global Methodsmentioning
confidence: 99%
“…They also propose the regional version RMAC, by sampling windows at different scales and describing them separately. Radenović et al [19] generalize the preceding approaches with a generalized mean pooling (GeM) including a learnable parameter.…”
Section: Global Methodsmentioning
confidence: 99%
“…expectation maximization (EM) [33] , curriculum learning [34] , self-paced learning [35] , etc. ) are widely used in the weakly-supervised tasks [9,[36][37][38][39][40][41] . For example, [36] adopts the expectation maximization (EM) algorithm to dynamically predict semantic foreground and background pixels by using an alternative training procedure.…”
Section: Iterative Learning Methodsmentioning
confidence: 99%
“…In Table 7, we present the results in ROxford and RParis datasets of stateof-the-art methods which uses VGG as feature extractor. In the pre-trained single pass category we improve the state-of-the-art performance with ChCO-SC T + SpT D based in the linear aggregation of co-occurrences against well known image retrieval methods like crow [7], SPoC [5], MAC and R-MAC [6] and GeM [31]. Moreover, with bilinear pooling we can obtain a final vector representation with higher dimensions than the number of channels of the last VGG layer.…”
Section: Comparison With State-of-the-art Resultsmentioning
confidence: 99%
“…In this section is evaluated the co-occurrence representation after the cooccurrence filter training process in a CoOcNET pipeline. The evaluation is performed similar to [31], with its same whitening procedure, alpha query expansion method αQE, and testing also each query in multiscale, ms, (1, 1…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation