2019
DOI: 10.1016/j.eswa.2019.05.002
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking unsupervised near-duplicate image detection

Abstract: Unsupervised near-duplicate detection has many practical applications ranging from social media analysis and web-scale retrieval, to digital image forensics. It entails running a threshold-limited query on a set of descriptors extracted from the images, with the goal of identifying all possible near-duplicates, while limiting the false positives due to visually similar images. Since the rate of false alarms grows with the dataset size, a very high specificity is thus required, up to 1 − 10 −9 for realistic use… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
12
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 25 publications
(12 citation statements)
references
References 37 publications
0
12
0
Order By: Relevance
“…Through such a setup, pseudo-Siamese is able to provide more flexibility than restricted Siamese, while restricted Siamese is more efficient in training [ 36 ]. Furthermore, DeepRet [ 2 , 37 ] extends Siamese into triplet Siamese with the Triple Loss to capture the slight difference between images and applies region of interest (ROI) pooling to deeply learn the images.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Through such a setup, pseudo-Siamese is able to provide more flexibility than restricted Siamese, while restricted Siamese is more efficient in training [ 36 ]. Furthermore, DeepRet [ 2 , 37 ] extends Siamese into triplet Siamese with the Triple Loss to capture the slight difference between images and applies region of interest (ROI) pooling to deeply learn the images.…”
Section: Related Workmentioning
confidence: 99%
“…Thus, there are many near-duplicate images existing among the massive data. Near-duplicate images are typically considered as images containing the same scene or objects [ 1 , 2 , 3 ], but in various viewpoints, illumination conditions and camera settings etc, or images obtained through reediting of the original image [ 4 ], including but not limited to changing contrast, tone, cropping, rotating and watermarking. Automatic near-duplicate image pair detection by using computer vision and pattern recognition technology has been attracting wide attention recently, as it has great potential values in the application of image copyright violations detection, fake image detection, management of device hardware storage, and autonomous vehicle driving.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, some researchers have focused on training deep learning models to extract global image features. Lia et al [34] evaluated a set of CNN-learned descriptors and concluded that the features learned from fine-tuned CNNs perform better than the off-the-shelf features. Shervin et al [35] gave a summary of promising works that use deep learning-based models for biometric recognition.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, some researchers have focused on training deep learning models to extract image features [34][35][36][37]. However, since these methods fail to sufficiently consider the influence of background clutter and partial occlusion on the final representation, the extracted global CNN features are not robust enough to combat these attacks.…”
Section: Introductionmentioning
confidence: 99%
“…However, the problem of IPA is not intensively studied so far in the literature. The research areas that are related to IPA are near duplicate detection [22,23], and image splicing detection [24][25][26][27][28][29]. Most of these works are designed to classify whether a candidate image is a near duplicate to a given query image.…”
Section: Related Workmentioning
confidence: 99%