Efficient RkNN retrieval with arbitrary non-metric similarity measures

Deepak, P; Deshpande, Prasad M.

doi:10.14778/1920841.1920995

Cited by 5 publications

(2 citation statements)

References 23 publications

(30 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…That method can be combined with any distance-based indexing scheme and is orthogonal to such schemes, including the method proposed in this paper. A tree-based reverse kNN was proposed [10] for indexing non-metric spaces; however, this problem is orthogonal to ours, while the asymmetric relationship between kNN and reverse kNN makes it hard to adapt techniques for kNN to reverse kNN.…”

Section: Related Workmentioning

confidence: 99%

Optimizing Hashing Functions for Similarity Indexing in Arbitrary Metric and Nonmetric Spaces

Jangyodsuk

Papapetrou

Athitsos

2015

Proceedings of the 2015 SIAM International Conference on Data Mining

View full text Add to dashboard Cite

A large number of methods have been proposed for similarity indexing in Euclidean spaces, and several such methods can also be used in arbitrary metric spaces. Such methods exploit specific properties of Euclidean spaces or general metric spaces. Designing generalpurpose similarity indexing methods for arbitrary metric and non-metric distance measures is a more difficult problem, due to the vast heterogeneity of such spaces and the lack of common properties that can be exploited. In this paper, we propose a generally applicable method for similarity-based indexing in arbitrary metric and nonmetric spaces, based on hashing. We build upon the technique of Distance-Based Hashing (DBH), which organizes database objects in multiple hash tables, so that two similar objects tend to fall in the same bucket in at least one of those hash tables. The main contribution is in showing how to optimize the hashing functions for accuracy and efficiency, using training data. The proposed optimizations significantly improve performance in experiments on three public datasets.

show abstract

Section: Related Workmentioning

confidence: 99%

Optimizing Hashing Functions for Similarity Indexing in Arbitrary Metric and Nonmetric Spaces

Jangyodsuk

Papapetrou

Athitsos

2015

Proceedings of the 2015 SIAM International Conference on Data Mining

View full text Add to dashboard Cite

show abstract

“…[24] points out specific cases in which each of the metric properties (viz., reflexivity, symmetry and triangle inequality) may not be intuitively satisfied. Similarity search on arbitrary non-metric similarity measures has attracted recent attention [10,21,20].…”

Section: Related Workmentioning

confidence: 99%

Efficient reverse skyline retrieval with arbitrary non-metric similarity measures

Deshpande

Deepak

2011

Proceedings of the 14th International Conference on Extending Database Technology

Self Cite

View full text Add to dashboard Cite

A Reverse Skyline query returns all objects whose skyline contains the query object. In this paper, we consider Reverse Skyline query processing where the distance between attribute values are not necessarily metric. We outline real world cases that motivate Reverse Skyline processing in such scenarios. We consider various optimizations to develop efficient algorithms for Reverse Skyline processing. Firstly, we consider block-based processing of objects to optimize on IO costs. We then explore pre-processing to re-arrange objects on disk to speed-up computational and IO costs. We then present our main contribution, which is a method of using group-level reasoning and early pruning to microoptimize processing by reducing attribute level comparisons. An extensive empirical evaluation with real-world datasets and synthetic data of varying characteristics shows that our optimization techniques are indeed very effective in dramatically speeding Reverse Skyline processing, both in terms of computational costs and IO costs.

show abstract

Indexing for Similarity Search Operators

Deepak

Deshpande

2015

Operators for Similarity Search

View full text Add to dashboard Cite

Efficient RkNN retrieval with arbitrary non-metric similarity measures

Cited by 5 publications

References 23 publications

Optimizing Hashing Functions for Similarity Indexing in Arbitrary Metric and Nonmetric Spaces

Optimizing Hashing Functions for Similarity Indexing in Arbitrary Metric and Nonmetric Spaces

Efficient reverse skyline retrieval with arbitrary non-metric similarity measures

Indexing for Similarity Search Operators

Contact Info

Product

Resources

About