Proceedings of the 34th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems 2015
DOI: 10.1145/2745754.2745761
|View full text |Cite
|
Sign up to set email alerts
|

Smooth Tradeoffs between Insert and Query Complexity in Nearest Neighbor Search

Abstract: Locality Sensitive Hashing (LSH) has emerged as the method of choice for high dimensional similarity search, a classical problem of interest in numerous applications. LSH-based solutions require that each data point be inserted into a number A of hash tables, after which a query can be answered by performing B lookups. The original LSH solution of [IM98] showed for the first time that both A and B can be made sublinear in the number of data points. Unfortunately, the classical LSH solution does not provide any… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

4
25
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(32 citation statements)
references
References 26 publications
4
25
0
Order By: Relevance
“…In particular, (1.3) is better than all the prior work on time-space trade-offs for ANN, including the most recent tradeoff [Kap15]. Moreover, using a reduction from [Val15], we achieve the bound (1.3) for the whole R d as opposed to just the unit sphere.…”
Section: Techniquesmentioning
confidence: 86%
See 3 more Smart Citations
“…In particular, (1.3) is better than all the prior work on time-space trade-offs for ANN, including the most recent tradeoff [Kap15]. Moreover, using a reduction from [Val15], we achieve the bound (1.3) for the whole R d as opposed to just the unit sphere.…”
Section: Techniquesmentioning
confidence: 86%
“…This regime has been studied since [Ind01a] Despite significant progress in the near-linear space regime, no known algorithms obtain near-linear space and a sublinear query time for all approximations c > 1. For example, the best currently known algorithm of [Kap15] obtained query time of roughly n 4/(c 2 +1) , which becomes trivial for c < √ 3.…”
Section: Locality-sensitive Hashing (Lsh) and Beyond A Classic Technimentioning
confidence: 99%
See 2 more Smart Citations
“…In contrast, strong theoretical results are known when randomization and approximation of distances is allowed, e.g. [2,18,22,25,46]. Even though existing randomized algorithms for set similarity search are superior to commonly used heuristics for difficult data distributions with small skew, it is clear that the heuristics will work much better (in theory and in practice) when the skew is large enough.…”
Section: Introductionmentioning
confidence: 99%