2022
DOI: 10.1145/3534963
|View full text |Cite
|
Sign up to set email alerts
|

Indexing Metric Spaces for Exact Similarity Search

Abstract: With the continued digitization of societal processes, we are seeing an explosion in available data. This is referred to as big data. In a research setting, three aspects of the data are often viewed as the main sources of challenges when attempting to enable value creation from big data: volume, velocity, and variety. Many studies address volume or velocity, while fewer studies concern the variety. Metric spaces are ideal for addressing variety because they can accommodate any data as long as it can be equipp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(14 citation statements)
references
References 145 publications
0
6
0
1
Order By: Relevance
“…To reduce the storage, LAESA [20] only keeps the distances of objects to selected pivots. EPT * [21] selects multiple sets rather than a single set of pivots to achieve better search performance. Omni-family [22,23] employs existing diskbased index structures (e.g., the sequential file, the B + -tree, or the R-tree) to index pre-computed distances.…”
Section: B Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…To reduce the storage, LAESA [20] only keeps the distances of objects to selected pivots. EPT * [21] selects multiple sets rather than a single set of pivots to achieve better search performance. Omni-family [22,23] employs existing diskbased index structures (e.g., the sequential file, the B + -tree, or the R-tree) to index pre-computed distances.…”
Section: B Related Workmentioning
confidence: 99%
“…It can be any classical multi-dimensional index structure for similarity search, such as EPT * [21], GNAT [25] and SAT [15,16].…”
Section: Indexingmentioning
confidence: 99%
“…The ElMTree improves on this by using a metric indexing data structure 13,14 , the List of Clusters (LC) 15 , where randomly assigned objects of the search space are assigned as routing objects. The remaining objects are assigned to their closest routing object, and the distance from each routing object to its furthest child is stored.…”
Section: Elmtreementioning
confidence: 99%
“…The ElMTree improves on this by using a metric indexing data structure, 13,14 the List of Clusters (LC), 15 where randomly selected objects of the search space are designated as routing objects. The remaining objects are assigned to their closest routing object, with the distance from each routing object to its furthest child stored as the routing objects' covering radius.…”
Section: Available Toolsmentioning
confidence: 99%