Proceedings of the 2004 ACM Symposium on Applied Computing 2004
DOI: 10.1145/967900.968151
|View full text |Cite
|
Sign up to set email alerts
|

Similarity between Euclidean and cosine angle distance for nearest neighbor queries

Abstract: Understanding the relationship among different distance measures is helpful in choosing a proper one for a particular application. In this paper, we compare two commonly used distance measures in vector models, namely, Euclidean distance (EUD) and cosine angle distance (CAD), for nearest neighbor (NN) queries in high dimensional data spaces. Using theoretical analysis and experimental results, we show that the retrieval results based on EUD are similar to those based on CAD when dimension is high. We have appl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
88
0
5

Year Published

2005
2005
2021
2021

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 213 publications
(105 citation statements)
references
References 15 publications
0
88
0
5
Order By: Relevance
“…This can happen because as the number of dimensions grows significantly, ESM (Euclidian Similarity Measure) and CSM (Cosine Similarity Measure) eventually become less similar. In very high dimensional spaces as dimension gets higher (≥ 128) [19], the two similarity measures start having small variations between them. However, the rate of decrease of similarity is very slow.…”
Section: Discussionmentioning
confidence: 99%
“…This can happen because as the number of dimensions grows significantly, ESM (Euclidian Similarity Measure) and CSM (Cosine Similarity Measure) eventually become less similar. In very high dimensional spaces as dimension gets higher (≥ 128) [19], the two similarity measures start having small variations between them. However, the rate of decrease of similarity is very slow.…”
Section: Discussionmentioning
confidence: 99%
“…For instance, indexes like [5] could be exploited for speeding up the computation of naming functions.…”
Section: Discussionmentioning
confidence: 99%
“…Calculate Euclidean Distance: Classification will be performed by comparing the feature vectors of the training images with feature vector of input testing image. Distance measure is an important part of a vector model [Qian, 2004]. It can be done by using the distance measures; Euclidean Distance is probably the most widely used distance metric.…”
Section: Process Recognitionmentioning
confidence: 99%