2002
DOI: 10.1016/s0031-3203(01)00118-2
|View full text |Cite
|
Sign up to set email alerts
|

On measuring the distance between histograms

Abstract: A distance measure between two histograms has applications in feature selection, image indexing and retrieval, pattern classiÿcation and clustering, etc. We propose a distance between sets of measurement values as a measure of dissimilarity of two histograms. The proposed measure has the advantage over the traditional distance measures regarding the overlap between two distributions; it takes the similarity of the non-overlapping parts into account as well as that of overlapping parts. We consider three versio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
226
0
13

Year Published

2005
2005
2017
2017

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 319 publications
(239 citation statements)
references
References 15 publications
(26 reference statements)
0
226
0
13
Order By: Relevance
“…(the interested reader is referred to [47] for a review). Here, we discuss a number of popular distance measures which we think are relevant to our problem.…”
Section: Additional Distance Measures For Matching Histogramsmentioning
confidence: 99%
See 1 more Smart Citation
“…(the interested reader is referred to [47] for a review). Here, we discuss a number of popular distance measures which we think are relevant to our problem.…”
Section: Additional Distance Measures For Matching Histogramsmentioning
confidence: 99%
“…In pattern recognition literature, template matching method is a simple and robust approach when adequately applied [47,[50][51][52][53]. Temperley [12] also considers the method of tonality finding in literature on western music as template matching.…”
Section: Automatic Makam Recognitionmentioning
confidence: 99%
“…Once we translate this result to network traffic, the unsuitability of the (Euclidean) distance metric becomes clear immediately; The difference between Histogram A and B can easily be caused by variability in the TCP header or differences in username and password lengths, for example, while Histogram C shows significantly different traffic. A solution to this 'problem' is provided in [4], where the Minimum Difference of Pair Assignments (MDPA) distance metric is defined. In a nutshell, MDPA aims at finding the minimum difference of pair assignments between two sets, where sets are histogram bins in our context:…”
Section: Clustering Histogramsmentioning
confidence: 99%
“…More formally, we define a tuple as a pair of source and destination IP addresses, source port number and vhost. 4 To qualify for preselection, an attacker must have generated at least N flows towards a target. Note that we refer several times to this number in the remainder of this paper and that the used value for N is explained in Sect.…”
Section: Preselectionmentioning
confidence: 99%
See 1 more Smart Citation