On measuring the distance between histograms

Cha, Sung-Hyuk; Srihari, Sargur N.

doi:10.1016/s0031-3203(01)00118-2

Cited by 319 publications

(239 citation statements)

References 15 publications

(26 reference statements)

Citation statement types: Supporting, Mentioning (226), Contrasting, Unclassified

“…(the interested reader is referred to [47] for a review). Here, we discuss a number of popular distance measures which we think are relevant to our problem.…”

Section: Additional Distance Measures For Matching Histograms (mentioning)

confidence: 99%

“…In pattern recognition literature, template matching method is a simple and robust approach when adequately applied [47, 50, 51, 52, 53]. Temperley [12] also considers the method of tonality finding in literature on western music as template matching.…”

Section: Automatic Makam Recognition (mentioning)

confidence: 99%

Pitch-frequency histogram-based music information retrieval for Turkish music

Gedik; Bozkurt. 2010. Signal Processing.

“…Once we translate this result to network traffic, the unsuitability of the (Euclidean) distance metric becomes clear immediately; The difference between Histogram A and B can easily be caused by variability in the TCP header or differences in username and password lengths, for example, while Histogram C shows significantly different traffic. A solution to this 'problem' is provided in [4], where the Minimum Difference of Pair Assignments (MDPA) distance metric is defined. In a nutshell, MDPA aims at finding the minimum difference of pair assignments between two sets, where sets are histogram bins in our context:…”

Section: Clustering Histograms (mentioning)

confidence: 99%
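
The contrast drawn in the statement above can be illustrated with a small sketch. It assumes ordinal bins and equal total counts in both histograms, and it uses the prefix-sum form commonly given for the ordinal case of a minimum-difference-of-pair-assignments distance; treat that form as an assumption here, not as the cited authors' exact definition. The function names and the three example histograms are hypothetical and are not the Histograms A, B, and C of the citing paper.

```python
import math
from itertools import accumulate


def euclidean_distance(a, b):
    """Bin-by-bin Euclidean distance; ignores how far apart bins are."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def ordinal_distance(a, b):
    """Sum of absolute prefix sums of the bin differences.

    Assumption: both histograms have the same number of bins and the
    same total count, and the bins are ordered (ordinal).
    """
    if len(a) != len(b) or sum(a) != sum(b):
        raise ValueError("histograms need equal length and equal total count")
    return sum(abs(p) for p in accumulate(x - y for x, y in zip(a, b)))


# Hypothetical histograms: the same 10 samples, shifted by one or three bins.
hist_a = [0, 10, 0, 0, 0]
hist_b = [0, 0, 10, 0, 0]
hist_c = [0, 0, 0, 0, 10]

# Euclidean distance cannot tell the small shift from the large one.
print(euclidean_distance(hist_a, hist_b), euclidean_distance(hist_a, hist_c))
# The ordinal distance grows with the size of the shift.
print(ordinal_distance(hist_a, hist_b), ordinal_distance(hist_a, hist_c))
```

In this sketch the Euclidean distance is identical for both pairs, while the ordinal distance is three times larger for the histogram shifted by three bins, which is the kind of behaviour the statement attributes to a bin-order-aware measure.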

“…More formally, we define a tuple as a pair of source and destination IP addresses, source port number and vhost. [footnote 4] To qualify for preselection, an attacker must have generated at least N flows towards a target. Note that we refer several times to this number in the remainder of this paper and that the used value for N is explained in Sect.…”

Section: Preselection (mentioning)

confidence: 99%

“…After retrieval, flow data that cannot be part of an attack by definition is filtered out. For example, flow records from attacker to target typically feature at least four packets, because every valid HTTP request consists of the following packets at least: [footnote 4: Many IPFIX flow exporters extract vhosts, often referred to as HTTP hostname, from HTTP headers. This information is no prerequisite for our detection approach and therefore only used within the Preselection phase.]…”

Section: Data Retrieval (mentioning)

confidence: 99%

Flow-Based Web Application Brute-Force Attack and Compromise Detection

et al. 2017

In the early days of network and service management, researchers paid much attention to the design of management frameworks and protocols. Since then the focus of research has shifted from the development of management technologies towards the analysis of management data. From the five FCAPS areas, security of networks and services has become a key challenge. For example, brute-force attacks against Web applications, and compromises resulting thereof, are widespread. Talks with several Top-10 Web hosting companies in the Netherlands reflect that detection of these attacks is often done based on log file analysis on servers, or by deploying host-based intrusion detection systems (IDSs) and firewalls. However, such host-based solutions have several problems. In this paper we therefore investigate the feasibility of a network-based monitoring approach, which detects brute-force attacks against and compromises of Web applications, even in encrypted environments. Our approach is based on per-connection histograms of packet payload sizes in flow data that are exported using IPFIX. We validate our approach using datasets collected in the production network of a large Web hoster in the Netherlands.

Dissimilarity‐based detection of schizophrenia

Ulas; Duin; Castellani; et al. 2011. Int J Imaging Syst Tech.

In this article, a novel approach to schizophrenia classification using magnetic resonance images (MRI) is proposed. The presented method is based on dissimilarity-based classification techniques applied to morphological MRIs and diffusion-weighted images (DWI). Instead of working with features directly, pairwise dissimilarities between expert delineated regions of interest (ROIs) are considered as representations based on which learning and classification can be performed. Experiments are carried out on a set of 59 patients and 55 controls and several pairwise dissimilarity measurements are analyzed. We demonstrate that significant improvements can be obtained when combining over different ROIs and different dissimilarity measures. We show that combining ROIs using the dissimilarity-based representation, we achieve higher accuracies. The dissimilarity-based representation outperforms the feature-based representation in all cases. Best results are obtained by combining the two modalities. In summary, our contribution is threefold: (i) We introduce the usage of dissimilarity-based classification to schizophrenia detection and show that dissimilarity-based classification achieves better results than normal features, (ii) We use dissimilarity combination to achieve better accuracies when carefully selected ROIs and dissimilarity measures are considered, and (iii) We show that by combining multiple modalities we can achieve even better results.
