2004
DOI: 10.1007/978-3-540-30214-8_10

Measuring the Similarity for Heterogenous Data: An Ordered Probability-Based Approach

Abstract: In this paper we propose a solution to measuring similarity for heterogenous data. The key idea is to consider the similarity of a given attribute-value pair as the probability of randomly picking a value pair that is less similar than or equally similar to it, in terms of order relations defined appropriately for each data type. Similarities of attribute-value pairs are then integrated into similarities between data objects using a statistical method. Applying our method in combination with distance-based …
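The key idea in the abstract can be illustrated for a single ordered (numeric or ordinal) attribute. The sketch below is only an illustration under stated assumptions: the function name `pair_similarity`, the use of the empirical value distribution, and the interval-containment order ("a pair is less similar than or equally similar to (x, y) if its interval contains the interval of (x, y)") are assumptions chosen for the example, since the truncated abstract does not give the order relations themselves.

```python
from collections import Counter
from itertools import product


def pair_similarity(x, y, observed):
    """Probability that a randomly picked value pair (u, v) is less similar
    than or equally similar to (x, y).

    Assumptions (not taken from the abstract):
    - u and v are drawn independently from the empirical distribution
      of `observed`;
    - (u, v) counts as "less or equally similar" when the interval
      [min(u, v), max(u, v)] contains [min(x, y), max(x, y)].
    """
    counts = Counter(observed)
    total = len(observed)
    lo, hi = min(x, y), max(x, y)

    prob = 0.0
    for u, v in product(counts, repeat=2):
        if min(u, v) <= lo and hi <= max(u, v):
            prob += (counts[u] / total) * (counts[v] / total)
    return prob


values = [1, 2, 3, 4]
print(pair_similarity(2, 2, values))  # identical values: many pairs are less similar
print(pair_similarity(1, 4, values))  # extreme values: few pairs are less similar
```

Under these assumptions, a pair of identical values receives a higher similarity than a pair spanning the whole value range, because far more randomly drawn pairs are at least as far apart as the former.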

Cited by 7 publications (1 citation statement)
References 8 publications
“…that is, Sim(a, b) is the tail probability of the χ²(2n) [LH04], [RPR08]. Figure (7) shows the density and distributions functions, and the tail probability for a χ² distribution.…”
Section: B Combining the Probability Based Similarity Across Dimensions (mentioning)
confidence: 98%
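
The citation statement describes the combined similarity as the tail probability of a χ²(2n) distribution. A minimal sketch of how such a combination could be computed, assuming it follows Fisher's method (the statistic is −2 Σ ln pᵢ over the n per-attribute probabilities, which the statement itself does not spell out). The function name `combine_similarities` is hypothetical. For an even number of degrees of freedom the χ² tail probability has a closed form, so only the standard library is needed.

```python
import math


def combine_similarities(probs):
    """Combine n per-attribute probabilities into one similarity.

    Assumption: Fisher's method. The statistic X = -2 * sum(ln p_i) is
    compared against a chi-square distribution with 2n degrees of freedom,
    and the tail probability P(chi2_2n >= X) is returned.
    """
    n = len(probs)
    if n == 0:
        raise ValueError("at least one probability is required")
    if any(p <= 0.0 for p in probs):
        return 0.0  # ln(0) diverges, so the tail probability tends to 0

    half_stat = -sum(math.log(p) for p in probs)  # this is X / 2

    # Closed-form survival function for even degrees of freedom 2n:
    # exp(-X/2) * sum_{k=0}^{n-1} (X/2)^k / k!
    term, series = 1.0, 1.0
    for k in range(1, n):
        term *= half_stat / k
        series += term
    return math.exp(-half_stat) * series


print(combine_similarities([0.9, 0.8, 0.95]))  # attributes mostly similar
print(combine_similarities([0.1, 0.2, 0.05]))  # attributes mostly dissimilar
```

With a single attribute the combined value equals the input probability, and high per-attribute probabilities yield a high combined similarity.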