Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06) 2006
DOI: 10.1109/icdmw.2006.2
|View full text |Cite
|
Sign up to set email alerts
|

A Comparison of Personal Name Matching: Techniques and Practical Issues

Abstract: Technical-DOT-Reports-AT-cs-DOT-anu.edu.au A list of technical reports, including some abstracts and copies of some full reports may be found at:

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
181
2
10

Year Published

2009
2009
2019
2019

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 228 publications
(195 citation statements)
references
References 23 publications
1
181
2
10
Order By: Relevance
“…To our knowledge, it is the only available parallel corpus Arabic-English. Since this corpus is small, we decided to test on a parallel newspaper corpus which contains 11942 sentences extracted from ANN 4 , referred in the following as ‫ܥ‬ .…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To our knowledge, it is the only available parallel corpus Arabic-English. Since this corpus is small, we decided to test on a parallel newspaper corpus which contains 11942 sentences extracted from ANN 4 , referred in the following as ‫ܥ‬ .…”
Section: Resultsmentioning
confidence: 99%
“…Name matching can be defined such as the process of determining, whether two name strings are instances of the same name [4]. This task is not difficult, if the two languages use the same alphabet.…”
Section: Proper Names In Arabicmentioning
confidence: 99%
“…Compute the phonetic codification for each word from each transcription using a given codification algorithm. A general description and comparison of the codification algorithms used in our experiments can be found in [17], for further details we refer to [3,4,5,6,7].…”
Section: Constructing the Combined Representationmentioning
confidence: 99%
“…Surveys [8,9]. review the various approaches, including named attributes computations [5], schema mapping [2,17] and duplicate detection in hierarchical data [10], all which inform the construction of profile linkage techniques.…”
Section: Record Linkage and Entity Resolutionmentioning
confidence: 99%
“…We adopt the Jaro Winker metric, as it been reported to be one of the best performing [5] metrics for name-like feature. As many identities may have similar or even identical namesakes, the usernames alone are not sufficiently discriminative.…”
Section: Feature Selectionmentioning
confidence: 99%