Proceedings of the 27th ACM International Conference on Information and Knowledge Management 2018
DOI: 10.1145/3269206.3271699

The Impact of Name-Matching and Blocking on Author Disambiguation

Abstract: In this work, we address the problem of blocking in the context of author name disambiguation. We describe a framework that formalizes different ways of name-matching to determine which names could potentially refer to the same author. We focus on name variations that arise from specifying a name with varying degrees of completeness (i.e., a full first name or only an initial). We extend this framework with a simple way to define traditional, new, and custom blocking schemes. Then, we evaluate different old and new schemes in …
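The idea of name-matching under different levels of completeness, and of grouping names into blocks, can be illustrated with a minimal sketch. This is not the paper's framework: the "Surname, Forename" input format, the prefix-based compatibility rule, and the helper names (`parse_name`, `compatible`, `block_key_initial`, `block_key_full`, `build_blocks`) are all assumptions made for illustration only.

```python
from collections import defaultdict


def parse_name(name):
    """Split a hypothetical 'Surname, Forename' string into lowercase parts."""
    surname, _, forename = name.partition(",")
    return surname.strip().lower(), forename.strip().lower().rstrip(".")


def compatible(a, b):
    """Assumed matching rule: same surname, and one forename is a prefix
    of the other (so the initial 'j' is compatible with 'john')."""
    (sur_a, fore_a), (sur_b, fore_b) = parse_name(a), parse_name(b)
    return sur_a == sur_b and (
        fore_a.startswith(fore_b) or fore_b.startswith(fore_a)
    )


def block_key_initial(name):
    """Blocking key: surname plus first initial of the forename."""
    surname, forename = parse_name(name)
    return (surname, forename[:1])


def block_key_full(name):
    """Blocking key: surname plus the forename exactly as given."""
    return parse_name(name)


def build_blocks(names, key):
    """Group names into blocks that share the same blocking key."""
    blocks = defaultdict(list)
    for name in names:
        blocks[key(name)].append(name)
    return dict(blocks)


names = ["Smith, John", "Smith, J.", "Smith, Jane", "Smith, Adam"]
coarse = build_blocks(names, block_key_initial)
fine = build_blocks(names, block_key_full)
```

In this toy example, the initial-based key puts "Smith, John", "Smith, J." and "Smith, Jane" into one block, while the full-forename key separates all four names, including "Smith, J." from "Smith, John" even though the two are compatible under the assumed rule. That trade-off between block size and missed matches is the kind of effect a blocking evaluation would measure.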

Cited by 19 publications
(26 citation statements)
References 19 publications
“…For this, a new disambiguation method may be evaluated under initialized versus full forename settings followed by feature importance assessment or in comparison with results disambiguated by string‐based matching. Especially, the latter suggestion supports the idea that string‐based matching results need to be baselines in evaluating author name disambiguation (Backes, ).…”
Section: Conclusion and Discussion (mentioning)
confidence: 54%
“…[8,17-19,24-27] While the others try to utilize multiple heterogeneous information for user account alignment, such as user the social relations [6,7,9,10,12-15,28], user interests [4,7,10,29], user temporal distribution features [4,6,11,28,30]. To most of the studies on user account alignment [4,8,12-17,19,24-27], the account name information is very important, since many users like to assign their accounts in different networks with very similar names, and the account names in most networks are very easy to be acquired. And how to properly utilize the name information in the alignment of accounts owned by the English users have already been well studied by many works [8,16,17,19,24,26,27], among them: Vosecky et al …”
Section: Related Work (mentioning)
confidence: 99%
“…[4,6,11,28,30] To most of the studies on user account alignment [4,8,12-17,19,24-27], the account name information is very important, since many users like to assign their accounts in different networks with very similar names, and the account names in most networks are very easy to be acquired. And how to properly utilize the name information in the alignment of accounts owned by the English users have already been well studied by many works [8,16,17,19,24,26,27], among them: Vosecky et al [31] propose a method that based on web profile matching to connect users between Facebook and StudiVZ. In their study, they compare three kinds of name matching algorithms, and select the best one for profile matching.…”
Section: Related Work (mentioning)
confidence: 99%
“…Cast in this light, many existing clustering models are not very suitable for the author name disambiguation problem. Meanwhile, cost-effective blocking technique [1] and lightweight rule-based methods [2,22] are worthy of research as they have been proven to achieve convincing precision in this problem.…”
Section: Introduction (mentioning)
confidence: 99%