IEEE/ACM Joint Conference on Digital Libraries 2014
DOI: 10.1109/jcdl.2014.6970165
|View full text |Cite
|
Sign up to set email alerts
|

Combining domain-specific heuristics for author name disambiguation

Abstract: Author name disambiguation has been one of the hardest problems faced by digital libraries since their early days. Historically, supervised solutions have empirically outperformed those based on heuristics, but with the burden of having to rely on manually labelled training sets for the learning process. Moreover, most supervised solutions just apply some type of generic machine learning solution and do not exploit specific knowledge about the problem. In this paper, we follow a similar reasoning, but in the o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
4
0
1

Year Published

2015
2015
2016
2016

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 16 publications
0
4
0
1
Order By: Relevance
“…Fan et al [5] used graph based framework for name disambiguation, and Hermansson et al [10] used graph kernels to calculate similarity based on local neighborhood structure. Instead of using machine learning algorithms, Santana et al [18] used domain specific heuristics for classification. Recently, Ventura et al [23] applied a random forest classifier with agglomerative clustering for inventor name disambiguation for USPTO database.…”
Section: Related Workmentioning
confidence: 99%
“…Fan et al [5] used graph based framework for name disambiguation, and Hermansson et al [10] used graph kernels to calculate similarity based on local neighborhood structure. Instead of using machine learning algorithms, Santana et al [18] used domain specific heuristics for classification. Recently, Ventura et al [23] applied a random forest classifier with agglomerative clustering for inventor name disambiguation for USPTO database.…”
Section: Related Workmentioning
confidence: 99%
“…Uma parte dos trabalhos desenvolvidos foi publicada e apresentada na ACM/IEEE Joint Conference on Digital Libraries (JCDL) [Santana et al 2014] (Qualis A2), a principal conferência mundial naárea. Uma versão estendida desse trabalho foi convidada para uma edição especial dos melhores artigos da conferência e aceita para publicação, após nova revisão, no periódico International Journal on Digital Libraries (IJDL, Qualis B3) [Santana et al 2015] .…”
Section: Principais Contribuiçõesunclassified
“…[23] and [12] use heuristics to automatically generate reference sets and use them, instead of labeled examples, as training data for the supervised methods. In [20] disambiguation is conducted using heuristics, with supervision being applied to optimize the heuristics parameters. We can also classify person name disambiguation methods based on the types of information they employ.…”
Section: Related Workmentioning
confidence: 99%