2015
DOI: 10.1007/s13278-015-0249-1
|View full text |Cite
|
Sign up to set email alerts
|

Name disambiguation from link data in a collaboration graph using temporal and topological features

Abstract: In a social community, multiple persons may share the same name, phone number or some other identifying attributes. This, along with other phenomena, such as name abbreviation, name misspelling, and human error leads to erroneous aggregation of records of multiple persons under a single reference. Such mistakes affect the performance of document retrieval, web search, database integration, and more importantly, improper attribution of credit (or blame). The task of entity disambiguation partitions the records … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
24
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 22 publications
(27 citation statements)
references
References 26 publications
3
24
0
Order By: Relevance
“…The research by Saha et al [16], and Hermansson et al [17] is most closely related to ours. These papers also only use topological information of the network for NDA.…”
Section: Related Worksupporting
confidence: 89%
“…The research by Saha et al [16], and Hermansson et al [17] is most closely related to ours. These papers also only use topological information of the network for NDA.…”
Section: Related Worksupporting
confidence: 89%
“…Moreover, ND does not assume the availability of a list of known unambiguous entity identifiers, such that an important part of the challenge is to identify which nodes are ambiguous in the first place. 1 The research by (Hermansson et al, 2013, Saha et al, 2015) is most closely related to ours. These papers also only use topological information of the network for ND.…”
Section: C) Related Worksupporting
confidence: 57%
“…as the sum of all the edges within cluster C i , W (C i , C i ) the sum of the for all the edges between cluster C i and the rest of the network C i , and k being the number of clusters in the graph. While (Hermansson et al, 2013) also worked on identitying nodes based on topological features, their method (which is not publicly available) performed worse in all the cases when compared to (Saha et al, 2015), so we only chose the latter as a competing baseline.…”
Section: Quantitative Evaluation Of Node Identificationmentioning
confidence: 99%
See 1 more Smart Citation
“…With the ubiquity of cloud computing, outsourcing is a major trend. A graph is widely used, such as the collaboration graph , the network graph , and chemical molecular graph . The graphs may also increasingly be outsourced to the cloud.…”
Section: Introductionmentioning
confidence: 99%