2019 IEEE International Conference on Big Data (Big Data) 2019
DOI: 10.1109/bigdata47090.2019.9006095
|View full text |Cite
|
Sign up to set email alerts
|

Fast Record Linkage for Company Entities

Abstract: Record linkage is an essential part of nearly all real-world systems that consume structured and unstructured data coming from different sources. Typically no common key is available for connecting records. Massive data cleaning and data integration processes often have to be completed before any data analytics and further processing can be performed. Although record linkage is frequently regarded as a somewhat tedious but necessary step, it reveals valuable insights into the data at hand. These insights guide… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
16
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(17 citation statements)
references
References 18 publications
1
16
0
Order By: Relevance
“…We show that this problem can be solved with our Hybrid approach consisting of a set of rules and a supervised ML method, or our DL approach. Thus, we confirm and support the statements of Govind et al [8] and Gschwind et al [16] that ML procedures should be used for subtasks in RL and thus support the automation of the RL process. This statement is confirmed by our approach and encourages us to identify further general problems in RL and data preparation and investigate suitable ML solutions for these problems.…”
Section: Theoretical and Practical Implicationssupporting
confidence: 90%
See 4 more Smart Citations
“…We show that this problem can be solved with our Hybrid approach consisting of a set of rules and a supervised ML method, or our DL approach. Thus, we confirm and support the statements of Govind et al [8] and Gschwind et al [16] that ML procedures should be used for subtasks in RL and thus support the automation of the RL process. This statement is confirmed by our approach and encourages us to identify further general problems in RL and data preparation and investigate suitable ML solutions for these problems.…”
Section: Theoretical and Practical Implicationssupporting
confidence: 90%
“…This method describes our approach to analysing our eleven existing data sources (see table 1) and integrating various of them through a RL process to find general RL challenges for the real-world entity company. One of the most relevant attributes in company entity matching is the company name [15,16] which we will focus on in this paper. The legal form of a company is also an important attribute, as it is discriminatory when comparing companies [15].…”
Section: Motivation and Problem Statementmentioning
confidence: 99%
See 3 more Smart Citations