2016
DOI: 10.12716/1001.10.03.12
|View full text |Cite
|
Sign up to set email alerts
|

Named Entity Disambiguation for Maritime-related Data Retrieved from Heterogenous Sources

Abstract: The article concerns integration and disambiguation of data related to the maritime domain. A developed system is described, which collects and merges data about several maritime-related entities (vessels, vessel types, ports, companies etc.) retrieved from different internet sources and feeds the data into a single database. This process is however not trivial. There are few challenges, which need to be faced to successfully conduct it. Firstly, in different sources, entities may be referenced to in different… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
2
1
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…Therefore, appropriate methods were developed to alleviate these quality issues. The process of internet sources selection and data fusion was described in detail in separate papers ( [35] and [19] respectively). As a result a vast amount of ancillary data for all types of vessels was acquired, such as tonnage, dimensions, detailed type, built year, builder, home port, data about detentions and inspections of ships as well as data about classification statuses of ships and their affiliation to a classification society.…”
Section: Methodsmentioning
confidence: 99%
“…Therefore, appropriate methods were developed to alleviate these quality issues. The process of internet sources selection and data fusion was described in detail in separate papers ( [35] and [19] respectively). As a result a vast amount of ancillary data for all types of vessels was acquired, such as tonnage, dimensions, detailed type, built year, builder, home port, data about detentions and inspections of ships as well as data about classification statuses of ships and their affiliation to a classification society.…”
Section: Methodsmentioning
confidence: 99%
“…The SIMMO system retrieves data from many sources, and each of these sources may a have different structure and may publish data in a different way. Therefore, a separate Data Acquisition Module (DAM) was developed for each data source (Małyszko et al, 2016). DAMs connect to the data source in a defined manner, send appropriate requests, collect the documents returned, and extract required data.…”
Section: Data Retrieval and Disambiguationmentioning
confidence: 99%
“…Such methods were developed within the SIMMO project in order to address this challenge. They are presented in detail in another paper (Małyszko et al, 2016).…”
Section: Data Disambiguation and Fusionmentioning
confidence: 99%