2016 IEEE Tenth International Conference on Semantic Computing (ICSC) 2016
DOI: 10.1109/icsc.2016.54
|View full text |Cite
|
Sign up to set email alerts
|

Towards Cleaning-Up Open Data Portals: A Metadata Reconciliation Approach

Abstract: This paper presents an approach for metadata reconciliation, curation and linking for Open Governamental Data Portals (ODPs). ODPs have been lately the standard solution for governments willing to put their public data available for the society. Portal managers use several types of metadata to organize the datasets, one of the most important ones being the tags. However, the tagging process is subject to many problems, such as synonyms, ambiguity or incoherence, among others. As our empiric analysis of ODPs sh… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 15 publications
(5 citation statements)
references
References 18 publications
0
3
0
1
Order By: Relevance
“…Figure 9: Averagenot "Relative" -data quality of all open data portals with respect to each quality sub-dimension (cf., Table 2) in this paper an Open Data Portal Quality (ODPQ) dashboard, which is dynamic and enables any open data end-user/stakeholder to easily assess/rank open data portals based on multiple quality dimensions and personal preferences. Our research work purely analyzes the state and quality of the metadata, providing useful quality indicators for applications that use the metadata such as in (Tygel et al, 2016;Zuiderwijk et al, 2016Zuiderwijk et al, , 2012b. From a theoretical standpoint, AHP is used to properly deal with such multiple indicators, while enabling endusers to adjust their preferences regarding the one or more of these indicators.…”
Section: Resultsmentioning
confidence: 99%
“…Figure 9: Averagenot "Relative" -data quality of all open data portals with respect to each quality sub-dimension (cf., Table 2) in this paper an Open Data Portal Quality (ODPQ) dashboard, which is dynamic and enables any open data end-user/stakeholder to easily assess/rank open data portals based on multiple quality dimensions and personal preferences. Our research work purely analyzes the state and quality of the metadata, providing useful quality indicators for applications that use the metadata such as in (Tygel et al, 2016;Zuiderwijk et al, 2016Zuiderwijk et al, , 2012b. From a theoretical standpoint, AHP is used to properly deal with such multiple indicators, while enabling endusers to adjust their preferences regarding the one or more of these indicators.…”
Section: Resultsmentioning
confidence: 99%
“…Tygel et al [19] present a system to link datasets from different Open Data portals by extracting the tags and keywords from metadata descriptions: the tags get reconciled using automated translations and similarity measures, and re-published using unique URIs and meta-information for the reconciled tags. Again, specifically, links to organizations and temporal changes were not taken into account in this approach.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Even though multilingual labels can be defined on Wikidata, this is often not sufficient for our case. For example, the concept Russian Federal State Statistics Service (Федеральная служба статистики) does exist on Wikidata 19 -but there is no Russian label defined for it as of April 2020. Yet, this label appears more than 3,000 times in the ODPW data base as publisher.…”
Section: Challenge Analysismentioning
confidence: 99%
“…O problemaé que as tags normalmente são atribuídas por gestores de datasets de forma livre, estando sujeitasà ambiguidade e subjetividade. Técnicas já foram desenvolvidas para limpar, conciliar e enriquecer tags de portais de dados abertos de governo, como por exemplo em [Tygel et al 2016], porém uma forma que parece interessanteé tentar enriquecer semanticamente tags a partir dos dados e de extrações de fontes diversas.…”
Section: Conclusãounclassified