Proceedings of the 10th International Conference on Semantic Systems 2014
DOI: 10.1145/2660517.2660520
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of NER tools w.r.t. a domain-specific vocabulary

Abstract: In this paper we compare several state-of-the-art Linked Data Knowledge Extraction tools, with regard to their ability to recognise entities of a controlled, domain-specific vocabulary. This includes tools that offer APIs as a Service, locally installed platforms as well as an UIMA-based approach as reference. We evaluate under realistic conditions, with natural language source texts from keywording experts of the Städel Museum Frankfurt. The goal is to find first hints which tool approach or strategy is more … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 4 publications
0
2
0
Order By: Relevance
“…Intangible cultural heritage is one of the specific domains that have not been explored yet for the Balkans in regard to NLP and specifically in the case of the Serbian language. Findings from different research studies have implied that specific contexts play a significant role in extracting the information from texts of specific domains (Heuss et al, 2014). This further implies that the entire infrastructurefrom handcrafted rules, specific ontology to dictionaries, using semantic structures and specific vocabulary for the detection of textual phrases potentially useful for NER and IE in generalmust be supplied for Serbian intangible cultural heritage texts.…”
Section: El 385/6mentioning
confidence: 99%
“…Intangible cultural heritage is one of the specific domains that have not been explored yet for the Balkans in regard to NLP and specifically in the case of the Serbian language. Findings from different research studies have implied that specific contexts play a significant role in extracting the information from texts of specific domains (Heuss et al, 2014). This further implies that the entire infrastructurefrom handcrafted rules, specific ontology to dictionaries, using semantic structures and specific vocabulary for the detection of textual phrases potentially useful for NER and IE in generalmust be supplied for Serbian intangible cultural heritage texts.…”
Section: El 385/6mentioning
confidence: 99%
“…The work by Steinmetz, Knuth, and Sack (2013) is also focused on the evaluation of semantic annotation tools; however, their attention is centered more on the statistical analysis of different benchmark and dictionary datasets that can be used in the evaluation process. Heuss, Humm, Henninger, and Rippl (2014) compared the performance of several state-of-the-art semantic annotation tools on domain specific texts (namely texts about museum collections). The study found that, on average, each tool achieved roughly just a third of its F1 score on texts covering general/common topics.…”
Section: Defaultmentioning
confidence: 99%