Proceedings of the 2009 Workshop on the People's Web Meets NLP Collaboratively Constructed Semantic Resources - People's Web '0 2009
DOI: 10.3115/1699765.1699767
|View full text |Cite
|
Sign up to set email alerts
|

Named entity recognition in Wikipedia

Abstract: Named entity recognition (NER) is used in many domains beyond the newswire text that comprises current gold-standard corpora. Recent work has used Wikipedia's link structure to automatically generate near gold-standard annotations. Until now, these resources have only been evaluated on newswire corpora or themselves. We present the first NER evaluation on a Wikipedia gold standard (WG) corpus. Our analysis of cross-corpus performance on WG shows that Wikipedia text may be a harder NER domain than newswire. We … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
40
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 82 publications
(45 citation statements)
references
References 16 publications
0
40
0
Order By: Relevance
“…The three geoparsers that integrate external NER software (GeoTxt, Topocluster, CLAVIN), leave a lot of room for improvement as shown in the tables. Wikipedia text proved to be the greater challenge, a finding mirrored by Balasuriya et al (2009). The ultimate goal for geotagging performance should be F-Scores of 0.9?…”
Section: Scoring In Geotaggingmentioning
confidence: 99%
“…The three geoparsers that integrate external NER software (GeoTxt, Topocluster, CLAVIN), leave a lot of room for improvement as shown in the tables. Wikipedia text proved to be the greater challenge, a finding mirrored by Balasuriya et al (2009). The ultimate goal for geotagging performance should be F-Scores of 0.9?…”
Section: Scoring In Geotaggingmentioning
confidence: 99%
“…NER often relies on a machine learning algorithm and an annotated dictionary (gazetteer). Several researchers have used Wikipedia for NER (Kazama & Torisawa 2007;Balasuriya et al 2009). Bunescu (2007) also aimed to derive new IE techniques with higher performance than existing ones using NER, named entity disambiguation and relation extraction.…”
Section: Named Entity Recognitionmentioning
confidence: 99%
“…As a second contribution, we propose a series of feature combination and recombination to attain better results. We test our intuitions on both NER and WSI/WSD tasks and over four different corpora: CoNLL-2003 [17], WikiNER and Wikigold [4] for NER; Semeval-2007 [1] for WSI/WSD.…”
Section: Introductionmentioning
confidence: 99%