2016
DOI: 10.1017/s1351324916000206
|View full text |Cite
|
Sign up to set email alerts
|

Improving mention detection for Basque based on a deep error analysis

Abstract: This paper presents the improvement process of a mention detector for Basque. The system is rule-based and takes into account the characteristics of mentions in Basque. A classification of error types is proposed based on the errors that occur during mention detection. A deep error analysis distinguishing error types and causes is presented and improvements are proposed. At the final stage, the system obtains an F-measure of 74.57% under the Exact Matching protocol and of 80.57% under Lenient Matching. We also… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
3
2

Relationship

3
2

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 25 publications
0
5
0
Order By: Relevance
“…Mentions and coreference chains have been annotated semi-automatically based on previous layers in a 46,383-word subset of the EPEC corpus. Automatically annotated mentions obtained by our mention detector [36] were first corrected by linguists; then, coreferent mentions were linked in clusters. It is freely available and can be downloaded at: ixa2.si.ehu.eus/epec-koref/epec-koref_v1.0.tgz.…”
Section: Mention Structures In Basquementioning
confidence: 99%
See 1 more Smart Citation
“…Mentions and coreference chains have been annotated semi-automatically based on previous layers in a 46,383-word subset of the EPEC corpus. Automatically annotated mentions obtained by our mention detector [36] were first corrected by linguists; then, coreferent mentions were linked in clusters. It is freely available and can be downloaded at: ixa2.si.ehu.eus/epec-koref/epec-koref_v1.0.tgz.…”
Section: Mention Structures In Basquementioning
confidence: 99%
“…We created a mention detection system [36], defining a set of hand-crafted rules that have been compiled into Finite State Transducers (FST). These FSTs are able to detect complex structures that should be identified as mentions.…”
Section: System Architecturementioning
confidence: 99%
“…Nevertheless, a rule-based coreference resolution system (Soraluze et al, 2015) and a machine learning based system (Soraluze et al, 2016) have been developed. Both of which used a rule-based mention detector (Soraluze et al, 2017).…”
Section: Related Workmentioning
confidence: 99%
“…The part of the corpus we have used has about 45,000 words and it has been manually tagged at coreference level by two linguists (Ceberio et al, 2016). First of all, automatically tagged mentions obtained by a mention detector (Soraluze et al, 2016) have been corrected; then, coreferent mentions have been linked in clusters.…”
Section: Error Analysismentioning
confidence: 99%