Florian Petran scite author profile

This paper deals with means of evaluating inter-annotator agreement for a normalization task. This task differs from common annotation tasks in two important aspects: (i) the class of labels (the normalized wordforms) is open, and (ii) annotations can match to different degrees. We propose a new method to measure inter-annotator agreement for the normalization task. It integrates common chancecorrected agreement measures, such as Fleiss's κ or Krippendorff's α. The novelty of our proposed method lies in the way the annotated word forms are treated. First, they are evaluated character-wise; second, certain characters are mapped to more general categories.

show abstract

Applying Rule-Based Normalization to Different Types of Historical Texts—An Evaluation

Bollmann

Petran

Dipper

2014

View full text Add to dashboard Cite

ReM: A reference corpus of Middle High German -- corpus compilation, annotation, and access

Petran¹,

Bollmann²,

Dipper³

et al. 2016

JLCL

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Florian Petran

CorA: A web-based annotation tool for historical and other non-standard language data

Applying Rule-Based Normalization to Different Types of Historical Texts—An Evaluation

Evaluating Inter-Annotator Agreement on Historical Spelling Normalization

Applying Rule-Based Normalization to Different Types of Historical Texts—An Evaluation

ReM: A reference corpus of Middle High German -- corpus compilation, annotation, and access

Contact Info

Product

Resources

About