Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)
DOI: 10.18653/v1/2021.emnlp-main.239
Is this the end of the gold standard? A straightforward reference-less grammatical error correction metric

Abstract: It is difficult to rank and evaluate the performance of grammatical error correction (GEC) systems, as a sentence can be rewritten in numerous correct ways. A number of GEC metrics have been used to evaluate proposed GEC systems; however, each metric relies on either a comparison with one or more reference texts (the so-called gold standard for reference-based metrics) or a separate annotated dataset to fine-tune the reference-less metric. Reference-based systems have a low correlation with human judgement…

Cited by 2 publications (1 citation statement)
References 13 publications (24 reference statements)
“…Scribendi Score. The Scribendi Score (Islam and Magnani 2021) was designed to be simpler than other reference-less metrics in that it requires neither an existing GEC system nor fine-tuning. Instead, it calculates an absolute score (1 = positive, -1 = negative, 0 = no change) from a combination of language model perplexity (GPT-2; Radford et al. 2019) and sorted-token/Levenshtein distance ratios, which respectively ensure that i) the corrected sentence is more probable than the original and ii) the two sentences are not significantly different from each other.…”
Section: Reference-less Metrics (citation type: mentioning)
confidence: 99%
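
The citation statement above describes the metric only at a high level. As a rough illustration, the following minimal Python sketch implements a Scribendi-style reference-less check under several assumptions that go beyond this page: GPT-2 perplexity via the Hugging Face transformers library, a character-level Levenshtein ratio, a 0.8 similarity threshold, and taking the maximum of the two ratios as the combination rule. The function name scribendi_style_score and these parameter choices are illustrative, not the authors' published configuration.

import math
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def perplexity(sentence: str) -> float:
    # Mean cross-entropy of the sentence under GPT-2, exponentiated.
    ids = tokenizer(sentence, return_tensors="pt").input_ids
    with torch.no_grad():
        loss = model(ids, labels=ids).loss
    return math.exp(loss.item())

def levenshtein(a: str, b: str) -> int:
    # Plain dynamic-programming edit distance (insert/delete/substitute).
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

def similarity(a: str, b: str) -> float:
    # 1.0 for identical strings, decreasing as the edit distance grows.
    return 1 - levenshtein(a, b) / max(len(a), len(b), 1)

def scribendi_style_score(source: str, correction: str, threshold: float = 0.8) -> int:
    # Returns 1 (accepted correction), -1 (rejected), or 0 (no change),
    # mirroring the 1 / -1 / 0 scheme described in the citation above.
    if source == correction:
        return 0
    # i) the correction should be more probable (lower perplexity) than the source
    if perplexity(correction) >= perplexity(source):
        return -1
    # ii) the two sentences should not differ too much: a Levenshtein-distance
    #     ratio plus a sorted-token variant that ignores word reordering
    lev_ratio = similarity(source, correction)
    tok_ratio = similarity(" ".join(sorted(source.split())),
                           " ".join(sorted(correction.split())))
    return 1 if max(lev_ratio, tok_ratio) >= threshold else -1

print(scribendi_style_score("He go to school yesterday.", "He went to school yesterday."))

Used this way, the score rewards corrections that lower perplexity without straying far from the original wording, which is the trade-off the citation statement attributes to the metric.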