Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2023
DOI: 10.18653/v1/2023.acl-long.86
GEC-DePenD: Non-Autoregressive Grammatical Error Correction with Decoupled Permutation and Decoding

Abstract: Grammatical error correction (GEC) is an important NLP task that is currently usually solved with autoregressive sequence-to-sequence models. However, approaches of this class are inherently slow due to one-by-one token generation, so non-autoregressive alternatives are needed. In this work, we propose a novel non-autoregressive approach to GEC that decouples the architecture into a permutation network that outputs a self-attention weight matrix that can be used in beam search to find the best permutation of inp…

Cited by 3 publications (4 citation statements)
References 27 publications
“…Recent advances in GEC also explore non-autoregressive models, which provide faster performance by decoupling the error detection and correction processes, allowing for more dynamic responses to the identified errors [11]. The essence of this task can be succinctly summarized through the following formulation: given a sentence S_err laden with grammatical inaccuracies, the objective of English GEC is to generate a grammatically correct sentence S_corr, wherein S_corr retains the original intent and content of S_err to the greatest extent possible.…”
Section: GEC
confidence: 99%
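The S_err → S_corr formulation quoted above can be sketched as a minimal toy: a correction function maps an erroneous sentence to a corrected one while leaving all other content intact. The rule table and function below are hypothetical illustrations of the task format, not any cited model.

```python
# Toy illustration of the GEC task format: map an erroneous sentence
# S_err to a corrected sentence S_corr, preserving the original content.
# The substitution table is hypothetical, for demonstration only.

def correct(s_err: str, rules: dict[str, str]) -> str:
    """Apply word-level substitution rules; unmatched tokens pass through."""
    return " ".join(rules.get(tok, tok) for tok in s_err.split())

rules = {"goed": "went", "an": "a"}  # hypothetical rule table
s_err = "she goed to an store"
s_corr = correct(s_err, rules)
print(s_corr)  # she went to a store
```

Real GEC systems learn this mapping from data rather than from rules; the point here is only the input/output contract of the task.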
“…Gu et al. (2019) non-autoregressively refine an output sequence using language-agnostic insertions and deletions. Yakovlev et al. (2023) decompose the inference stage into permutation and decoding. First, a permutation network repositions the tokens of an input sequence with possible deletions and insertions.…”
Section: Synthetic Data
confidence: 99%
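The first inference stage described in this quote, reordering source tokens with possible deletions and insertion slots before a separate decoder fills the slots, can be sketched as follows. The index list and placeholder token are made-up illustrations, not the paper's actual interface.

```python
# Sketch of permutation-then-decoding inference: a predicted "order"
# lists source indices to keep (in output order); omitting an index
# deletes that token, and None marks a slot a decoder would fill later.
# The order values here are hypothetical, not model predictions.

INS = "<ins>"  # placeholder for a token to be inserted by the decoder

def apply_permutation(tokens: list[str], order: list) -> list[str]:
    """Reorder/delete tokens and mark insertion slots."""
    return [tokens[i] if i is not None else INS for i in order]

src = ["the", "the", "cat", "sat", "mat"]
order = [0, 2, 3, None, 4]  # drop duplicate "the", open one slot
print(apply_permutation(src, order))
# ['the', 'cat', 'sat', '<ins>', 'mat']
```

A second, separate network would then decode the `<ins>` slot (e.g. to "on"), which is the decoupling the quote refers to.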
“…Prior research incorporates detection results as supplementary information for Seq2Seq correction models (Yuan et al., 2021b; Li et al., 2023a). The methods proposed by Mallinson et al. (2020) and Yakovlev et al. (2023) employ the Masked Language Model (MLM) (Devlin et al., 2018) to obtain corrections, which are constrained by the number of masks. Chen et al. (2020) introduce error span detection and correction to address the GEC problem, which allows for flexible corrections while maximizing time efficiency.…”
Section: Detection-Correction GEC
confidence: 99%
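The mask-number constraint this quote mentions can be illustrated with a toy: the corrector may only fill the [MASK] slots that were inserted, so the length of each corrected span is fixed in advance. The lookup-based "model" below is a hypothetical stand-in for a real MLM.

```python
# Toy sketch of mask-constrained correction: each [MASK] slot is
# filled with exactly one predicted token, so the number of masks
# bounds the size of the edit. The predictions are supplied by hand
# here; a real system would take them from an MLM.

def fill_masks(tokens: list[str], predictions: list[str]) -> list[str]:
    """Replace each [MASK] in order; counts must match (the constraint)."""
    slots = [i for i, t in enumerate(tokens) if t == "[MASK]"]
    if len(slots) != len(predictions):
        raise ValueError("number of predictions must equal number of masks")
    out = list(tokens)
    for i, pred in zip(slots, predictions):
        out[i] = pred
    return out

sent = ["she", "[MASK]", "to", "school"]
print(fill_masks(sent, ["went"]))  # ['she', 'went', 'to', 'school']
```

This rigidity is exactly what span-based approaches such as Chen et al. (2020), mentioned above, aim to relax.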
“…Multi-Encoder (Yuan et al., 2021a) encodes error categories as auxiliary information. GEC-DePenD (Yakovlev et al., 2023) obtains corrections with the MLM. TemplateGEC (Li et al., 2023a) uses the output of the GECToR model as supplementary information for Seq2Seq models.…”
Section: Model Settings
confidence: 99%