Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
DOI: 10.18653/v1/2021.naacl-main.277

Controllable Text Simplification with Explicit Paraphrasing

Abstract: Text Simplification improves the readability of sentences through several rewriting transformations, such as lexical paraphrasing, deletion, and splitting. Current simplification systems are predominantly sequence-to-sequence models that are trained end-to-end to perform all these operations simultaneously. However, such systems limit themselves to mostly deleting words and cannot easily adapt to the requirements of different target audiences. In this paper, we propose a novel hybrid approach that leverages li…

Cited by 33 publications (35 citation statements) | References 49 publications
“…Building on a Seq2Seq model, Zhang and Lapata (2017) used reinforcement learning to optimize a reward based on simplicity, fluency, and relevance. Recent methods build on Transformer (Vaswani et al., 2017) models by integrating external databases containing simplification rules (Zhao et al., 2018), using an additional loss function to generate diverse outputs (Kriz et al., 2019), combining syntactic rules (Maddela et al., 2021), and conditioning on length and syntactic and lexical complexity features (Martin et al., 2020a).…”
Section: Related Work
confidence: 99%
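As a rough illustration of the control-token conditioning mentioned in the statement above (Martin et al., 2020a), the sketch below prefixes a source sentence with length and similarity tokens before it is passed to a seq2seq simplification model. The token names, bucketing scheme, and the example ratios are hypothetical stand-ins for the general idea, not the exact scheme from the cited work.

```python
# Minimal sketch of control-token conditioning for simplification.
# Token names and bucketing are illustrative, not the cited authors' exact scheme.

def char_ratio(source: str, target: str) -> float:
    """Length ratio between the desired simplification and the source."""
    return len(target) / max(len(source), 1)

def bucket(value: float, step: float = 0.05) -> float:
    """Round a ratio into a coarse bucket so the tokens stay in a small vocabulary."""
    return round(round(value / step) * step, 2)

def add_control_tokens(source: str, char_r: float, lev_r: float) -> str:
    """Prefix the source with tokens that condition the decoder on target
    length and on how much paraphrasing (Levenshtein similarity) is allowed."""
    return f"<NbChars_{bucket(char_r)}> <LevSim_{bucket(lev_r)}> {source}"

# At training time the ratios are computed from the reference simplification;
# at inference time they are set by the user to steer the output.
src = "The incumbent was defeated in the subsequent election."
print(add_control_tokens(src, char_r=0.8, lev_r=0.6))
# -> "<NbChars_0.8> <LevSim_0.6> The incumbent was defeated in the subsequent election."
```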
“…Simplification methods can also be categorized as supervised or unsupervised. Supervised methods tend to perform better, but require aligned complex-simple sentence pairs for training (Zhang and Lapata, 2017; Guo et al., 2018; Kriz et al., 2019; Martin et al., 2020a,b; Maddela et al., 2021). Unsupervised methods do not need such training data but do not perform as well (Surya et al., 2019; Kumar et al., 2020; Zhao et al., 2020).…”
Section: Introduction
confidence: 99%
“…We then evaluate outputs from several modern simplification models (Zhang and Lapata, 2017; Dong et al., 2019; Martin et al., 2020; Maddela et al., 2021), as well as a fine-tuned T5 (Raffel et al., 2020) model. Compared to RNN-based models, Transformer-based ones tend to make less severe deletion and substitution errors; however, the pre-trained T5 produced more hallucinations on the more abstractive Newsela dataset.…”
Section: Introduction
confidence: 99%
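To make the T5 comparison above concrete, here is a minimal sketch of generating a candidate simplification with a generic seq2seq checkpoint from the Hugging Face transformers library. The `t5-small` checkpoint and the `simplify:` prefix are placeholders: the cited evaluation uses a T5 model fine-tuned on simplification data, which this sketch does not reproduce.

```python
# Minimal sketch: generate a candidate simplification with a generic T5
# checkpoint via Hugging Face transformers. The checkpoint and the
# "simplify:" task prefix are placeholders; a model actually fine-tuned on
# complex-simple pairs would be needed to obtain real simplifications.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")

source = "simplify: The incumbent was defeated in the subsequent election."
inputs = tokenizer(source, return_tensors="pt")
outputs = model.generate(**inputs, max_length=64, num_beams=4)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```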
“…It is useful for improving interpretability in natural language understanding tasks, including semantic textual similarity (Li and Srikumar, 2016) and question answering (Yao, 2014). Monolingual word alignment can also support the analysis of human editing operations (Figure 1) and improve model performance for text-to-text generation tasks, such as text simplification (Maddela et al., 2021) and neutralizing biased language (Pryzant et al., 2020). It has also been shown to be helpful for data augmentation and label projection.…”
Section: Introduction
confidence: 99%
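The point about word alignment supporting the analysis of editing operations can be sketched as follows: given a word-level alignment between a complex sentence and its simplification, unaligned source tokens read as deletions, unaligned target tokens as insertions, and aligned pairs with different surface forms as substitutions. The example alignment below is hand-specified for illustration only; in practice it would come from a monolingual word aligner.

```python
# Sketch: read edit operations off a monolingual word alignment.
# The example alignment is hand-crafted for illustration, not produced by any aligner.

def edit_operations(src_tokens, tgt_tokens, alignment):
    """alignment: set of (src_index, tgt_index) pairs."""
    aligned_src = {i for i, _ in alignment}
    aligned_tgt = {j for _, j in alignment}

    ops = []
    for i, j in sorted(alignment):
        if src_tokens[i].lower() == tgt_tokens[j].lower():
            ops.append(("KEEP", src_tokens[i], tgt_tokens[j]))
        else:
            ops.append(("SUBSTITUTE", src_tokens[i], tgt_tokens[j]))
    ops += [("DELETE", src_tokens[i], None) for i in range(len(src_tokens)) if i not in aligned_src]
    ops += [("INSERT", None, tgt_tokens[j]) for j in range(len(tgt_tokens)) if j not in aligned_tgt]
    return ops

src = "The incumbent was defeated in the subsequent election".split()
tgt = "The current leader lost the next election".split()
align = {(0, 0), (1, 2), (3, 3), (5, 4), (6, 5), (7, 6)}
for op in edit_operations(src, tgt, align):
    print(op)
```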
“…More specifically, we sample from the exact test set used in Table 2 in Maddela et al. (2021). [6] This annotator has annotated MultiMWA-MTRef. [7] https://arxiv.org/ [8] https://github.com/pkubowicz/opendetex…”
confidence: 99%