We take a collection of short texts, some human-written and others automatically generated, and ask subjects, who are unaware of each text's source, whether they perceive it as human-produced. We use this data to fine-tune a GPT-2 model to push it toward generating more human-like texts, and observe that the output of the fine-tuned model is indeed perceived as more human-like than that of the original model. In addition, we show that our automatic evaluation strategy correlates well with human judgements. We also run a linguistic analysis to unveil the characteristics of human- vs machine-perceived language.