Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
DOI: 10.18653/v1/2021.findings-acl.137
Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning

Abstract: Pretrained language models (PLMs) perform poorly under adversarial attacks. To improve the adversarial robustness, adversarial data augmentation (ADA) has been widely adopted to cover more search space of adversarial attacks by adding textual adversarial examples during training. However, the number of adversarial examples for text augmentation is still extremely insufficient due to the exponentially large attack search space. In this work, we propose a simple and effective method to cover a much larger propor…
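The adversarial data augmentation (ADA) described in the abstract — enlarging the training set with adversarial variants of each example — can be sketched as follows. This is a minimal illustration, not the paper's method: the synonym table and the `perturb` substitution rule are hypothetical stand-ins for a real textual attack.

```python
import random

# Hypothetical synonym table standing in for a real attack's substitution source
SYNONYMS = {"good": ["great", "fine"], "bad": ["poor", "awful"]}

def perturb(sentence, rng):
    """Swap one known word for a synonym -- a toy stand-in for a textual attack."""
    words = sentence.split()
    candidates = [i for i, w in enumerate(words) if w in SYNONYMS]
    if not candidates:
        return sentence
    i = rng.choice(candidates)
    words[i] = rng.choice(SYNONYMS[words[i]])
    return " ".join(words)

def augment(dataset, k=2, seed=0):
    """Return the original (text, label) pairs plus k perturbed variants each."""
    rng = random.Random(seed)
    out = list(dataset)
    for text, label in dataset:
        out.extend((perturb(text, rng), label) for _ in range(k))
    return out

train = [("the movie was good", 1), ("the plot was bad", 0)]
augmented = augment(train)  # originals first, then adversarial copies
```

Even with many variants per example, such enumeration covers only a sliver of the exponential substitution space — the gap the paper's mixup-based coverage targets.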


Cited by 30 publications (25 citation statements)
References 28 publications (28 reference statements)
“…Most are discussed in Emmery et al (2021); however, some new work is specifically of interest to data augmentation, such as improving the substitutions using beam search (Zhao et al, 2021, as opposed to the simultaneous rollout we used in the current work). More broadly, adversarial training (Si et al, 2021; Pan et al, 2021), implementing more robust stylometric features (Markov et al, 2021), or model-based weightings of the augmentation models could be explored; e.g., by selecting instances with a generation model in the loop (Anaby-Tavor et al, 2020). This could be a particularly worthwhile option when focusing on conversation scopes, rather than message-level cyberbullying content (Emmery et al, 2019).…”
Section: Augmentation for Robustness
confidence: 99%
“…And the relevant keywords which categorise the transformation. t/evaluation data splits allows for testing the robustness of models and for identifying possible biases; on the other hand, applying transformations and filters to training data (data augmentation) allows for possibly mitigating the detected robustness and bias issues (Wang et al, 2021b; Pruksachatkun et al, 2021; Si et al, 2021).…”
Section: Format of a Transformation
confidence: 99%
“…For example, "Stillwater is not a 2010 American live-action/animated dark fantasy adventure film" turns into "Stillwater !is film". Zhang et al (2021) used a similar idea to this transformation.…”
Section: B97 Sentence Summarization
confidence: 99%
“…Given the original samples, Cheng et al [25] first construct their adversarial samples following [75], and then apply two Mixup strategies named P adv and P aut : the former interpolates between adversarial samples, and the latter interpolates between the two corresponding original samples. Similarly, Sun et al [76], Bari et al [77], and Si et al [78] all apply such a Mixup method for text classification. Sun et al [76] propose Mixup-Transformer, which combines Mixup with a transformer-based pretrained architecture.…”
Section: Mixup
confidence: 99%
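The interpolation step shared by the Mixup strategies quoted above can be sketched in a few lines. This is a generic Mixup sketch, not the cited papers' exact pipeline: the 768-dimensional sentence embeddings and one-hot labels are hypothetical, and whether the pair comes from two adversarial samples (P adv) or two original samples (P aut) only changes which vectors are fed in.

```python
import numpy as np

def mixup(x1, x2, y1, y2, alpha=0.4, rng=None):
    """Convexly interpolate two embedding vectors and their one-hot labels.

    The mixing weight lam is drawn from Beta(alpha, alpha), as in the
    original Mixup formulation; the same lam is applied to inputs and labels.
    """
    rng = rng or np.random.default_rng()
    lam = rng.beta(alpha, alpha)
    x = lam * x1 + (1.0 - lam) * x2
    y = lam * y1 + (1.0 - lam) * y2
    return x, y

# Two hypothetical sentence embeddings with one-hot class labels
emb_a, lab_a = np.random.randn(768), np.array([1.0, 0.0])
emb_b, lab_b = np.random.randn(768), np.array([0.0, 1.0])

# P aut-style: mix two original samples (P adv would mix their adversarial
# counterparts instead; the interpolation itself is identical)
x_mix, y_mix = mixup(emb_a, emb_b, lab_a, lab_b)
```

Because the mixed label stays a convex combination summing to one, the model is trained with a soft target that reflects how far the virtual sample sits between the two classes.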