Flexible text generation for counterfactual fairness probing

Fryer, Zee; Packer, Ben; Beutel, Alex; Chen, Jilin; Webster, Kellie

doi:10.18653/v1/2022.woah-1.20

Cited by 5 publications

(2 citation statements)

References 21 publications

(27 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Large language models (LLMs) are becoming ubiquitous for their ability to solve a wide range of linguistic tasks with prompting that does not require additional model training [1,6,22]. This ability also lets them generate smaller, more refined datasets for finetuning [13,25,27], benchmarking [29], low-resource tasks or languages [4,15], and counterfactual testing (e.g., examples that are identical except for having different religious or gender-based identities [12]).…”

Section: Introductionmentioning

confidence: 99%

Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models

Reif,

Kahng,

Petridis

2023

2023 IEEE Visualization and Visual Analytics (VIS)

View full text Add to dashboard Cite

Figure 1: LinguisticLens, a new visualization tool for making sense of text datasets synthesized by large language models (LLMs) and analyzing the diversity of examples. (A) Each column represents a cluster of examples, where clustering is performed based on their syntax, tokens, or embeddings. Each example within the column is colored by part-of-speech (POS) tag, and has the dependency parse tree in gray. (B) In this example, users can easily find a group of examples very similar to each other. (C) Each cluster has a summary string, showing one of the most frequent subpattern across the examples. These text examples are generated with few-shot prompting on LLMs with (D) some seed examples.

show abstract

Section: Introductionmentioning

confidence: 99%

Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models

Reif,

Kahng,

Petridis

2023

2023 IEEE Visualization and Visual Analytics (VIS)

View full text Add to dashboard Cite

show abstract

“…We address these limitations of human red teaming with a "plug-and-play" AI-assisted Red Teaming (AART) pipeline for generating adversarial testing datasets at scale by minimizing the human effort to only guide the adversarial generation recipe. Our work builds on recent automated red teaming (Perez et al, 2022), synthetic safety data generation (Fryer et al, 2022;Hartvigsen et al, 2022;Bai et al, 2022;Sun et al, 2023) and human-in-theloop methods . We adapt work on self-consistency (Wang et al, 2023a), chain-ofthought (Kojima et al, 2023Wei et al, 2022), and structured reasoning and data generation (Wang et al, 2023b;Xu et al, 2023;Creswell and Shanahan, 2022) and creatively apply them to the task of adversarial dataset creation.…”

Section: Introductionmentioning

confidence: 99%