However, prompt-based augmentation strategies are uncontrolled forms of generation, which may introduce generation mistakes into labeled datasets (Sahu et al., 2022; Chen et al., 2022; Meng et al., 2022). In contrast, other recent studies have proposed language augmentation strategies that use complex, highly controlled frameworks, often involving fine-tuning generators (Papangelis et al., 2021; Kulhánek et al., 2021). Such complex augmentation frameworks require larger amounts of seed data to maintain a ground-truth language distribution (Rosenbaum et al., 2022b; Kim et al., 2021b), and are more costly than prompting PLMs (Chen et al., 2022).