2020
DOI: 10.48550/arxiv.2012.07004
Preprint

C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling

Abstract: Slot filling, a fundamental module of spoken language understanding, often suffers from insufficient quantity and diversity of training data. To remedy this, we propose a novel Cluster-to-Cluster generation framework for Data Augmentation (DA), named C2C-GenDA. It enlarges the training set by reconstructing existing utterances into alternative expressions while keeping the semantics. Different from previous DA works that reconstruct utterances one by one independently, C2C-GenDA jointly encodes multiple existing utterances…
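The framework operates on clusters of existing utterances that share the same semantics. As a rough illustration only (not the authors' implementation), the Python sketch below groups slot-filling training examples into such clusters by their slot-type combination; the helper names (semantic_frame, delexicalise, build_clusters), the toy data, and the exact definition of a cluster are assumptions made for this example.

from collections import defaultdict

def semantic_frame(tags):
    """Order-insensitive set of slot types realised in an utterance (from BIO tags)."""
    return frozenset(tag[2:] for tag in tags if tag.startswith("B-"))

def delexicalise(tokens, tags):
    """Replace slot values with <slot-type> placeholders so phrasings are comparable."""
    out = []
    for tok, tag in zip(tokens, tags):
        if tag == "O":
            out.append(tok.lower())
        elif tag.startswith("B-"):
            out.append(f"<{tag[2:]}>")
        # I-* tokens belong to an already-placeholdered slot value, so they are skipped
    return " ".join(out)

def build_clusters(examples):
    """Group (tokens, BIO-tags) training examples that realise the same semantic frame."""
    clusters = defaultdict(list)
    for tokens, tags in examples:
        clusters[semantic_frame(tags)].append(delexicalise(tokens, tags))
    return clusters

# Toy SNIPS-style examples: same semantics, different phrasings -> one cluster.
examples = [
    (["play", "Hello", "by", "Adele"], ["O", "B-song", "O", "B-artist"]),
    (["put", "on", "Hello", "from", "Adele"], ["O", "O", "B-song", "O", "B-artist"]),
]
for frame, members in build_clusters(examples).items():
    print(sorted(frame), "->", members)

A cluster-to-cluster generator would then take each such cluster as a single input and produce a new cluster of unseen phrasings for the same frame.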

Cited by 2 publications (4 citation statements) | References 34 publications

Citing publication: Generative Conversational Networks
Papangelis, Gopalakrishnan, Padmakumar et al. 2021 (Preprint)
“…We use the original train / validation / test splits provided with each dataset. For Restaurants8k, we randomly split the training set into training (80%) and … (Hou et al, 2020b) and SC-GPT (Peng et al, 2020b) on few-shot intent detection. We allow our learners to train for 5000 iterations.…”
[Table fragment from the citing paper: SNIPS-3: PROTODA 0.881, GCN-RL 0.822, GCN+RL 0.926]
Section: Methods (mentioning)
confidence: 99%
“…C2C-GenDA (cluster to cluster generation for data augmentation) (Hou et al, 2020b) is a generative data augmentation approach focused on slot filling. This method jointly encodes multiple realisations (i.e.…”
Section: Related Work (mentioning)
confidence: 99%
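The excerpt above highlights that C2C-GenDA jointly encodes multiple realisations instead of paraphrasing utterances one at a time. The following minimal sketch contrasts the two input regimes under the assumption of a plain seq2seq interface; the <sep> separator and the function names are hypothetical and not taken from the paper.

SEP = " <sep> "  # assumed separator token between cluster members

def joint_cluster_input(cluster):
    """Serialise every existing realisation into one encoder sequence, so the
    decoder conditions on all known phrasings at once and can be discouraged
    from simply copying any of them."""
    return SEP.join(cluster)

def one_by_one_inputs(cluster):
    """Baseline DA: each utterance is paraphrased independently of the others."""
    return list(cluster)

existing_cluster = [
    "play <song> by <artist>",
    "put on <song> from <artist>",
]

print("cluster-to-cluster encoder input:", joint_cluster_input(existing_cluster))
print("one-by-one encoder inputs:", one_by_one_inputs(existing_cluster))
# A generator trained on (existing cluster -> new cluster) pairs would decode a
# set of unseen realisations for the same frame, e.g. "i want to hear <song> by <artist>".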

“…In Table 4, we show a comparison with C2C-GenDA (Hou et al, 2020b) and SC-GPT (Peng et al, 2020b) on SNIPS. GCN outperforms C2C-GenDA while SC-GPT performs better than GCN, which is expected since it is based on GPT-2 (instead of distilGPT2) and fine-tuned on 400K additional dialogue act-utterance pairs.…”
Section: Intent Detection (mentioning)
confidence: 99%