Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
DOI: 10.18653/v1/2021.emnlp-main.35

GOLD: Improving Out-of-Scope Detection in Dialogues using Data Augmentation

Abstract: Practical dialogue systems require robust methods of detecting out-of-scope (OOS) utterances to avoid conversational breakdowns and related failure modes. Directly training a model with labeled OOS examples yields reasonable performance, but obtaining such data is a resource-intensive process. To tackle this limited-data problem, previous methods focus on better modeling the distribution of in-scope (INS) examples. We introduce GOLD as an orthogonal technique that augments existing data to train better OOS detectors…

Cited by 9 publications (7 citation statements). References 44 publications (39 reference statements).
“…As noted by Chen and Yu (2021), PersonaChat is particularly suitable for OOS detection of STAR data because it is a rich source of OOS dialogues. For FLOW, "Int."…”
Section: Results on OOS Detection
confidence: 99%
“…GOLD (Chen and Yu, 2021) is the data augmentation method most closely related to this work. Given a small set of annotated OOS dialogues (1% of the size of the INS data), GOLD replaces utterances with sentences selected from an external pool to generate new OOS dialogues.…”
Section: GOLD: Generating Out-of-Scope Labels
confidence: 99%
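For illustration, a minimal sketch of the utterance-replacement idea described in that statement, assuming dialogues are represented as lists of utterance strings. The function name, the random replacement position, and the random choice from the external pool are hypothetical simplifications; the actual GOLD method scores and filters candidate sentences rather than sampling them at random.

```python
import random

def augment_oos_dialogues(seed_dialogues, sentence_pool, n_new_per_seed=4, rng_seed=0):
    """Build pseudo-OOS dialogues by copying a seed OOS dialogue and swapping
    one utterance for a sentence drawn from an external pool (e.g. PersonaChat).
    Random choices here stand in for GOLD's candidate scoring and filtering."""
    rng = random.Random(rng_seed)
    augmented = []
    for dialogue in seed_dialogues:  # each dialogue is a list of utterance strings
        for _ in range(n_new_per_seed):
            new_dialogue = list(dialogue)
            position = rng.randrange(len(new_dialogue))          # utterance to replace
            new_dialogue[position] = rng.choice(sentence_pool)   # sentence from the external pool
            augmented.append(new_dialogue)
    return augmented

# Usage with hypothetical data:
# pseudo_oos = augment_oos_dialogues(seed_oos_dialogues, personachat_sentences)
```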
“…The same paper shows that we can treat OOD examples as an additional (n + 1)-th class and train the classification model jointly with the other IND classes. These approaches can be applied to OOD instances created artificially from IND training examples [17], or can enlarge known OOD training data [3] with the help of a pretrained language model.…”
Section: Related Work
confidence: 99%
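A minimal sketch of the (n + 1)-class setup mentioned above, assuming a simple bag-of-words classifier for concreteness; the function name, feature choice, and the "OOS" label string are illustrative, not the cited papers' implementation.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

def train_n_plus_1_classifier(ind_texts, ind_labels, ood_texts, ood_label="OOS"):
    """Train one classifier over the n IND intents plus a single extra OOD class;
    predicting `ood_label` at inference time flags an utterance as out of scope."""
    texts = list(ind_texts) + list(ood_texts)
    labels = list(ind_labels) + [ood_label] * len(ood_texts)
    model = make_pipeline(TfidfVectorizer(), LogisticRegression(max_iter=1000))
    model.fit(texts, labels)
    return model

# Usage with hypothetical data:
# clf = train_n_plus_1_classifier(ind_texts, ind_labels, ood_texts)
# clf.predict(["book me a table for tonight"])
```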
“…Latent perturbation maps text to a hidden state before mapping back to natural language text again (Zhao et al., 2018). Auxiliary datasets take advantage of external unlabeled data from a relevant domain to form new pseudo-labeled examples (Chen and Yu, 2021). Text generation uses large pre-trained models to create new examples (Devlin et al., 2018).…”
Section: Introduction
confidence: 99%
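As a rough sketch of the text-generation family named in that statement, the snippet below samples continuations from a pre-trained language model and treats them as candidate out-of-scope utterances. GPT-2, the prompt wording, and the lack of any filtering step are assumptions for illustration; the cited works differ in the model used and in how candidates are filtered before training.

```python
from transformers import pipeline

def generate_oos_candidates(prompts, n_per_prompt=3):
    """Sample continuations from a pre-trained language model and collect them
    as candidate OOS utterances for later filtering and labeling."""
    generator = pipeline("text-generation", model="gpt2")
    candidates = []
    for prompt in prompts:
        outputs = generator(
            prompt,
            max_new_tokens=30,
            num_return_sequences=n_per_prompt,
            do_sample=True,
        )
        candidates.extend(out["generated_text"] for out in outputs)
    return candidates

# Usage with a hypothetical prompt:
# oos_candidates = generate_oos_candidates(["Tell me something unrelated to banking:"])
```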