Contrastive Zero-Shot Learning for Cross-Domain Slot Filling with Adversarial Attack

He, Keqing; Zhang, Jinchao; Yan, Yuanmeng; Wang, Xu; Niu, Cheng; Zhou, Jie

doi:10.18653/v1/2020.coling-main.126

Cited by 21 publications

(24 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the second stage, our model encodes the slot entity and predicts the label for it by calculating the similarity with the slot prototypes in the label semantic space. Unlike the previous works (Liu et al, 2020b;He et al, 2020c) which directly utilize the slot name embedding as slot prototypes, we introduce Prototypical Contrastive learning and Label Confusion strategies (PCLC) strategies to dynamically refine the constraint relationship between slot prototypes in the semantic space, as shown in Fig 2 . In the training procedure, we use an MLP layer to encode the original slot name embedding. So we can obtain a dynamically updated slot prototype matrix.…”

Section: Overall Architecturementioning

confidence: 99%

“…• Contrastive Zero-Shot Learning with Adversarial Attack (CZSL-Adv) A method proposed by (He et al, 2020c) based on Coach, which utilizes contrastive learning and adversarial attacks to improve the performance and robustness of the framework. Implementation Details We follow the setup of (Liu et al, 2020b), selecting one domain as the target domain at a time, and use 500 samples in this domain as a validation set, the rest as a test set.…”

Section: Setupmentioning

confidence: 99%

“…The main drawback is that the model possibly predicts multiple slot types for one entity span. To avoid the above problem, (Liu et al, 2020b,a;He et al, 2020c) decompose the slot filling task into two stages. First, all slot entities in the utterances are identified by the coarse-grained binary sequence labeling model.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling

Wang¹,

Li²,

Liu³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Zero-shot cross-domain slot filling alleviates the data dependence in the case of data scarcity in the target domain, which has aroused extensive research. However, as most of the existing methods do not achieve effective knowledge transfer to the target domain, they just fit the distribution of the seen slot and show poor performance on unseen slot in the target domain. To solve this, we propose a novel approach based on prototypical contrastive learning with a dynamic label confusion strategy for zero-shot slot filling. The prototypical contrastive learning aims to reconstruct the semantic constraints of labels, and we introduce the label confusion strategy to establish the label dependence between the source domains and the target domain on-the-fly. Experimental results show that our model achieves significant improvement on the unseen slots, while also set new state-of-the-arts on slot filling task. 1

show abstract

Section: Overall Architecturementioning

confidence: 99%

Section: Setupmentioning

confidence: 99%

See 1 more Smart Citation

Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling

Wang¹,

Li²,

Liu³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…for pre-defined slot types, using character embedding (Liang et al, 2017a), copy mechanism (Zhao and Feng, 2018), few/zero-shot learning (Hu et al, 2019;He et al, 2020e;Shah et al, 2019), transfer learning (Chen and Moschitti, 2019;He et al, 2020c,b) and background knowledge (Yang and Mitchell, 2017;He et al, 2020d), etc. Compared to OOV recognition, our proposed novel slot detection task focuses on detecting unknown slot types, not just unseen values.…”

Section: Introductionmentioning

confidence: 99%

“…NSD aims to discover potential new or out-of-domain entity types to strengthen the capability of a dialogue system based on in-domain precollected training data. There are two aspects in the previous work related to NSD, out-of-vocabulary (OOV) recognition (Liang et al, 2017a;Zhao and Feng, 2018;Hu et al, 2019;He et al, 2020c,d;Yan et al, 2020;He et al, 2020e) and out-of-domain (OOD) intent detection (Lin and Xu, 2019;Larson et al, 2019;Xu et al, 2020a;Zeng et al, 2021b,a) 1: Comparison between slot filling and novel slot detection. In the novel slot detection labels, we consider "album" as an unknown slot type that is out of the scope of the pre-defined slot set.…”

Section: Introductionmentioning

confidence: 99%

Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue System

Wu¹,

Zeng²,

He³

et al. 2021

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer

Self Cite

View full text Add to dashboard Cite

Existing slot filling models can only recognize pre-defined in-domain slot types from a limited slot set. In the practical application, a reliable dialogue system should know what it does not know. In this paper, we introduce a new task, Novel Slot Detection (NSD), in the task-oriented dialogue system. NSD aims to discover unknown or out-of-domain slot types to strengthen the capability of a dialogue system based on in-domain training data. Besides, we construct two public NSD datasets, propose several strong NSD baselines, and establish a benchmark for future work. Finally, we conduct exhaustive experiments and qualitative analysis to comprehend key challenges and provide new guidance for future directions 1 .

show abstract

Cross-domain Slot Filling with Distinct Slot Entity and Type Prediction

Liu

Huang

Zhu

et al. 2021

Natural Language Processing and Chinese Computing

View full text Add to dashboard Cite

Contrastive Zero-Shot Learning for Cross-Domain Slot Filling with Adversarial Attack

Cited by 21 publications

References 20 publications

Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling

Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling

Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue System

Cross-domain Slot Filling with Distinct Slot Entity and Type Prediction

Contact Info

Product

Resources

About