Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)
DOI: 10.18653/v1/2021.acl-long.13

Transferable Dialogue Systems and User Simulators

Abstract: One of the difficulties in training dialogue systems is the lack of training data. We explore the possibility of creating dialogue data through the interaction between a dialogue system and a user simulator. Our goal is to develop a modelling framework that can incorporate new dialogue scenarios through self-play between the two agents. In this framework, we first pre-train the two agents on a collection of source domain dialogues, which equips the agents to converse with each other via natural language. With …

Cited by 23 publications (12 citation statements)
References 23 publications
“…We calculate the turn accuracy (ACC), joint goal accuracy (JGA) and combined score (Comb) for these tasks, and compute the average score as the overall metric of model performance. Since few-shot learning has been gaining increasing attention as a way to assess model capability across dialog tasks [14,26,90], we conduct experiments under both a full-data setting, which uses all training data to fine-tune the models, and a few-shot setting, which uses only 10% of the training data. As shown in Table 3, SPACE-3 outperforms all baselines on all datasets. In the full-data and few-shot settings, SPACE-3 surpasses PPTOD* by 1.75 and 2.48 absolute average score respectively, indicating that SPACE-3 adapts better across all types of dialog tasks.…”
Section: Overall Comparison With PCMs
confidence: 99%
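The metrics named in the statement above can be made concrete with a short sketch. This is an illustrative implementation, not code from the cited papers: the dialogue state is modelled as a slot-to-value dictionary, and the slot names below are hypothetical examples.

```python
# Illustrative sketch (not from the cited papers): turn accuracy (ACC) counts
# individual slot predictions that match the gold value; joint goal accuracy
# (JGA) only credits turns whose *entire* predicted state is correct.

def turn_accuracy(pred_states, gold_states):
    """Fraction of gold slot-value pairs the model predicted correctly."""
    correct = total = 0
    for pred, gold in zip(pred_states, gold_states):
        for slot, value in gold.items():
            total += 1
            if pred.get(slot) == value:
                correct += 1
    return correct / total if total else 0.0

def joint_goal_accuracy(pred_states, gold_states):
    """Fraction of turns where the full predicted state equals the gold state."""
    matches = sum(p == g for p, g in zip(pred_states, gold_states))
    return matches / len(gold_states) if gold_states else 0.0

# Hypothetical two-turn dialogue: one slot is wrong in the second turn.
gold = [{"hotel-area": "north", "hotel-stars": "4"},
        {"hotel-area": "north", "hotel-stars": "4", "hotel-parking": "yes"}]
pred = [{"hotel-area": "north", "hotel-stars": "4"},
        {"hotel-area": "north", "hotel-stars": "3", "hotel-parking": "yes"}]

print(turn_accuracy(pred, gold))        # 4 of 5 slots correct -> 0.8
print(joint_goal_accuracy(pred, gold))  # 1 of 2 turns fully correct -> 0.5
```

The gap between the two numbers shows why JGA is the stricter metric: a single wrong slot invalidates the whole turn.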
“…Wang et al. (2020) treated the dialogue act as a sequence generation task to help the system generate replies at each step of the dialogue. To cover more diverse dialogue acts, Zhang et al. (2020c) devised a data augmentation method that considers multiple dialogue acts when generating system responses. Tseng et al. (2021) designed a model that first derives the correct dialogue act from the dialogue state, then represents it as a sequence of tokens and generates it with an LSTM. Their strategy for learning dialogue acts can handle multiple different acts within a conversation simultaneously.…”
Section: Response Generation
confidence: 99%
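The idea of representing a dialogue act as a token sequence, mentioned in the statement above, can be sketched briefly. The act names and serialization format below are assumptions for illustration, not the exact scheme used by the cited approach:

```python
# Hedged sketch: flattening a turn's dialogue acts (act, slot, value triples)
# into one token sequence, so a decoder can generate multiple acts
# left-to-right before producing the system response.

def serialize_acts(acts):
    """Flatten (act, slot, value) triples into a single token sequence."""
    tokens = []
    for act, slot, value in acts:
        tokens += [f"[{act}]", slot, *value.split()]
    return tokens

acts = [("inform", "area", "north"), ("request", "price", "?")]
print(serialize_acts(acts))
# ['[inform]', 'area', 'north', '[request]', 'price', '?']
```

Serializing acts this way is what lets a single sequence decoder handle several acts in one turn, as the quoted statement notes.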
“…JOUST (Tseng et al., 2021) designed a strategy to obtain the current dialogue act from the dialogue state. We denote one of their methods as JOUST+RL-turn-R. NoisyChannel (Liu et al., 2021) uses the noisy channel model (Yu et al., 2017) to decode dialogue acts and generate higher-quality responses.…”
Section: End-to-end Model
confidence: 99%
“…However, since each module is processed sequentially, errors in a preceding module can easily propagate to the following ones, and the performance of the entire system cannot be optimized (Tseng et al., 2021). This results in low overall dialogue performance.…” (Footnote 1: Our code is publicly available at https://github.com/nu-dialogue/post-processing-networks)
Section: Introduction
confidence: 99%