Multi-Level Memory for Task Oriented Dialogs

Reddy, Revanth Gangi; Contractor, Danish; Raghu, Dinesh; Joshi, Sachindra

doi:10.18653/v1/n19-1375

Cited by 46 publications

(27 citation statements)

References 17 publications


“…Madotto et al (2018) combines end-to-end memory network (Sukhbaatar et al, 2015) into sequence generation. Gangi Reddy et al (2019) proposes a multi-level memory architecture which first addresses queries, followed by results and finally each key-value pair within a result. Wu et al (2019a) proposes a global-to-locally pointer mechanism to query the knowledge base.…”

Section: Related Work (mentioning)

confidence: 99%
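The three-level addressing described in the statement above (queries, then results, then key-value pairs within a result) can be illustrated with a small sketch. This is not the authors' implementation: the function name `multi_level_attention`, the dictionary layout of the memory, the dot-product scoring, and the hand-picked two-dimensional vectors are all assumptions made for illustration. The KB values are borrowed from the navigation example quoted further down this page.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a non-empty list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def multi_level_attention(state, memory):
    """Attend over queries, then results, then key-value cells.

    `memory` is a hypothetical nested layout (assumed non-empty at each level):
      [{"vec": [...], "results": [{"vec": [...],
                                   "cells": [{"key_vec": [...], "value": str}]}]}]
    Returns a dict mapping each value to the product of the three
    attention weights along its path, summed over duplicate values.
    """
    dist = {}
    query_att = softmax([dot(state, q["vec"]) for q in memory])
    for qa, query in zip(query_att, memory):
        result_att = softmax([dot(state, r["vec"]) for r in query["results"]])
        for ra, result in zip(result_att, query["results"]):
            cell_att = softmax([dot(state, c["key_vec"]) for c in result["cells"]])
            for ca, cell in zip(cell_att, result["cells"]):
                dist[cell["value"]] = dist.get(cell["value"], 0.0) + qa * ra * ca
    return dist


# Toy memory: one query, two results, two cells each (vectors are made up).
state = [1.0, 0.0]
memory = [{
    "vec": [1.0, 0.0],
    "results": [
        {"vec": [2.0, 0.0], "cells": [
            {"key_vec": [3.0, 0.0], "value": "200 Alester Ave"},
            {"key_vec": [0.0, 1.0], "value": "2 miles"},
        ]},
        {"vec": [0.0, 1.0], "cells": [
            {"key_vec": [1.0, 0.0], "value": "899 Ames Ct"},
            {"key_vec": [0.0, 1.0], "value": "5 miles"},
        ]},
    ],
}]

dist = multi_level_attention(state, memory)
best_value = max(dist, key=dist.get)
```

Because each level is normalized separately, the path products form a single probability distribution over all values in the memory, which is one plausible way to let a decoder copy a KB value.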

“…Task-oriented dialogue systems (Young et al, 2013) help users to achieve specific goals such as restaurant reservation or navigation inquiry. In recent years, end-to-end methods in the literature usually take the sequence-to-sequence (Seq2Seq) model to generate a response from a dialogue history (Madotto et al, 2018; Gangi Reddy et al, 2019; Qin et al, 2019b; Wu et al, 2019a). Taking the dialogue in Figure 1 as an example, to answer the driver's query about the "gas station", the end-to-end dialogue system directly generates system response given the query and a corresponding knowledge base (KB).…”

Section: Introduction (mentioning)

confidence: 99%


Dynamic Fusion Network for Multi-Domain End-to-end Task-Oriented Dialog

Qin, Xiao, Che et al. 2020

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics


Recent studies have shown remarkable success in end-to-end task-oriented dialog system. However, most neural models rely on large training data, which are only available for a certain number of task domains, such as navigation and scheduling. This makes it difficult to scalable for a new domain with limited labeled data. However, there has been relatively little research on how to effectively use data from all domains to improve the performance of each domain and also unseen domains. To this end, we investigate methods that can make explicit use of domain knowledge and introduce a shared-private network to learn shared and specific knowledge. In addition, we propose a novel Dynamic Fusion Network (DF-Net) which automatically exploit the relevance between the target domain and each domain. Results show that our model outperforms existing methods on multi-domain dialogue, giving the state-of-the-art in the literature. Besides, with little training data, we show its transferability by outperforming prior best model by 13.9% on average.

[Figure 1: example dialogue between Driver and Car, with the corresponding knowledge base (columns: Address, Distance, POI type, POI, Traffic info).]


“…SDS (Wen et al, 2017; Williams et al, 2017) use hand-crafted states and state annotations on every utterance in the dialogs - a significant human supervision. End-to-end TOD systems (Reddy et al, 2019; Wu et al, 2019) do not require state annotations but just the KB query annotations. There exist approaches (Chen et al, 2013, 2015) to induce state annotations in SDS, but we are the first to induce query annotations in end-to-end TOD systems.…”

Section: Background and Related Work (mentioning)

confidence: 99%

“…An example TOD is shown in Figure 1, where during the conversation (at turn 2), the agent queries the KB based on the user needs, and then suggests the Peking Restaurant based on the retrieved results. Existing end-to-end approaches (Bordes and Weston, 2017;Madotto et al, 2018;Reddy et al, 2019) learn to formulate KB queries using manually annotated queries.…”

Section: Introduction (mentioning)

confidence: 99%

Unsupervised Learning of KB Queries in Task-Oriented Dialogs

Raghu, Gupta 2021

Transactions of the Association for Computational Linguistics

Self Cite


Task-oriented dialog (TOD) systems often need to formulate knowledge base (KB) queries corresponding to the user intent and use the query results to generate system responses. Existing approaches require dialog datasets to explicitly annotate these KB queries—these annotations can be time consuming, and expensive. In response, we define the novel problems of predicting the KB query and training the dialog agent, without explicit KB query annotation. For query prediction, we propose a reinforcement learning (RL) baseline, which rewards the generation of those queries whose KB results cover the entities mentioned in subsequent dialog. Further analysis reveals that correlation among query attributes in KB can significantly confuse memory augmented policy optimization (MAPO), an existing state of the art RL agent. To address this, we improve the MAPO baseline with simple but important modifications suited to our task. To train the full TOD system for our setting, we propose a pipelined approach: it independently predicts when to make a KB query (query position predictor), then predicts a KB query at the predicted position (query predictor), and uses the results of predicted query in subsequent dialog (next response predictor). Overall, our work proposes first solutions to our novel problem, and our analysis highlights the research challenges in training TOD systems without query annotation.


“…In this task, conventional approaches combine Natural Language Understanding (NLU), DST, Dialogue Policy, and NLG, into a pipeline architecture (Wen et al, 2017; Bordes et al, 2016; Liu and Lane, 2017; Liu and Perez, 2017; Williams et al, 2017; Zhao et al, 2017; Jhunjhunwala et al, 2020). Another framework does not explicitly modularize these components but incorporate them through a sequence-to-sequence framework (Lei et al, 2018; Yavuz et al, 2019) and a memory-based entity dataset of triplets (Madotto et al, 2018; Gangi Reddy et al, 2019; Wu et al, 2019b). These approaches bypass dialogue state and/or act modeling and aim to generate output responses directly.…”

Section: Related Work (mentioning)

confidence: 99%

UniConv: A Unified Conversational Neural Architecture for Multi-domain Task-oriented Dialogues

Lê, Liu, Chen et al. 2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)


Building an end-to-end conversational agent for multi-domain task-oriented dialogues has been an open challenge for two main reasons. First, tracking dialogue states of multiple domains is non-trivial as the dialogue agent must obtain complete states from all relevant domains, some of which might have shared slots among domains as well as unique slots specifically for one domain only. Second, the dialogue agent must also process various types of information across domains, including dialogue context, dialogue states, and database, to generate natural responses to users. Unlike the existing approaches that are often designed to train each module separately, we propose "UniConv", a novel unified neural architecture for end-to-end conversational systems in multi-domain task-oriented dialogues, which is designed to jointly train (i) a Bi-level State Tracker which tracks dialogue states by learning signals at both slot and domain level independently, and (ii) a Joint Dialogue Act and Response Generator which incorporates information from various input components and models dialogue acts and target responses simultaneously. We conduct comprehensive experiments in dialogue state tracking, context-to-text, and end-to-end settings on the MultiWOZ2.1 benchmark, achieving superior performance over competitive baselines.
