Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study

Sankar, Chinnadhurai; Subramanian, Sandeep; Pal, Christopher; Chandar, Sarath; Bengio, Yoshua

doi:10.18653/v1/p19-1004

Cited by 84 publications

(92 citation statements)

References 29 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Results show that all of the REDfull models get larger PPL increases in most kinds of perturbations than origin models and thus more sensitive to history utterance perturbations. It proves that the dynamic (order) information is more e ectively used by RED models according to the premise in [21] that the more sensitive the model to perturbations, the stronger ability for it of modeling dynamics.…”

Section: 32mentioning

confidence: 94%

“…We conduct experiments on three multi-turn dialogue datasets with di erent styles, they are the bAbI dialog [4], the PersonaChat [35] and the Chinese customer service dataset (JDC) [34] respectively. Each dataset is split into train/valid/test sets according to the previous works [21,34]. Note that each multi-turn dialogue in the three datasets is processed to many history-response pairs with di erent history lengths.…”

Section: Methodsmentioning

confidence: 99%

“…These model structures proposed for the sequential data assume to capture the dynamics implicitly. Nevertheless, Sankar et al [21] point out that neither the recurrent neural network nor the Transformer neural network in the Seq2Seq framework fully captures the dynamic ow in the dialogue history. In their study, authors make the perturbations to the dialogue history, but surprisingly nd a marginal hurt to the generation performance.…”

Section: Motivationmentioning

confidence: 99%

“…The above three models are provided by the ParlAI [18] framework which is identical to the experiments in Sankar's empirical study [21]. The hyperparameters are exactly the same as which provided by their work for fair comparison on the bAbI and Per-sonaChat.…”

Section: Baselines Andmentioning

confidence: 99%

“…What's more, we also do utterance perturbation experiments on the PersonaChat test set as implemented in Sankar's empirical study [21]. We rst get the PPL for di erent models on the test set.…”

Section: 32mentioning

confidence: 99%

See 4 more Smart Citations

Ranking Enhanced Dialogue Generation

Hao

Pang

Lan

et al. 2020

Proceedings of the 29th ACM International Conference on Information &Amp; Knowledge Management

View full text Add to dashboard Cite

How to e ectively utilize the dialogue history is a crucial problem in multi-turn dialogue generation. Previous works usually employ various neural network architectures (e.g., recurrent neural networks, attention mechanisms, and hierarchical structures) to model the history. However, a recent empirical study by Sankar et al. has shown that these architectures lack the ability of understanding and modeling the dynamics of the dialogue history. For example, the widely used architectures are insensitive to perturbations of the dialogue history, such as words shu ing, utterances missing, and utterances reordering. To tackle this problem, we propose a Ranking Enhanced Dialogue generation framework in this paper. Despite the traditional representation encoder and response generation modules, an additional ranking module is introduced to model the ranking relation between the former utterance and consecutive utterances. Speci cally, the former utterance and consecutive utterances are treated as query and corresponding documents, and both local and global ranking losses are designed in the learning process. In this way, the dynamics in the dialogue history can be explicitly captured. To evaluate our proposed models, we conduct extensive experiments on three public datasets, i.e., bAbI, PersonaChat, and JDC. Experimental results show that our models produce better responses in terms of both quantitative measures and human judgments, as compared with the state-of-the-art dialogue generation models. Furthermore, we give some detailed experimental analysis to show where and how the improvements come from. CCS CONCEPTS • Computing methodologies → Discourse, dialogue and pragmatics; Natural language generation.

show abstract

Section: 32mentioning

confidence: 94%