Findings of the Association for Computational Linguistics: EMNLP 2020
DOI: 10.18653/v1/2020.findings-emnlp.70

Group-wise Contrastive Learning for Neural Dialogue Generation

Abstract: Neural dialogue response generation has gained much popularity in recent years. The Maximum Likelihood Estimation (MLE) objective is widely adopted in existing dialogue model learning. However, models trained with the MLE objective function are plagued by the low-diversity issue when it comes to the open-domain conversational setting. Inspired by the observation that humans not only learn from positive signals but also benefit from correcting undesirable behaviors, in this work, we introduce contrastive learning…
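
The abstract contrasts standard MLE training with a contrastive learning objective. As a rough illustration of that distinction (a minimal sketch, not the paper's actual formulation; the logits/targets tensors and the margin are placeholders), the following compares a token-level MLE loss with a simple response-level contrastive loss that requires a positive response to score higher than a sampled negative:

import torch
import torch.nn.functional as F

def mle_loss(logits, target_ids, pad_id=0):
    # Standard MLE: mean negative log-likelihood of the gold response tokens.
    # logits: (batch, seq_len, vocab); target_ids: (batch, seq_len)
    return F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),
        target_ids.reshape(-1),
        ignore_index=pad_id,
    )

def response_log_prob(logits, target_ids, pad_id=0):
    # Sum of per-token log-probabilities for each response in the batch.
    log_probs = F.log_softmax(logits, dim=-1)
    token_lp = log_probs.gather(-1, target_ids.unsqueeze(-1)).squeeze(-1)
    mask = (target_ids != pad_id).float()
    return (token_lp * mask).sum(dim=-1)

def contrastive_loss(pos_logits, pos_ids, neg_logits, neg_ids, margin=1.0):
    # Hinge-style contrast: the positive response should out-score the
    # negative response by at least `margin` (illustrative only).
    pos_score = response_log_prob(pos_logits, pos_ids)
    neg_score = response_log_prob(neg_logits, neg_ids)
    return F.relu(margin - (pos_score - neg_score)).mean()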

Cited by 22 publications (20 citation statements)
References 34 publications
“…Cai et al [20] introduced contrastive learning into dialogue generation, where the model explicitly perceives the difference between the well-chosen positive and negative utterances.…”
Section: Multi-head Attention
Mentioning confidence: 99%
“…In our case, the multi-turn dialogue data setup allows us to further utilize the context-response relationship and conduct hard negative sampling by using context-response matching models. Following (Cai et al, 2020), we consider training a Multi-hop Selector Network (MSN) (Yuan et al, 2019), which provides matching scores between the context and response inputs. Specifically, we construct a dialogue dataset in which each context input c is paired with one positive response sample x and multiple randomly sampled distractor response samples x_j.…”
Section: Improved CL With Hard Negative Sampling
Mentioning confidence: 99%
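
The statement above describes selecting hard negatives by scoring candidate responses against the dialogue context with a matching model (MSN in the cited work). A minimal sketch of that selection step, assuming a placeholder scorer match_score(context, response) that returns a context-response matching score:

import random

def sample_hard_negatives(context, positive, candidate_pool, match_score,
                          k=5, n_draw=50):
    # Draw random distractor responses, then keep those the matching model
    # scores highest against the context, i.e. the hardest negatives.
    distractors = random.sample(candidate_pool, min(n_draw, len(candidate_pool)))
    distractors = [r for r in distractors if r != positive]
    ranked = sorted(distractors, key=lambda r: match_score(context, r), reverse=True)
    return ranked[:k]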
“…For auto-regressive models, we consider Dialog-GPT (Zhang et al, 2019), a GPT-2-based model that is specifically designed for dialogue response generation. For dialogue response generation models that use contrastive learning, we include group-wise contrastive learning (GCL) (Cai et al, 2020), which conducts CL between the target dialogue model and a pretrained reference model. PLATO (Bao et al, 2019) is another model that uses a transformer-based architecture while including a discrete latent variable to tackle the one-to-many mapping problem.…”
Section: Baseline Models
Mentioning confidence: 99%
“…Here, the gains are evaluated using reconstruction loss. Finally, inspired by the contrastive learning paradigm (Cai et al, 2020; Chen et al, 2020a,b; Mitrovic et al, 2020), we propose relationship enhancement to increase the similarity between representations of data within the same group and differentiate the representations of data from different groups.…”
Section: Introduction
Mentioning confidence: 99%
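
The last statement describes pulling together representations within a group and pushing apart representations from different groups. A generic sketch of such a group-wise contrastive term (a supervised-contrastive-style loss under assumed inputs, not necessarily the cited papers' exact objective):

import torch
import torch.nn.functional as F

def group_contrastive_loss(reps, group_ids, temperature=0.1):
    # reps: (batch, dim) representations; group_ids: (batch,) group labels.
    # For each anchor, same-group items are positives, all others negatives.
    reps = F.normalize(reps, dim=-1)
    sim = reps @ reps.t() / temperature              # pairwise cosine similarities
    batch = reps.size(0)
    self_mask = torch.eye(batch, dtype=torch.bool, device=reps.device)
    pos_mask = (group_ids.unsqueeze(0) == group_ids.unsqueeze(1)) & ~self_mask
    sim = sim.masked_fill(self_mask, -1e9)           # exclude self-pairs
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    pos_counts = pos_mask.sum(dim=1)
    valid = pos_counts > 0                           # skip anchors with no positive
    loss = -(log_prob * pos_mask.float()).sum(dim=1)[valid] / pos_counts[valid]
    return loss.mean()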