2021
DOI: 10.48550/arxiv.2105.14445
Preprint
Modeling Text-visual Mutual Dependency for Multi-modal Dialog Generation

Abstract: Multi-modal dialog modeling is of growing interest. In this work, we propose frameworks to resolve a specific case of multi-modal dialog generation that better mimics multi-modal dialog generation in the real world, where each dialog turn is associated with the visual context in which it takes place. Specifically, we propose to model the mutual dependency between text-visual features, where the model not only needs to learn the probability of generating the next dialog utterance given preceding dialog utteranc…

Cited by 3 publications (1 citation statement)
References 72 publications
“…For two sentences of the same meaning, the probability of generating contexts given the two sentences should also be the same, which corresponds to the backward probability from sentences to contexts. This is akin to the bi-directional mutual-information based generation strategy (Fang et al., 2015; Li et al., 2016a; Li and Jurafsky, 2016; Wang et al., 2021). The backward probability can be modeled by predicting preceding contexts given subsequent contexts, p(c_{<i} | c_i, c_{>i}), and by predicting subsequent contexts given preceding contexts, p(c_{>i} | c_{<i}, c_i).…”
Section: Training Context-LM
Confidence: 99%
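The bi-directional mutual-information strategy referenced in the citation statement is commonly realized as a reranking objective that interpolates a forward model p(response | context) with a backward model p(context | response). The sketch below is a minimal, hypothetical illustration of that scoring rule, not the paper's implementation; the candidate strings, probabilities, and the weight `lam` are all made up for demonstration.

```python
import math

def mmi_score(log_p_forward: float, log_p_backward: float, lam: float = 0.5) -> float:
    """MMI-style interpolation of forward and backward log-probabilities.

    log_p_forward  ~ log p(response | context)
    log_p_backward ~ log p(context | response)
    lam            ~ interpolation weight (hypothetical value)
    """
    return (1 - lam) * log_p_forward + lam * log_p_backward

# Rerank candidate responses: a generic reply may have high forward
# probability but a low backward probability (it explains the context
# poorly), so the MMI score demotes it.
candidates = [
    # (response, log p(r|c), log p(c|r)) -- toy numbers
    ("sure, the museum opens at nine", math.log(0.30), math.log(0.20)),
    ("i don't know",                   math.log(0.40), math.log(0.05)),
]
best = max(candidates, key=lambda c: mmi_score(c[1], c[2]))
print(best[0])  # → sure, the museum opens at nine
```

Here the bland reply wins under the forward model alone but loses once the backward term is added, which is exactly the effect the bi-directional objective is meant to have.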