Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1540
|View full text |Cite
|
Sign up to set email alerts
|

Ordinal and Attribute Aware Response Generation in a Multimodal Dialogue System

Abstract: Multimodal dialogue systems have opened new frontiers in the traditional goal-oriented dialogue systems. The state-of-the-art dialogue systems are primarily based on unimodal sources, predominantly the text, and hence cannot capture the information present in the other sources such as videos, audios, images etc. With the availability of large scale multimodal dialogue dataset (MMD) (Saha et al., 2018) on the fashion domain, the visual appearance of the products is essential for understanding the intention of t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
28
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 29 publications
(28 citation statements)
references
References 29 publications
(16 reference statements)
0
28
0
Order By: Relevance
“…The user utterances play a significant role in building the dialogue context for generating coherent and appropriate responses, in accordance to the user demands. We focus on generating the textual responses only in a similar manner as [8,12,13]. Here, the task of multi-modal dialog generation is defined as follows: we consider both the modalities, i.e.…”
Section: Plos Onementioning
confidence: 99%
See 4 more Smart Citations
“…The user utterances play a significant role in building the dialogue context for generating coherent and appropriate responses, in accordance to the user demands. We focus on generating the textual responses only in a similar manner as [8,12,13]. Here, the task of multi-modal dialog generation is defined as follows: we consider both the modalities, i.e.…”
Section: Plos Onementioning
confidence: 99%
“…Earlier works on the MMD dataset reported in [12,13,44] used the hierarchical encoder-decoder model to generate responses by capturing information from text, images and the knowledge base. Recently, [8] proposed attribute and position-aware attention for generating textual responses. The authors in [45] used an hierarchical attention mechanism for generating responses on the MMD dataset.…”
Section: Multimodal Dialogue Systemsmentioning
confidence: 99%
See 3 more Smart Citations