Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence 2021
DOI: 10.24963/ijcai.2021/548
|View full text |Cite
|
Sign up to set email alerts
|

UniMF: A Unified Framework to Incorporate Multimodal Knowledge Bases intoEnd-to-End Task-Oriented Dialogue Systems

Abstract: Knowledge bases (KBs) are usually essential for building practical dialogue systems. Recently we have seen rapidly growing interest in integrating knowledge bases into dialogue systems. However, existing approaches mostly deal with knowledge bases of a single modality, typically textual information. As today's knowledge bases become abundant with multimodal information such as images, audios and videos, the limitation of existing approaches greatly hinders the development of dialogue systems. In this paper, we… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 16 publications
(3 citation statements)
references
References 2 publications
(2 reference statements)
0
3
0
Order By: Relevance
“…There has been some effort to incorporate knowledge broader than what can be learned from the training dataset itself. One particular area of application is visual dialogue, where external knowledge bases have been proposed [160],…”
Section: Directions For Future Researchmentioning
confidence: 99%
“…There has been some effort to incorporate knowledge broader than what can be learned from the training dataset itself. One particular area of application is visual dialogue, where external knowledge bases have been proposed [160],…”
Section: Directions For Future Researchmentioning
confidence: 99%
“…W. Wei et al [28] think dialogue reading comprehension is also used for intelligent human-computer interaction systems. Compared to the more mature two-party dialogue MRC [29], [30], one would expect applications such as dialogue systems to be able to handle more complex multi-party dialogue MRC. Due to the excellent performance of PrLMs in text-level NLP tasks (Section II-A), PrLMs have been widely used in the processing of multi-party dialogue MRC in earlier studies [31], [32].…”
Section: B Transformers For Learning Dialoguementioning
confidence: 99%
“…Since knowledge plays a vital role in the response generation of task-oriented dialog systems, we first conduct the knowledge acquisition for the given multimodal context. Considering the semantic knowledge is pivotal to capturing the user's intentions [33,36,37], we focus on selecting two kinds of semantic knowledge: attribute knowledge and relation knowledge. Thereinto, the attribute knowledge, which is widely used, refers to the attribute-value pairs of entities mentioned directly in the context.…”
Section: Dual Semantic Knowledge Acquisitionmentioning
confidence: 99%