2021
DOI: 10.1609/aaai.v35i15.17625
|View full text |Cite
|
Sign up to set email alerts
|

DialogXL: All-in-One XLNet for Multi-Party Conversation Emotion Recognition

Abstract: This paper presents our pioneering effort for emotion recognition in conversation (ERC) with pre-trained language models. Unlike regular documents, conversational utterances appear alternately from different parties and are usually organized as hierarchical structures in previous work. Such structures are not conducive to the application of pre-trained language models such as XLNet. To address this issue, we propose an all-in-one XLNet model, namely DialogXL, with enhanced memory to store longer historical con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 104 publications
(44 citation statements)
references
References 26 publications
(74 reference statements)
0
23
0
Order By: Relevance
“…Considering the inter-speaker dependencies, DialogueRNN [29] proposes a global state RNN to model multi-party relations and emotional dynamics. Another branch of work leverages the strong context modelling ability of Transformer-based networks to model the dialogue as a whole [10], [30], [31]. To introduce more interpretable structures, there are also many works [32], [33], [34] that construct a graph on the dialogue, and devise graph neural networks to model ERC as a node-classification task.…”
Section: Emotion Recognition In Conversationsmentioning
confidence: 99%
See 2 more Smart Citations
“…Considering the inter-speaker dependencies, DialogueRNN [29] proposes a global state RNN to model multi-party relations and emotional dynamics. Another branch of work leverages the strong context modelling ability of Transformer-based networks to model the dialogue as a whole [10], [30], [31]. To introduce more interpretable structures, there are also many works [32], [33], [34] that construct a graph on the dialogue, and devise graph neural networks to model ERC as a node-classification task.…”
Section: Emotion Recognition In Conversationsmentioning
confidence: 99%
“…DialogXL [10]: This PLM-based work uses dialog-aware self-attention to model inter-and intra-speaker dependencies, and utilises utterance recurrence to model long-range contexts.…”
Section: Baselinesmentioning
confidence: 99%
See 1 more Smart Citation
“…• Text modality: The model selected to process the text obtained from the audio transcription was based on the DialogXL, proposed in [57], which has obtained state of the art results when compared to other text processing models [58]. The output of the DialogXL was then passed to a neural network where each layer contained dialog-aware self-attention and utterance recurrence components.…”
Section: ) Individual Modality Trainingmentioning
confidence: 99%
“…Moreover, there is variability among speakers, their body language, and facial expressions (Kacur et al, 2021 ). Emotion recognition can greatly promote the integration and development of many different disciplines, such as graphics and image processing, artificial intelligence, human-computer interaction and psychology (Jiang and Yin, 2021 ; Shen et al, 2021 ). In human-computer interaction scenes with many different modes, the combination of emotion, posture, sound and other modes can make human-computer interaction experience more real.…”
Section: Introductionmentioning
confidence: 99%