2022
DOI: 10.1016/j.knosys.2021.107751
|View full text |Cite
|
Sign up to set email alerts
|

A multi-view network for real-time emotion recognition in conversations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(13 citation statements)
references
References 36 publications
0
13
0
Order By: Relevance
“…It has gained significant popularity due to numerous applications. Existing literature suggests that a wide range of deep learning methods have been applied to address the Emotion Recognition in Conversation (ERC) task [25], [26], [27], [28], [29], [30], [31], [32], [33], [34], [35], [36], [37]. ICON [25] used a memory network architecture to model the interaction between self and inter-speaker states in two-party conversations.…”
Section: Related Workmentioning
confidence: 99%
“…It has gained significant popularity due to numerous applications. Existing literature suggests that a wide range of deep learning methods have been applied to address the Emotion Recognition in Conversation (ERC) task [25], [26], [27], [28], [29], [30], [31], [32], [33], [34], [35], [36], [37]. ICON [25] used a memory network architecture to model the interaction between self and inter-speaker states in two-party conversations.…”
Section: Related Workmentioning
confidence: 99%
“…AGHMN [23] uses a hierarchical memory network to enhance utterance representations and introduce an attention GRU to model contextual information. MVN [11] utilizes a multi-view network to model word-and utterancelevel dependencies in a conversation. In contrast, speakerdependent methods model both context-and speaker-sensitive dependencies.…”
Section: A Emotion Recognition In Conversationsmentioning
confidence: 99%
“…Existing mainstream works on ERC can generally be categorized into sequence-and graph-based methods. Sequencebased methods [4]- [11] use recurrent neural networks or transformers to model long-distance contextual information in a conversation. In contrast, graph-based methods [12]- [15] design graph structures for conversations and then use graph neural networks to capture multiple dependencies.…”
Section: Introductionmentioning
confidence: 99%
“…We propose a new loss function paradigm covering the pixel loss, structural similarity loss and gradient loss. The new loss function paradigm L total is shown in Equation (5).…”
Section: Loss Function and Evaluation Parametermentioning
confidence: 99%
“…Multi-view learning and multimodal fusion have been widely applied in many fields, including image segmentation [2], target tracking [3], object detection [4], behaviour and emotion recognition [5,6], multi-view question answering [7]. The above research fields and results provide a certain reference for multi-view image fusion technology in industry.…”
Section: Introductionmentioning
confidence: 99%