2021
DOI: 10.1609/aaai.v35i16.17686
|View full text |Cite
|
Sign up to set email alerts
|

Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing

Abstract: As an important research issue in affective computing community, multi-modal emotion recognition has become a hot topic in the last few years. However, almost all existing studies perform multiple binary classification for each emotion with focus on complete time series data. In this paper, we focus on multi-modal emotion recognition in a multi-label scenario. In this scenario, we consider not only the label-to-label dependency, but also the feature-to-label and modality-to-label dependencies. Particularly, we… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(4 citation statements)
references
References 34 publications
0
4
0
Order By: Relevance
“…[19] employed adversarial learning and combined adversarial loss and multi-label supervised loss to achieve multi-label emotion tagging for video data. [20] exploited Graph Neural Networks (GNNs) with heterogeneous hierarchical message passing for multi-modal multi-label emotion classification with textual, visual and acoustic modalities. However, these studies all belong to coarse-grained emotion classification which involve only 12 emotion categories at most [20].…”
Section: B Multi-label Emotion Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…[19] employed adversarial learning and combined adversarial loss and multi-label supervised loss to achieve multi-label emotion tagging for video data. [20] exploited Graph Neural Networks (GNNs) with heterogeneous hierarchical message passing for multi-modal multi-label emotion classification with textual, visual and acoustic modalities. However, these studies all belong to coarse-grained emotion classification which involve only 12 emotion categories at most [20].…”
Section: B Multi-label Emotion Classificationmentioning
confidence: 99%
“…[20] exploited Graph Neural Networks (GNNs) with heterogeneous hierarchical message passing for multi-modal multi-label emotion classification with textual, visual and acoustic modalities. However, these studies all belong to coarse-grained emotion classification which involve only 12 emotion categories at most [20]. It is difficult for these studies to truly reflect the complex emotion categories of human.…”
Section: B Multi-label Emotion Classificationmentioning
confidence: 99%
“…Aiming at facial expression recognition, Li and Deng [33] proposed a new deep manifold learning network to learn discriminative features of multi-label expressions. Moreover, Zhang et al [34] focused on multi-modal emotion recognition in a multi-label scenario.…”
Section: Multi-label Emotion Classificationmentioning
confidence: 99%
“…To improve the decoding efficiency of autoregressive transformers, bidirectional generative transformers have been proposed [5,13,56]. Contrary to autoregressive models that predict a single consecutive token at each step, a bidirectional transformer learns to predict multiple masked tokens at once based on the previously generated context.…”
Section: Bidirectional Transformersmentioning
confidence: 99%