Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
DOI: 10.18653/v1/d18-1382

Contextual Inter-modal Attention for Multi-modal Sentiment Analysis

Abstract: Multi-modal sentiment analysis offers various challenges, one being the effective combination of different input modalities, namely text, visual and acoustic. In this paper, we propose a recurrent neural network based multi-modal attention framework that leverages the contextual information for utterance-level sentiment prediction. The proposed approach applies attention on multi-modal multi-utterance representations and tries to learn the contributing features amongst them. We evaluate our proposed approach o…
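The abstract describes a recurrent, attention-based fusion of text, visual and acoustic utterance sequences. Purely as a rough illustration of that idea, the sketch below wires up per-modality bi-GRUs followed by pairwise inter-modal attention in PyTorch; the feature dimensions, the element-wise gating and the final concatenation are assumptions for this sketch, not the authors' released implementation.

```python
# Minimal sketch (not the authors' code) of contextual inter-modal attention:
# a bi-GRU gives each modality a context-aware utterance sequence, then every
# modality pair is fused with a soft attention over utterances.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ContextualInterModalAttention(nn.Module):
    def __init__(self, d_text=300, d_visual=35, d_acoustic=74,
                 d_hidden=100, n_classes=2):
        super().__init__()
        # One bi-GRU per modality captures inter-utterance context.
        self.gru_t = nn.GRU(d_text, d_hidden, batch_first=True, bidirectional=True)
        self.gru_v = nn.GRU(d_visual, d_hidden, batch_first=True, bidirectional=True)
        self.gru_a = nn.GRU(d_acoustic, d_hidden, batch_first=True, bidirectional=True)
        d = 2 * d_hidden
        # Classifier over the concatenated pairwise-attended representations.
        self.out = nn.Linear(6 * d, n_classes)

    def _pair_attention(self, x, y):
        # Cross-modal attention: score every utterance of x against every
        # utterance of y, then re-read each modality through the other.
        scores = torch.matmul(x, y.transpose(1, 2))                 # (B, u, u)
        a_xy = torch.matmul(F.softmax(scores, dim=-1), y)            # x attends over y
        a_yx = torch.matmul(F.softmax(scores.transpose(1, 2), dim=-1), x)
        return a_xy * x, a_yx * y                                    # element-wise gating

    def forward(self, text, visual, acoustic):
        # Inputs: (batch, utterances, feature_dim) per modality.
        ht, _ = self.gru_t(text)
        hv, _ = self.gru_v(visual)
        ha, _ = self.gru_a(acoustic)
        tv, vt = self._pair_attention(ht, hv)
        ta, at = self._pair_attention(ht, ha)
        va, av = self._pair_attention(hv, ha)
        fused = torch.cat([tv, vt, ta, at, va, av], dim=-1)          # (B, u, 6*d)
        return self.out(fused)                                       # per-utterance logits
```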

Cited by 123 publications (88 citation statements)
References 22 publications
“…Multimodal sentiment analysis provides an opportunity to learn interactions between different modalities. Similar to the inter-modal attention approach of Ghosal et al. [10], we propose a method to learn cross-interaction vectors. For a pair of text ($H_T$) and video ($H_V$) modalities, the co-attention matrix ($M_{TV} \in \mathbb{R}^{u \times u}$) can be defined as:…”
Section: Cross Attention
confidence: 99%
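The quoted definition of the co-attention matrix is cut off above. As an illustration of the general construction only, the snippet below builds a u×u co-attention matrix from two contextual utterance sequences via a softmax over pairwise dot products; the shapes and the normalisation are assumptions, not the exact formula from the cited work.

```python
# Hedged sketch: one common way a co-attention matrix over u utterances is
# formed from text and video sequences H_T, H_V in R^{u x d}.
import torch
import torch.nn.functional as F

u, d = 20, 200                            # assumed: utterances per video, hidden size
H_T = torch.randn(u, d)                   # contextual text representations
H_V = torch.randn(u, d)                   # contextual video representations

M_TV = F.softmax(H_T @ H_V.T, dim=-1)     # (u, u) co-attention matrix
attended_T = M_TV @ H_V                   # text re-read through the video modality
```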
“…In our experiments, we used the same features as mentioned in Ghosal et al. [10]. Specifically, for the CMU-MOSEI dataset, we used GloVe embeddings for word features, Facet for visual features and COVAREP [16] for acoustic features.…”
Section: Implementation Details
confidence: 99%
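As a small, self-contained illustration of that feature setup, the snippet below mocks per-utterance CMU-MOSEI inputs. The utterance count and feature dimensions (300-d GloVe, 35-d Facet, 74-d COVAREP) are assumptions made for this sketch, not values quoted from the cited work.

```python
# Hedged sketch of the per-utterance multimodal feature layout described above.
import numpy as np

n_utt = 20                                # assumed utterances per video
text = np.random.randn(n_utt, 300)        # averaged GloVe word embeddings
visual = np.random.randn(n_utt, 35)       # Facet facial-expression features
acoustic = np.random.randn(n_utt, 74)     # COVAREP acoustic features

# Each modality goes to its own recurrent encoder; only the utterance axis
# needs to agree across modalities.
assert text.shape[0] == visual.shape[0] == acoustic.shape[0]
```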