Cm-Bert

Yang, Kaicheng; Xu, Hua; Gao, Kai

doi:10.1145/3394171.3413690

Cited by 73 publications

(10 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In line with similar conclusions from psychology literature surrounding the importance of paralinguistic cues in the communication process, there are a number of studies employing sentiment analysis techniques that suggest a combination of text and audio data may improve classification accuracy, and consequently create a more robust representation of sentiment (Bhaskar et al, 2014; Dair et al, 2021; Houjeij et al, 2012; Yang et al, 2020). Hence, given that prior literature suggests both textual and vocal characteristics of earnings calls to be informative, and that Natural Language Processing literature finds a combination of text and audio to significantly increase classification accuracy, the adhesion of both measures represents a natural future direction for the literature.…”

Section: Discussionmentioning

confidence: 57%

“…The gold standard for multimodal sentiment analysis is considered to be the combination of all three communication modalities-text, audio and visual. Various studies have used the combination of all three modalities to define sentiment, showing that the use of a tri-modality model is more robust at classifying sentiment over bi-modal and singular modality models (Bhaskar et al, 2014;Dair et al, 2021;Houjeij et al, 2012;Morency et al, 2011;Poria et al, 2015;Yang et al, 2020). 44 The main advantage of using multimodal classifiers for sentiment classification is the additional behavioural cues provided by the visual and audio data.…”

Section: Multimodal Analysismentioning

confidence: 99%

“…They note that there is a scarcity of advanced NLP techniques being applied in the financial domain 32 . While the ML techniques discussed in the previous section have been shown to classify financial sentiment better than more rudimentary approaches, alternative approaches such as transformer architecture (Alamoudi & Alghamdi, 2021; Munikar et al, 2019; Sun et al, 2019) and multimodal analysis (Bhaskar et al, 2014; Dair et al, 2021; Houjeij et al, 2012; Yang et al, 2020) have been demonstrated as having greater abilities in accurately capturing sentiment.…”

Section: Applications Of Sentiment Analysis In Financementioning

confidence: 99%

See 2 more Smart Citations

Text‐based sentiment analysis in finance: Synthesising the existing literature and exploring future directions

Todd,

Bowden,

Moshfeghi

2024

Intell Sys Acc Fin Mgmt

View full text Add to dashboard Cite

SummaryAdvances in Deep Learning have drastically improved the abilities of Natural Language Processing (NLP) research, creating new state‐of‐the‐art benchmarks. Two research streams at the forefront of NLP analysis are transformer architecture and multimodal analysis. This paper critically evaluates the extant literature applying sentiment analysis techniques to the financial domain. We classify the financial sentiment analysis literature according to the most used techniques in the area, with a focus on methods used to detect sentiment within corporate earnings conference calls, because of their dual modality (text‐audio) nature. We find that the financial literature follows a similar path to NLP sentiment literature, in that more advanced techniques to define sentiment are being used as the field progresses. However, techniques used to determine financial sentiment currently fall behind state‐of‐the‐art techniques used within NLP. Two future directions stem from this paper. Firstly, we propose that the adoption of transformer architecture to create robust representations of textual data could enhance sentiment analysis in academic finance. Secondly, the adoption of multimodal classifiers in finance represents a new, currently underexplored area of study that offers opportunities for finance research.

show abstract

Section: Discussionmentioning

confidence: 57%

Section: Multimodal Analysismentioning

confidence: 99%

Section: Applications Of Sentiment Analysis In Financementioning

confidence: 99%

See 1 more Smart Citation

Text‐based sentiment analysis in finance: Synthesising the existing literature and exploring future directions

Todd,

Bowden,

Moshfeghi

2024

Intell Sys Acc Fin Mgmt

View full text Add to dashboard Cite

show abstract

“…Researchers have been working on different combinations of text, audio, and image modalities that can enhance prediction accuracy [10]. This area of study focuses on various methods of integrating multimodal information, primarily through feature fusion and decision fusion, as outlined in several studies [22][12] [5]. The choice of method often depends on the specific application and the nature of the data being analyzed.…”

Section: Introductionmentioning

confidence: 99%

Integrative Sentiment Analysis: Leveraging Audio, Visual, and Textual Data

S. Chu,

Ghanta

2024

AI, Machine Learning and Applications

View full text Add to dashboard Cite

Exploring the area of multimodal sentiment analysis, this paper addresses the growing significance of this field, driven by the exponential rise in multimodal data across platforms like YouTube. Traditional sentiment analysis, primarily focused on textual data, often overlooks the complexities and nuances of human emotions conveyed through audio and visual cues. Addressing this gap, our study explores a comprehensive approach that integrates data from text, audio, and images, applying state-of-the-art machine learning and deep learning techniques tailored to each modality. Our methodology is tested on the CMU-MOSEI dataset, a multimodal collection from YouTube, offering a diverse range of human sentiments. Our research highlights the limitations of conventional text-based sentiment analysis, especially in the context of the intricate expressions of sentiment that multimodal data encapsulates. By fusing audio and visual information with textual analysis, we aim to capture a more complete spectrum of human emotions. Our experimental results demonstrate notable improvements in precision, recall and accuracy for emotion prediction, validating the efficacy of our multimodal approach over single-modality methods. This study not only contributes to the ongoing advancements in sentiment analysis but also underscores the potential of multimodal approaches in providing more accurate and nuanced interpretations of human emotions.

show abstract

“…Emotion recognition in conversations (ERC) is vital and very challenging in the natural human machine interaction [1], intelligent education tutoring [2], and mental health analysis applications [3]. In daily life, humans utter a multi-turn conversation in a natural way which conveys emotion state through language and nonverbal content (e.g., facial expression and body language) [4].…”

Section: Introductionmentioning

confidence: 99%

cross-modal fusion techniques for utterance-level emotion recognition from text and speech

Luo¹,

Phan²,

Reiss³

2023

Preprint

View full text Add to dashboard Cite

Multimodal emotion recognition (MER) is a fundamental complex research problem due to the uncertainty of human emotional expression and the heterogeneity gap between different modalities. Audio and text modalities are particularly important for a human participant in understanding emotions. Although many successful attempts have been designed multimodal representations for MER, there still exist multiple challenges to be addressed: 1) bridging the heterogeneity gap between multimodal features and model inter-and intramodal interactions of multiple modalities; 2) effectively and efficiently modeling the contextual dynamics in the conversation sequence. In this paper, we propose Cross-Modal RoBERTa (CM-RoBERTa) model for emotion detection from spoken audio and corresponding transcripts. As the core unit of the CM-RoBERTa, parallel self-and cross-attention is designed to dynamically capture inter-and intra-modal interactions of audio and text. Specially, the mid-level fusion and residual module are employed to model longterm contextual dependencies and learn modality-specific patterns. We evaluate the approach on the MELD dataset and the experimental results show the proposed approach achieves the state-of-art performance on the dataset.

show abstract

Cm-Bert

Cited by 73 publications

References 30 publications

Text‐based sentiment analysis in finance: Synthesising the existing literature and exploring future directions

Text‐based sentiment analysis in finance: Synthesising the existing literature and exploring future directions

Integrative Sentiment Analysis: Leveraging Audio, Visual, and Textual Data

cross-modal fusion techniques for utterance-level emotion recognition from text and speech

Contact Info

Product

Resources

About