Effective emotion recognition in movie audio tracks

Kotti, Margarita; Stylianou, Yannis

doi:10.1109/icassp.2017.7953132

“…Audio-based emotion recognition exploits features such as pitch, intensity, energy, and MFCCs (Mel-frequency cepstrum coefficients). Challenges in speech-based emotion recognition are that expression differs among subjects [9] and is directly influenced by age, culture, and external factors such as the environment [10]. Video-based systems extract emotions from features such as facial expression, mouth, or eye shape [4].…”

Section: Related Work

confidence: 99%

Real-time Emotion Recognition for Sales

Naas, Sigg

2020

2020 16th International Conference on Mobility, Sensing and Networking (MSN)


Positive emotion is a pre-condition to any sales contract. Likewise, the ability to perceive the emotions of a customer impacts sales performance. To support emotional perception in buyer-seller interactions, we propose an audio-visual emotion recognition system that can recognize eight emotions: neutral, calm, sad, happy, angry, fearful, surprised, and disgusted. We reduced noise in audio samples and applied transfer learning for image feature extraction based on the pre-trained deep neural network VGG16. We obtained audio emotion-recognition accuracies of 62.51% and 68% and video emotion-recognition accuracies of 97.13% and 97.77% on the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) and Surrey Audio-Visual Expressed Emotion (SAVEE) datasets, respectively. For the combination of the two models, our proposed merging mechanism without re-training achieved an accuracy of close to 100% on both datasets. Finally, we demonstrated our system for a customer satisfaction use case in a real customer-to-salesperson interaction using audio and video models, achieving an average accuracy of 78%.
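The abstract does not say how the audio and video models are merged, only that no re-training is involved. One common way to combine two already-trained classifiers is late fusion: a weighted average of their per-class probabilities. The sketch below illustrates that general idea and is not the authors' mechanism; the function names, the `audio_weight` parameter, and the fusion rule itself are assumptions made for illustration.

```python
# Eight emotion labels as listed in the abstract.
EMOTIONS = ["neutral", "calm", "sad", "happy",
            "angry", "fearful", "surprised", "disgusted"]


def fuse_probabilities(audio_probs, video_probs, audio_weight=0.5):
    """Weighted average of two per-class probability vectors.

    Hypothetical fusion rule: neither model is re-trained, only their
    outputs are combined. audio_weight is an assumed tuning parameter.
    """
    if len(audio_probs) != len(video_probs):
        raise ValueError("both models must score the same set of classes")
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    video_weight = 1.0 - audio_weight
    fused = [audio_weight * a + video_weight * v
             for a, v in zip(audio_probs, video_probs)]
    total = sum(fused)
    if total <= 0.0:
        raise ValueError("fused scores must have a positive sum")
    # Renormalize so the result is again a probability distribution.
    return [p / total for p in fused]


def predict_emotion(audio_probs, video_probs, audio_weight=0.5):
    """Return the label with the highest fused probability."""
    fused = fuse_probabilities(audio_probs, video_probs, audio_weight)
    best_index = max(range(len(fused)), key=fused.__getitem__)
    return EMOTIONS[best_index]
```

Because the reported video accuracy is much higher than the audio accuracy, a weighting that favors the video model (a small `audio_weight`) would be a natural starting point, though the right value would have to be chosen on held-out data.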


“…the frame size is size change 1ms effect noticeable changes in speech emotion recognition. [13] Focus on the emotion to differ between the angry, happy, and neutral. We extract the feature and subset which is not commonly used in the emotion recognition task.…”

Section: Audio Analysis And

confidence: 99%

Design & Development of Network Geo-Fencing Model for User Monitoring and it’s Alertness in a Security Applications

Barapatre, Deshmukh

2019

IJRAT


Emotion plays a vital role in communication: emotions and gestures play an 80% role in communication. Emotion recognition and classification are now used in many areas to understand human feelings, including robotics, health care, the military, home automation, hands-free computing, mobile telephony, video games, call-center systems, and marketing. Speech emotion recognition (SER) can improve interaction between machines and humans. Various algorithms, and combinations of algorithms, are available to recognize and classify audio according to its emotion. In this paper, we investigate significant prior works, their techniques, the impact of the approaches, and the scope for improving the results.


“…The random forest has predicted the correct emotion and gives the highest accuracy of 81.5%. Margarita Kotti [13] in 2017: The author introduces a method to recognize the emotion from movies and drama clips. Focus on the emotion to differ between the angry, happy, and neutral. We extract the feature and subset which is not mostly used in the emotion recognition task.…”

Section: Audio Analysis And

confidence: 99%

Audio Analysis and Classification: A Review

Mohammad, Tripathi

2019

IJRAT


Emotion plays a vital role in communication: emotions and gestures play an 80% role in communication. Emotion recognition and classification are now used in many areas to understand human feelings, including robotics, health care, the military, home automation, hands-free computing, mobile telephony, video games, call-center systems, and marketing. Speech emotion recognition (SER) can improve interaction between machines and humans. Various algorithms, and combinations of algorithms, are available to recognize and classify audio according to its emotion. In this paper, we investigate significant prior works, their techniques, the impact of the approaches, and the scope for improving the results.


Effective emotion recognition in movie audio tracks

Cited by 10 publications

References 26 publications
