2020
DOI: 10.1007/s00779-020-01389-0
|View full text |Cite
|
Sign up to set email alerts
|

Supervised machine learning for audio emotion recognition

Abstract: The field of Music Emotion Recognition has become and established research sub-domain of Music Information Retrieval. Less attention has been directed towards the counterpart domain of Audio Emotion Recognition, which focuses upon detection of emotional stimuli resulting from non-musical sound. By better understanding how sounds provoke emotional responses in an audience, it may be possible to enhance the work of sound designers. The work in this paper uses the International Affective Digital Sounds set. A tot… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
23
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 39 publications
(23 citation statements)
references
References 50 publications
0
23
0
Order By: Relevance
“…Ntalampiras [52] compared emotion prediction using two CNNs that were designed to individually predict arousal and valence. The authors used the EmoSoundscapes data.…”
Section: Sound Emotion Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…Ntalampiras [52] compared emotion prediction using two CNNs that were designed to individually predict arousal and valence. The authors used the EmoSoundscapes data.…”
Section: Sound Emotion Recognitionmentioning
confidence: 99%
“…Part of the work performed by Ntalampiras [52] was on the EmoSoundscape dataset, and they used CNN models. The MSE values reported for arousal and valence prediction were around 0.049 and 0.11, respectively, which are equivalent to 0.22 and 0.33 for RMSE.…”
Section: Performance Of Prediction Modelsmentioning
confidence: 99%
“…23,24 Using properties of sound as features and experience measures as labels for those features, several groups have attempted to build machine learning algorithms that can predict emotional responses based on the sound properties alone, commonly according to a valence-arousal circumplex model. 25,26,27,28,29 However results in this area remain mixed due to lack of sufficiently high dimensional measurement and modeling tools suitable for capturing the fast changes in human experience that accompany changes in sound. 30,31…”
Section: A Effects Of Sound On Human Experiencementioning
confidence: 99%
“…Upon connecting all three lasers, a green LED lights up, and a 'success' sound is emitted, which gives positive emotional feedback to the user through an ascending harmonic sequence of tones. This emotional feedback can be understood as an affective sound design (see Cunningham et al 2020).…”
Section: Laser Puzzlementioning
confidence: 99%