2019 29th International Conference Radioelektronika (RADIOELEKTRONIKA) 2019
DOI: 10.1109/radioelek.2019.8733572
|View full text |Cite
|
Sign up to set email alerts
|

DNN Based Music Emotion Recognition from Raw Audio Signal

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
16
0
4

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(20 citation statements)
references
References 10 publications
0
16
0
4
Order By: Relevance
“…The spectrogram was a handcrafted magnitude-only representation without phase information. Orjesek et al [ 36 ] addressed this problem by using a raw waveform input for their classifier. Our study used both the real (magnitude) and imaginary (phase angle) information from audio for emotion classification because several studies [ 37 , 38 , 39 ] have demonstrated that phase information improves the performance of both speech and music processing.…”
Section: Related Workmentioning
confidence: 99%
“…The spectrogram was a handcrafted magnitude-only representation without phase information. Orjesek et al [ 36 ] addressed this problem by using a raw waveform input for their classifier. Our study used both the real (magnitude) and imaginary (phase angle) information from audio for emotion classification because several studies [ 37 , 38 , 39 ] have demonstrated that phase information improves the performance of both speech and music processing.…”
Section: Related Workmentioning
confidence: 99%
“…Representation methods can be divided into the preparation of raw sound samples, 2D representations of music (e.g., spectrograms, cepstrograms, chromagrams, etc.) [7,29,30] and musical signal parametric form, i.e., feature vector (e.g., a vector of mel-cepstral coefficients or MPEG-7-based parameters) [14,15,17].…”
Section: Emotion Classificationmentioning
confidence: 99%
“…An approach of training a neural network with data obtained from raw, unprocessed sound was proposed by Orjesek et al [17]. The algorithm used convolutional network layers connected with layers of a recursive neural network.…”
Section: Emotion Classificationmentioning
confidence: 99%
See 2 more Smart Citations