2020 Third International Conference on Advances in Electronics, Computers and Communications (ICAECC) 2020
DOI: 10.1109/icaecc50550.2020.9339502
|View full text |Cite
|
Sign up to set email alerts
|

Spectral Features for Emotional Speaker Recognition

Abstract: Speaker recognition in an emotive environment is a bit challenging task because of influence of emotions in a speech. Identifying the speaker from the speech can be done by analyzing the features of the speech signal. In normal conditions, identifying a speaker is not a tedious task. Whereas, identifying the speaker in an emotional environment such as happy, sad, anger, surprise, sarcastic, fear etc. is really challenging, since speech becomes altered under emotions and noise. The spectral features of speech s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(13 citation statements)
references
References 18 publications
0
13
0
Order By: Relevance
“…This feature [ 24 ] is referred as global energy of audio signal, which is estimated by, where, defines signal amplitude at amplitude, symbolizes quantity of frames in sample length, and specifies root mean square feature.…”
Section: Developed Covid-19 Detection Model Based On Hybrid Optimizationmentioning
confidence: 99%
See 1 more Smart Citation
“…This feature [ 24 ] is referred as global energy of audio signal, which is estimated by, where, defines signal amplitude at amplitude, symbolizes quantity of frames in sample length, and specifies root mean square feature.…”
Section: Developed Covid-19 Detection Model Based On Hybrid Optimizationmentioning
confidence: 99%
“…This feature defines the ratio of quantity of times the audio sample alters the value from negative to positive or else positive to negative to frame dimension [ 24 ]. The zero-crossing rate feature is denoted as .…”
Section: Developed Covid-19 Detection Model Based On Hybrid Optimizationmentioning
confidence: 99%
“…The input features for deep-learning-based SER models are generally extracted from the time or spectrum axis in units of speech segments or frames. There are various LLDs and high-level statistical functions of the LLD single features [19,20,[31][32][33]. The spectrum LLD features of speech signals include logMel filter-banks and mel-frequency cepstral coefficients (MFCC).…”
Section: Related Workmentioning
confidence: 99%
“…The spectrum LLD features of speech signals include logMel filter-banks and mel-frequency cepstral coefficients (MFCC). Zero-crossing rates and signal energies are representative time-domain features [27][28][29][30], whereas spectral roll-off and spectral centroid are classified as spectral parameters [33]. A set of multiple single features for acoustic signal processing, such as the extended Geneva Minimalistic Acoustic Parameter Set [34] and the INTERSPEECH 2010 Paralinguistic Challenge (IS10) dataset [35], is now accessible from open-source frameworks, such as OpenSmile [36].…”
Section: Related Workmentioning
confidence: 99%
“…In this paper [21], the speaker is recognized in the emotional environment. Spectral features are extracted from the data and are classified.…”
Section: Related Workmentioning
confidence: 99%