2019
DOI: 10.32628/ijsrset196110
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid Approach to Gender Classification using Speech Signal

Abstract: Speech forms a significant means of communication and the variation in pitch of a speech signal of a gender is commonly used to classify gender as male or female. In this study, we propose a system for gender classification from speech by combining hybrid model of 1-D Stationary Wavelet Transform (SWT) and artificial neural network. Features such as power spectral density, frequency, and amplitude of human voice samples were used to classify the gender. We use Daubechies wavelet transform at different levels f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 7 publications
0
2
0
Order By: Relevance
“…The classification of gender and age from direct speech signals can be logically related to the time or frequency domains. In the time domain analysis, we directly measure the speech signals considering the content of a signal for evaluating information regarding a speaker; on the other hand, in the frequency domain analysis, the frequency content of a speech signal is used to form a spectrum for evaluating information regarding a speaker, which is analyzed accordingly [ 17 , 18 ]. The smart gender-age recognition system used the main variation in the levels of power and the frequency content of the two genders to identify the gender-based speech signals [ 9 , 10 ].…”
Section: Literature Reviewmentioning
confidence: 99%
“…The classification of gender and age from direct speech signals can be logically related to the time or frequency domains. In the time domain analysis, we directly measure the speech signals considering the content of a signal for evaluating information regarding a speaker; on the other hand, in the frequency domain analysis, the frequency content of a speech signal is used to form a spectrum for evaluating information regarding a speaker, which is analyzed accordingly [ 17 , 18 ]. The smart gender-age recognition system used the main variation in the levels of power and the frequency content of the two genders to identify the gender-based speech signals [ 9 , 10 ].…”
Section: Literature Reviewmentioning
confidence: 99%
“…Manjunath et al [19] presented an acoustic event classification model using new features extracted from spectrogram blocks. A hybrid model using 1-D Stationary Wavelet Transform (SWT) for signal denoising and reconstruction was given to ANN for gender recognition developed by Yasin et al [20]. Anna V. Kuchebo et al [21] presented a deep learning network to classify gender from speech using the Mel spectrogram of speech of the Mozilla voice dataset.…”
Section: Introductionmentioning
confidence: 99%