2017
DOI: 10.12785/ijcds/060303
|View full text |Cite
|
Sign up to set email alerts
|

On the Performance Degradation of Speaker Recognition System due to Variation in Speech Characteristics Caused by Physiological Changes

Abstract: Speaker recognition is the process of identifying a person using their speech characteristics (voice biometrics). Speech characteristics of an individual can vary due to physiological changes which may be caused by health changes, physical activity as well as emotional changes. Such changes in speech characteristics are likely to affect the accuracy of speaker recognition systems. In this paper, the performance degradation of a speaker recognition system is quantified, empirically, when the characteristics of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
1
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…Lockdown process is valuable because it gives excellent time and scope of testing for a maximum number of patients. Reverse transcription polymerase chain reaction (RT-PCR) is one of the best methods for analyzing and detecting COVID 19 within 48 h (Ghosh et al, 2015(Ghosh et al, , 2016a(Ghosh et al, , 2016bUsman, 2017).…”
Section: Introductionmentioning
confidence: 99%
“…Lockdown process is valuable because it gives excellent time and scope of testing for a maximum number of patients. Reverse transcription polymerase chain reaction (RT-PCR) is one of the best methods for analyzing and detecting COVID 19 within 48 h (Ghosh et al, 2015(Ghosh et al, , 2016a(Ghosh et al, , 2016bUsman, 2017).…”
Section: Introductionmentioning
confidence: 99%
“…The time series are represented by 13 Mel Frequency Cepstral Coefficients (MFCC) (Davis and Mermelstein, 1980), sampled at 11025 Hz. MFCC constitute a widely used feature for tasks such as speech recognition (Usman, 2017). They mimic the transformation of the audio signal by the inner ear and are a model of how sound stimuli are "perceived" by the early neuronal auditory system.…”
Section: Spoken Arabic Digits Datasetmentioning
confidence: 99%
“…MFCC is used because it can represent spectral details of speech signals along with temporal variations in the spectral details. A detailed description of MFCC computation procedure is given in [11], [12]. The MFCC data generated in this work uses a Hamming window of length 256 samples which corresponds to speech frame duration of 16 ms, with 50% overlap between adjacent windows.…”
Section: Speech Features Datamentioning
confidence: 99%