2015
DOI: 10.5120/20312-2362
|View full text |Cite
|
Sign up to set email alerts
|

Voice Recognition using Dynamic Time Warping and Mel-Frequency Cepstral Coefficients Algorithms

Abstract: Voice recognition is an important and active research area of the recent years. This research aims to build a system for voice recognition using dynamic time wrapping algorithm, by comparing the voice signal of the speaker with pre-stored voice signals in the database, and extracting the main features of the speaker voice signal using Mel-frequency cepstral coefficients, which is one of the most important factors in achieving high recognition accuracy. General TermsDynamic time wrapping "DTW" algorithm, Mel-fr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 26 publications
(15 citation statements)
references
References 2 publications
0
15
0
Order By: Relevance
“…The Mel-Frequency Cepstral (MFC) is a well-known method for extracting the speech signal features. A smart combination of the MFC and the Dynamic Time Warping (DTW) techniques can provide effective solutions especially in the case of isolated speech words recognition [23,26,32]. The devised system deals with the isolated speech word.…”
Section: Methodsmentioning
confidence: 99%
“…The Mel-Frequency Cepstral (MFC) is a well-known method for extracting the speech signal features. A smart combination of the MFC and the Dynamic Time Warping (DTW) techniques can provide effective solutions especially in the case of isolated speech words recognition [23,26,32]. The devised system deals with the isolated speech word.…”
Section: Methodsmentioning
confidence: 99%
“…Another method is Perceptual Linear prediction (PLP), which is an analytical model perceptually motivated auditory spectrum by a low order pole function using the autocorrelation LP technique [8,11,12]. The PLP analysis provides similar results as with the LPC analysis, but the order of PLP model is half of the LP model therefore less computational storage [13,14]. PLP sometimes has been slightly better than LPCC, when it comes to noisy environment.…”
Section: Feature Extractionmentioning
confidence: 99%
“…PLP sometimes has been slightly better than LPCC, when it comes to noisy environment. Among those techniques, the most widely used feature extraction methods is Mel frequency Cepstral Coefficient (MFCC) in the field of ASR [8,14]. MFCC provides good discrimination [5] and low correlation between coefficients, but MFCC performance might be affected by the number of filters [10] and does not give accurate results if there are background noise [8].…”
Section: Feature Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…Meanwhile another research used Linear Predictive Coding (LPC) [4], [14], and wavelet [15]. Another research compared MFCC more accurate than LPC with accuracy up to 100% [16], [17] and also more accurate than Dynamic Time Warping (DTW) with average 96% [18].…”
Section: Introductionmentioning
confidence: 99%