A new method for feature extraction is presented in this paper for speech recognition using a combination of discrete wavelet transform (DWT) and mel Frequency Cepstral Coefficients (MFCCs). The objective of this method is to enhance the performance of the proposed method by introducing more features from the signal. The performance of the Wavelet-based mel Frequency Cepstral Coefficients method is compared to mel Frequency Cepstral Coefficients based method for features extraction. Wavelet transform is applied to the speech signal where the input speech signal is decomposed into various frequency channels using the properties of wavelet transform. then Mel-Frequency Cepstral Coefficients (MFCCs) of the wavelet channels are calculated. A new set of features can be generated by concatenating both features. The speech signals are sampled directly from the microphone. Neural Networks (NN) are used in the proposed methods for classification. The proposed method is implemented for 15 male speakers uttering 10 isolated words each which are the digits from zero to nine. each digit is repeated 15 times.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.