2014 First International Conference on Networks &Amp; Soft Computing (ICNSC2014) 2014
DOI: 10.1109/cnsc.2014.6906692
|View full text |Cite
|
Sign up to set email alerts
|

Robust feature extraction methods for speech recognition in noisy environments

Abstract: This paper presents robust feature extraction techniques for isolated word recognition under noisy conditions. The proposed hybrid feature extraction techniques are Bark Frequency Cepstral Coefficients (BFCC) and Weighted Average Mel-Frequency Cepstral Coefficient (WMFCC). Both methods are tested in various noisy environments using a single Gaussian Hidden Markov Model (HMM) based isolated digit recognition system. The results clearly indicates that WMFCC performed well compared to Mel-Frequency Cepstral Coeff… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 3 publications
0
3
0
Order By: Relevance
“…After the windowing process, the FFT is applied and the absolute values it returns are placed in a Mel-filter bank; the log of the filter bank values is calculated and the final MFCC feature vectors are created by applying the discrete cosine transform to each Mel-filter bank. Because it relies on auto-correlaton analysis, MFCC shares the trait with LPC that it is not noise robust [13, p. 358]; although there are many MFCC variants, each with their own improvements and compromises [14]. Other feature extraction techniques include Perceptual Linear Predictive Coefficients (PLP), which are often used in conjunction with RASTA for improved performance; Wavelet-based features; and Linear Predictive Cepstral Coefficients (LPCC), an addition to LPC [13][14][15][16].…”
Section: Feature Extraction Techniques Used In Asr Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…After the windowing process, the FFT is applied and the absolute values it returns are placed in a Mel-filter bank; the log of the filter bank values is calculated and the final MFCC feature vectors are created by applying the discrete cosine transform to each Mel-filter bank. Because it relies on auto-correlaton analysis, MFCC shares the trait with LPC that it is not noise robust [13, p. 358]; although there are many MFCC variants, each with their own improvements and compromises [14]. Other feature extraction techniques include Perceptual Linear Predictive Coefficients (PLP), which are often used in conjunction with RASTA for improved performance; Wavelet-based features; and Linear Predictive Cepstral Coefficients (LPCC), an addition to LPC [13][14][15][16].…”
Section: Feature Extraction Techniques Used In Asr Systemsmentioning
confidence: 99%
“…Because it relies on auto-correlaton analysis, MFCC shares the trait with LPC that it is not noise robust [13, p. 358]; although there are many MFCC variants, each with their own improvements and compromises [14]. Other feature extraction techniques include Perceptual Linear Predictive Coefficients (PLP), which are often used in conjunction with RASTA for improved performance; Wavelet-based features; and Linear Predictive Cepstral Coefficients (LPCC), an addition to LPC [13][14][15][16]. The work performed in [3, p.83] contrasts LPCC and MFCC, demonstrating that LPCC generally results in lower accuracy but has a faster computation rate while MFCC is slower to compute, but often results in improved recognition accuracy.…”
Section: Feature Extraction Techniques Used In Asr Systemsmentioning
confidence: 99%
“…These features are further taken as input to the classification algorithms. In this research, Mel-Frequency Cepstral coefficients (MFCC) are computed from the audio commentary because of its wide usage in speech recognition [8]. In literature, various neural networks have been adopted for classification.…”
Section: Proposed Workmentioning
confidence: 99%