2019
DOI: 10.3390/app9102166

Enhanced Automatic Speech Recognition System Based on Enhancing Power-Normalized Cepstral Coefficients

Abstract: Many new consumer applications are based on the use of automatic speech recognition (ASR) systems, such as voice command interfaces, speech-to-text applications, and data entry processes. Although ASR systems have remarkably improved in recent decades, the speech recognition system performance still significantly degrades in the presence of noisy environments. Developing a robust ASR system that can work in real-world noise and other acoustic distorting conditions is an attractive research topic. Many advanced…

Cited by 22 publications (13 citation statements)
References 25 publications
“…The TIDIGITS (Leonard and Doddington, 1993) (LDC Catalog No. LDC93S10) is a speech corpus of spoken digits for speaker-independent speech recognition (Cooke et al, 2001;Tamazin et al, 2019). The speakers are from different genders (male and female), age ranges (adults and children), dialect districts (Boston, Richmond, Lubbock, etc.).…”
Section: Spike-tidigits and Spike-timit Databases (mentioning)
confidence: 99%
“…Typically, the MFCC and the perceptual linear predictive (PLP) [10] techniques are evaluated as the most widely used techniques in speech and speaker recognition systems. However, the PLP method relative spectral (RASTA) [10] filtering is combined with the feature extraction technique to remove channel noises compared to the speech signal. Recently, the enhanced automatic speech recognition system based on enhancing PNCC has been presented [10].…”
Section: Introduction (mentioning)
confidence: 99%
“…However, the PLP method relative spectral (RASTA) [10] filtering is combined with the feature extraction technique to remove channel noises compared to the speech signal. Recently, the enhanced automatic speech recognition system based on enhancing PNCC has been presented [10]. PNCC also proposes are estimated over a long duration that is commonly used for speech, as well as frequency smoothing.…”
Section: Introduction (mentioning)
confidence: 99%
“…However, this technique was decreased the calculation speech and massively required more computational resources. A modified approach of power normalized cepstral coefficient system by utilizing the large time power and minimizing the channel bias was presented [8]. They intended to increase the noise robustness of the system.…”
Section: Introduction (mentioning)
confidence: 99%