2011 6th Conference on Speech Technology and Human-Computer Dialogue (SpeD) 2011
DOI: 10.1109/sped.2011.5940729
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of feature extraction methods for speech recognition in noise-free and in traffic noise environment

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0
1

Year Published

2011
2011
2022
2022

Publication Types

Select...
5
2
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 16 publications
(7 citation statements)
references
References 5 publications
0
6
0
1
Order By: Relevance
“…However, when this system was evaluated with the handset TIMIT (HTIMIT) Corpus, which is a database of speech data collected over different telephone channels, the accuracy was degraded to 34.4%, owing to the distortions that are present in communication channels. In research [55], two different noise signals: white noise and street noise were considered for the task of word recognition of six languages: English, German, French, Italian, Spanish and Hungarian. The results obtained showed that both PLP and MFCC achieved approximately the same accuracies.…”
Section: Automatic Speech Recognition Systemsmentioning
confidence: 99%
“…However, when this system was evaluated with the handset TIMIT (HTIMIT) Corpus, which is a database of speech data collected over different telephone channels, the accuracy was degraded to 34.4%, owing to the distortions that are present in communication channels. In research [55], two different noise signals: white noise and street noise were considered for the task of word recognition of six languages: English, German, French, Italian, Spanish and Hungarian. The results obtained showed that both PLP and MFCC achieved approximately the same accuracies.…”
Section: Automatic Speech Recognition Systemsmentioning
confidence: 99%
“…Feature extraction is achieved by transforming the speech waveform to a parametric representation for subsequent processing and analysis at a lower data rate. Once quality features are extracted, classification is easy [9].…”
Section: Introductionmentioning
confidence: 99%
“…Another challenge is sufficient advancement of children ASR system where intelligent speech innovations: YouTube Kids, Amazon Alexa, and computeraided language learning has been currently crucial in the process of classroom learning (Valente et al 2012). Since, the acoustic and linguistic patterns in case of children speech signals are very unique which indulge speaking rate, vocal tract length when contrasted to an adult speech signal (Subramanian et al 2019). Additionally, the accessibility of limited children speech datasets even in the context of native language prompts obstruction in development of efficient children speech recognition systems.…”
Section: Introductionmentioning
confidence: 99%