1999
DOI: 10.1121/1.427950
|View full text |Cite
|
Sign up to set email alerts
|

A model of auditory perception as front end for automatic speech recognition

Abstract: A front end for automatic speech recognizers is proposed and evaluated which is based on a quantitative model of the "effective" peripheral auditory processing. The model simulates both spectral and temporal properties of sound processing in the auditory system which were found in psychoacoustical and physiological experiments. The robustness of the auditory-based representation of speech was evaluated in speaker-independent, isolated word recognition experiments in different types of additive noise. The resul… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
65
1

Year Published

2001
2001
2014
2014

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 100 publications
(69 citation statements)
references
References 26 publications
2
65
1
Order By: Relevance
“…In that study, there was no pre-processing or enhancement of the speech utterances. The front-ends investigated were perceptual linear prediction (PLP) proposed by Hermansky [17], the PEMO algorithm proposed by Tchorz and Kollmeier [18], and the front-end processor proposed by Li et al [14]. For the task of connected digit recognition using the Aurora 2 database, the front-end proposed by Li et al gave the best overall recognition results of all the auditory models examined, and with an overall reduction in recognition error compared to the ETSI basic front-end [6] which was used as a baseline for comparison.…”
Section: Auditory Modelling As An Alternative Front-endmentioning
confidence: 99%
“…In that study, there was no pre-processing or enhancement of the speech utterances. The front-ends investigated were perceptual linear prediction (PLP) proposed by Hermansky [17], the PEMO algorithm proposed by Tchorz and Kollmeier [18], and the front-end processor proposed by Li et al [14]. For the task of connected digit recognition using the Aurora 2 database, the front-end proposed by Li et al gave the best overall recognition results of all the auditory models examined, and with an overall reduction in recognition error compared to the ETSI basic front-end [6] which was used as a baseline for comparison.…”
Section: Auditory Modelling As An Alternative Front-endmentioning
confidence: 99%
“…This connectionist model has a highly flexible architecture that can be easily customized to model processing in sensory cortices (see, e.g., Erwin et al, 1995;Palakal et al, 1995;Ritter et al, 1992;Sirosh & Miikkulainen, 1997;Swindale & Bauer, 1998). Signalprocessing models of auditory representations have advanced considerably over the past decade (see, e.g., Meyer-Base & Scheich, 1995;Pitton, Wang, & Juang, 1996;Robert & Eriksson, 1999;Tchorz & Kollmeier, 1999;K. Wang & Shamma, 1995a).…”
Section: A Computational Model Of Auditory Cortical Processingmentioning
confidence: 99%
“…[4,5]), a number of feature extraction methods have been proposed in recent years that exploit temporal information. These systems typically provide a recognition accuracy that exceeds that obtained using MFCC or PLP features in the presence of noise and other adverse conditions [6,7,8], especially if they are combined with a traditional recognition system in some fashion.…”
Section: Introductionmentioning
confidence: 99%