2009
DOI: 10.1121/1.3224721
|View full text |Cite
|
Sign up to set email alerts
|

Microscopic prediction of speech recognition for listeners with normal hearing in noise using an auditory model

Abstract: This study compares the phoneme recognition performance in speech-shaped noise of a microscopic model for speech recognition with the performance of normal-hearing listeners. "Microscopic" is defined in terms of this model twofold. First, the speech recognition rate is predicted on a phoneme-by-phoneme basis. Second, microscopic modeling means that the signal waveforms to be recognized are processed by mimicking elementary parts of human's auditory processing. The model is based on an approach by Holube and Ko… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

3
45
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 67 publications
(48 citation statements)
references
References 32 publications
3
45
0
Order By: Relevance
“…The proposed model was motivated by a similar approach taken by Jürgens and Brand (2009) and Jürgens et al (2010). Their speech intelligibility prediction was performed using two model stages.…”
Section: Introductionmentioning
confidence: 98%
“…The proposed model was motivated by a similar approach taken by Jürgens and Brand (2009) and Jürgens et al (2010). Their speech intelligibility prediction was performed using two model stages.…”
Section: Introductionmentioning
confidence: 98%
“…While this scoring procedure provides important information about whether a listener recognizes a stimulus item, examining the recognition errors produced by the listener can provide valuable information about how partial information is perceived and resolved by the listener depending on the listening condition. Indeed, there is recent interest in the development of "microscopic" models of speech recognition that take into account fine-grained recognition on a phoneme level (e.g., Cooke, 2006;J€ urgens and Brand, 2009). Toward this end, various corpora have been developed to document consistent word confusions in various types of noise (T oth et al, 2015;Marxer et al, 2016).…”
Section: Introductionmentioning
confidence: 99%
“…Numerous studies have been reported aiming at predicting the performance of HI listeners in a speech perception task (e.g., J€ urgens and Brand, 2009;Brown et al, 2010), with a peripheral model used as a front end to a conventional automatic speech recognition system, as a back end. A limiting property of such an approach is the inability to decompose the origin of errors, especially front-end versus back-end errors.…”
Section: Introductionmentioning
confidence: 99%