2013
DOI: 10.1109/tasl.2013.2244089
|View full text |Cite
|
Sign up to set email alerts
|

Maximizing Phoneme Recognition Accuracy for Enhanced Speech Intelligibility in Noise

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
14
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(15 citation statements)
references
References 23 publications
1
14
0
Order By: Relevance
“…where we used Bayes' rule and whereâ L (u T , C), abbreviated toâ L , is the set of acoustic features observed by the listener, which is modeled as a deterministic function of the talker phoneme sequence u T and the speech modification parameters C. The first term of ( 16) is the likelihood of the talker phoneme sequence for the observed featuresâ L , the second term is the a-priori probability that the phoneme sequence u T is decoded by the listener, and the third term is the inverse a-priori probability of the listener observed features. Optimization of the likelihood term only reduces complexity and provides good results [11].…”
Section: B Intelligibility Enhancement Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…where we used Bayes' rule and whereâ L (u T , C), abbreviated toâ L , is the set of acoustic features observed by the listener, which is modeled as a deterministic function of the talker phoneme sequence u T and the speech modification parameters C. The first term of ( 16) is the likelihood of the talker phoneme sequence for the observed featuresâ L , the second term is the a-priori probability that the phoneme sequence u T is decoded by the listener, and the third term is the inverse a-priori probability of the listener observed features. Optimization of the likelihood term only reduces complexity and provides good results [11].…”
Section: B Intelligibility Enhancement Systemsmentioning
confidence: 99%
“…The optimization criteria vary widely as the signal processing algorithms are derived from different viewpoints and with different computational and delay constraints. Criteria used include the probability of correct phoneme recognition [11], auditory models [6], [13], [14], the articulation index [2], the speech intelligibility index [4], [8], mutual information [15], and sound-field distortion [16].…”
Section: Introductionmentioning
confidence: 99%
“…phoneLLdscr: This entry builds on phoneLLabso, augmenting the objective measure with the difference of the measure in phoneLLabso, and the log of the sum of likelihoods conditioned on alternative acoustic models [26]. To reduce complexity, the context (phone neighbours) is assumed known and only a subset of all alternative models is considered based on the proximity of their LL scores to that obtained by the correct model.…”
Section: Challenge Entriesmentioning
confidence: 99%
“…The noise corrupted speech signal is of poor quality, as the noise components suppress the speech components and corrupts its cleanliness. On the other hand, speech intelligibility is the quantitative representation of speaker's intended message and the listener's perceived message [2]. It represents the understandability of the contents of the speech signal and it depends on the sound units, as the contents are defined by the sound units.…”
Section: Introductionmentioning
confidence: 99%