2010 IEEE International Conference on Acoustics, Speech and Signal Processing 2010
DOI: 10.1109/icassp.2010.5495701

A short-time objective intelligibility measure for time-frequency weighted noisy speech

Abstract: Existing objective speech-intelligibility measures are suitable for several types of degradation; however, they turn out to be less appropriate for methods where noisy speech is processed by a time-frequency (TF) weighting, e.g., noise reduction and speech separation. In this paper, we present an objective intelligibility measure that shows high correlation (rho = 0.95) with the intelligibility of both noisy and TF-weighted noisy speech. The proposed method shows significantly better performance than th…

Citations: cited by 705 publications (321 citation statements)
References: 14 publications
“…Short-time objective intelligibility measure (STOI) is based on mean cross-correlations between processed and reference signals across time-frequency cells [30]. The STOI values for NB and WB speech are shown in Fig.…”
Section: STOI (mentioning)
confidence: 99%
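The statement above describes STOI as a mean of correlations over time-frequency cells. A minimal Python sketch of that idea follows, assuming the time-frequency envelopes are already available as one list of per-frame magnitudes per band. The function names, the input layout, and the default segment length are illustrative assumptions, and the sketch leaves out the band decomposition and the normalization and clipping steps of the published measure, so it is not a reference implementation of STOI.

```python
import math


def pearson(x, y):
    """Sample correlation coefficient; returns 0.0 if either input is constant."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    denom = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    if denom == 0.0:
        return 0.0
    return sum(a * b for a, b in zip(dx, dy)) / denom


def mean_short_time_correlation(reference, processed, seg_len=30):
    """Average the correlation over all bands and all sliding segments.

    reference, processed: lists of bands (hypothetical layout), each band a
    list of per-frame envelope magnitudes of equal length.
    seg_len: number of consecutive frames per segment (assumed default).
    """
    scores = []
    for ref_band, proc_band in zip(reference, processed):
        for end in range(seg_len, len(ref_band) + 1):
            ref_seg = ref_band[end - seg_len:end]
            proc_seg = proc_band[end - seg_len:end]
            scores.append(pearson(ref_seg, proc_seg))
    if not scores:
        raise ValueError("signals are shorter than one segment")
    return sum(scores) / len(scores)
```

With identical reference and processed envelopes every segment correlates perfectly, so the score sits at its upper bound of 1; distortions that change the temporal envelope within a band pull the average down.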
“…5 (a) and (b) for NB and WB speech, respectively. computed as in (30), where $S_x(j, m)$ and $\hat{S}_x(j, m)$ denote the spectral slopes of the clean and enhanced signals, respectively, for the $j$-th band and $m$-th frame, and $W(j, m)$ represents the weight [29]. …”
Section: Composite Objective Measure (mentioning)
confidence: 99%
“…The STOI [38], [40] method achieved a Spearman correlation coefficient of 0.94 with subjective word intelligibility scores, validating its use for the ADN-TN and additive noise partition of the C-Qual database. The speech intelligibility estimation experiments are restricted to the additive noise partitions of the database, for which we aim to predict the output of the intrusive STOI method (which is well correlated with intelligibility) using our non-intrusive estimator.…”
Section: B Labeling (mentioning)
confidence: 99%
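The Spearman coefficient quoted above measures how well the ranking of objective scores matches the ranking of subjective scores. A minimal Python sketch of that computation follows, using only the standard library. The function names are illustrative, and any scores passed in are the caller's own data; nothing here reproduces the values from the cited work.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: the Pearson correlation of the ranks."""
    rx = average_ranks(x)
    ry = average_ranks(y)
    n = len(rx)
    mean_x = sum(rx) / n
    mean_y = sum(ry) / n
    dx = [a - mean_x for a in rx]
    dy = [b - mean_y for b in ry]
    denom = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    if denom == 0.0:
        return 0.0  # undefined when one input is constant; 0.0 by convention here
    return sum(a * b for a, b in zip(dx, dy)) / denom
```

Because only ranks enter the calculation, any monotonically increasing mapping between objective and subjective scores yields a coefficient of 1, which is why the statistic suits comparing a bounded objective measure against listening-test percentages.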
“…LCIA differs from the LCQA method [22] by employing an additional feature (iSNR), an external VAD, the use of a two-step feature selection and projection technique and training on databases labeled with STOI [38]. The LCIA method begins by deriving per frame features from the speech waveform, then applying a statistical model followed by a two-step dimensionality reduction and GMM mapping.…”
Section: A LCIA (mentioning)
confidence: 99%