2017 25th European Signal Processing Conference (EUSIPCO) 2017
DOI: 10.23919/eusipco.2017.8081272
|View full text |Cite
|
Sign up to set email alerts
|

Effectiveness of ideal ratio mask for non-intrusive quality assessment of noise suppressed speech

Abstract: Abstract-The Ideal Ratio Mask (IRM) has proven to be very effective tool in many applications such as speech segregation, speech enhancement for hearing aid design and noise robust speech recognition tasks. The IRM provides information regarding the amount of signal power at each Time-Frequency (T-F) unit in a given signal-plus-noise mixture. In this paper, we propose to use the IRM for non-intrusive quality assessment of noise suppressed speech. Since the quality of noise suppressed speech is dependent on the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 18 publications
0
7
0
Order By: Relevance
“…D. -S. Kim et al [14] proposed a perceptually motivated algorithm based on a temporal envelope representation of speech to assess speech quality. Meet H. Soni [15] used the ideal ratio mask (IRM) for non-intrusive quality assessment of noise suppressed speech. Wang [16] applied autoencoder to extract bottleneck features of speech signals and mapped the features to the predicted MOS using support vector regression (SVR) [17].…”
Section: Related Workmentioning
confidence: 99%
“…D. -S. Kim et al [14] proposed a perceptually motivated algorithm based on a temporal envelope representation of speech to assess speech quality. Meet H. Soni [15] used the ideal ratio mask (IRM) for non-intrusive quality assessment of noise suppressed speech. Wang [16] applied autoencoder to extract bottleneck features of speech signals and mapped the features to the predicted MOS using support vector regression (SVR) [17].…”
Section: Related Workmentioning
confidence: 99%
“…As will be explained in Chapter 4, in supervised learning terminology, non-intrusive quality estimation can be described as a multi-class classification or a regression problem, where the input and output are the signal features and the quality score respectively [1]. Several non-intrusive methods have recently been proposed in the context of quality assessment, using machine learning algorithms for estimating the score of audio signals [23,24,25,26,27,28,29,30,31]. Figure (1.1) shows the high-level structure of a non-intrusive quality assessment system.…”
Section: Approaches and Principlesmentioning
confidence: 99%
“…These systems may differ in the considered features, the predicting function, or both. The work de-scribed in [60] makes use of a classifier to predict the discrete value of quality score while the works described in [23,24,25,26,27,28,29,31,61,62] apply regression methods (with shallow architectures) to estimate the subjective Mean Opinion Score (MOS) assigned to a speech file. On the other hand, approaches in [16,30] use a combination of classification and regression algorithms as the predicting function.…”
Section: Relevant Workmentioning
confidence: 99%
See 2 more Smart Citations