ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414878
|View full text |Cite
|
Sign up to set email alerts
|

Dnsmos: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors

Abstract: Human subjective evaluation is the "gold standard" to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores. The conventional and widely used metrics require a reference clean speech signal, which is unavailable in real recordings. Previous no-reference approaches correlate poorly with human ratings and are not widely adopted in the research community. One of the biggest use cases of these perceptual objective metrics is to evaluate noise su… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
86
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 117 publications
(87 citation statements)
references
References 12 publications
1
86
0
Order By: Relevance
“…The speech recognition system was trained to handle audio with a wide range of energy levels so we do not expect any degradation of WAcc due to varying energy levels in clips from the blind test set. Table 1 shows the Pearson correlation coefficient (PCC) and Spearman's rank correlation coefficient (SRCC) between per-model subjective scores and corresponding DNSMOS P.835 scores [9]. The high correlation between subjective scores and DNSMOS P.835 in both tracks shows the efficacy of DNSMOS P.835 in ranking the DNS models.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…The speech recognition system was trained to handle audio with a wide range of energy levels so we do not expect any degradation of WAcc due to varying energy levels in clips from the blind test set. Table 1 shows the Pearson correlation coefficient (PCC) and Spearman's rank correlation coefficient (SRCC) between per-model subjective scores and corresponding DNSMOS P.835 scores [9]. The high correlation between subjective scores and DNSMOS P.835 in both tracks shows the efficacy of DNSMOS P.835 in ranking the DNS models.…”
Section: Resultsmentioning
confidence: 99%
“…We provided the participants with an Azure API for estimating WAcc on the development set. We computed DNSMOS P.835 [9] for each audio clip in the training set and provided this to participants. DNSMOS P.835 scores can be used to segment the training dataset for conducting the data ablation studies.…”
Section: Challenge Tracksmentioning
confidence: 99%
See 2 more Smart Citations
“…We use DNSMOS [19] which is a reliable non-intrusive objective speech quality metric as our evaluation metrics at training stag and take Mean Opinion Score (MOS) of the ITU-T P.835 framework as result. The results of the evaluation using the ITU-T P.835 criterion [20] which is provided by the organizer are shown in Table 2.…”
Section: Deep Noise Suppression Challengementioning
confidence: 99%