2020
DOI: 10.48550/arxiv.2010.15258
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors

Abstract: Human subjective evaluation is the "gold standard" to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores. The conventional and widely used metrics require a reference clean speech signal, which is unavailable in real recordings. The no-reference approaches correlate poorly with human ratings and are not widely adopted in the research community. One of the biggest use cases of these perceptual objective metrics is to evaluate noise suppres… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
18
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(19 citation statements)
references
References 9 publications
1
18
0
Order By: Relevance
“…Three metrics are utilized to evaluate the performance of our framework, namely perceptual evaluation of speech quality (PESQ) [36], extended short-time objective intelligibility (ES-TOI) [37], and DNSMOS [38]. PESQ, and ESTOI are to evaluate the objective performance of speech quality and intelligibility.…”
Section: Results and Analysismentioning
confidence: 99%
“…Three metrics are utilized to evaluate the performance of our framework, namely perceptual evaluation of speech quality (PESQ) [36], extended short-time objective intelligibility (ES-TOI) [37], and DNSMOS [38]. PESQ, and ESTOI are to evaluate the objective performance of speech quality and intelligibility.…”
Section: Results and Analysismentioning
confidence: 99%
“…Two metrics are utilized to evaluate the objective performance of different systems, namely perceptual evaluation of speech quality (PESQ) [28], and extended short-time objective intelligibility (ESTOI) [29]. Besides, to evaluate the subjective quality, DNSMOS is also adopted, which is a robust nonintrusive speech quality metric and well suitable for accurate subjective rating [30].…”
Section: Results and Analysismentioning
confidence: 99%
“…We use the following objective metrics to evaluate speech enhancement performance: the perceptual evaluation of speech quality (PESQ) [34], short-time objective intelligibility (STOI) [35], segmental signal-to-noise ratio (SegSNR), and three mean opinion score (MOS) prediction (i.e., signal distortion evaluation (CISG), the intrusiveness of background noise (CBAK) and overall effect (COVL)) [36]. We also evaluate the subjective quality by DNS-MOS [37], which is a robust non-intrusive perceptual speech quality metric. Higher values of all metrics indicate better performance.…”
Section: Results and Analysismentioning
confidence: 99%