2012
DOI: 10.1109/tasl.2012.2205242
|View full text |Cite
|
Sign up to set email alerts
|

A CASA-Based System for Long-Term SNR Estimation

Abstract: Abstract-We present a system for robust signal-to-noise ratio (SNR) estimation based on computational auditory scene analysis (CASA). The proposed algorithm uses an estimate of the ideal binary mask to segregate a time-frequency representation of the noisy signal into speech dominated and noise dominated regions. Energy within each of these regions is summated to derive the filtered global SNR. An SNR transform is introduced to convert the estimated filtered SNR to the true broadband SNR of the noisy signal. T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 30 publications
(14 citation statements)
references
References 27 publications
(51 reference statements)
0
14
0
Order By: Relevance
“…This is because classifiers are trained to distinguish between speech-and noise-dominant T-F units, and the acoustic characteristics of speechdominant units are generally different from those of noisedominant units, even when that noise is babble. We consider SNR mismatch to be of less concern than noise mismatch because SNR estimation can be performed with reasonable accuracy (Kim and Stern, 2008;Narayanan and Wang, 2012). Regarding noise mismatch, recent effort has been made to address this issue.…”
Section: Discussionmentioning
confidence: 99%
“…This is because classifiers are trained to distinguish between speech-and noise-dominant T-F units, and the acoustic characteristics of speechdominant units are generally different from those of noisedominant units, even when that noise is babble. We consider SNR mismatch to be of less concern than noise mismatch because SNR estimation can be performed with reasonable accuracy (Kim and Stern, 2008;Narayanan and Wang, 2012). Regarding noise mismatch, recent effort has been made to address this issue.…”
Section: Discussionmentioning
confidence: 99%
“…The bottom-up mask is estimated using a recently proposed system described in [31], which combines masks estimated by CASA based [19] and speech enhancement based methods [32]. The speech enhancement based mask uses an LC of -5 dB.…”
Section: Methodsmentioning
confidence: 99%
“…After subtracting the noise spectrum from the input signal to obtain the clean signal, SNR is estimated. In [13], computational auditory scene analysis is used to estimate speech dominated and arXiv:1804.04353v1 [eess.AS] 12 Apr 2018 noise dominated portions of the signal in order to obtain SNR.…”
Section: Related Workmentioning
confidence: 99%