2017
DOI: 10.1016/j.dsp.2017.06.001
|View full text |Cite
|
Sign up to set email alerts
|

Audio enhancement using local SNR-based sparse binary mask estimation and spectral imputation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 27 publications
0
4
0
Order By: Relevance
“…The system processes an in-the-wild speech audio samples and attempts to remove the noise to its greatest extent, based on the identification of the speech surroundings, which is indicated by the negative recording. Considering the same test set evaluation metrics, the experimental [22,74]-in terms of speech distortion, as indicated by the levels of LSD, SDR, and MCD (cf. Table 6).…”
Section: Results On Speech Denoisingmentioning
confidence: 99%
See 1 more Smart Citation
“…The system processes an in-the-wild speech audio samples and attempts to remove the noise to its greatest extent, based on the identification of the speech surroundings, which is indicated by the negative recording. Considering the same test set evaluation metrics, the experimental [22,74]-in terms of speech distortion, as indicated by the levels of LSD, SDR, and MCD (cf. Table 6).…”
Section: Results On Speech Denoisingmentioning
confidence: 99%
“…According to AudioSet's ontology, excluding the noise recordings labelled as 'Human sounds', we considered 16 198 samples for training, 636 for development, and 714 for test. 8 A variety of evaluation metrics, including log spectral distortion (LSD), signal-todistortion ratio (SDR), perceptual evaluation of speech quality (PESQ), short-time objective intelligibility (STOI), Mel cepstral distortion (MCD), and segmental SNR (SSNR), which are widely used in prior work [22], were taken into account to assess the performance of N-HANS in several Signal-to-Noise Ratio (SNR) conditions. As selective noise suppression has not been explored in the literature yet, it is not possible to compare N-HANS performance on selective noise suppression with previous work.…”
Section: Dataset and Evaluation Metricsmentioning
confidence: 99%
“…For selective noise suppression, two samples from AudioSet are selected as positive and negative recordings for each noisy sample. Test set evaluation metrics, log spectral distortion (LSD), signal-to-distortion ratio (SDR), perceptual evaluation of speech quality (PESQ), short-time objective intelligibility (STOI), mel cepstral distortion (MCD), and segmental SNR (SSNR), which are widely used in prior works such as (Jeon and Kim, 2017), are demonstrated in Table 2 and Table 3 for different input signal-to-noise ratio (SNR) cases.…”
Section: Methodsmentioning
confidence: 99%
“…Despite good results obtained by machine learning approaches (see [1]- [3] for deep neural network or [4], [5] for dictionary-based methods), there is still room for unsupervised techniques, especially in applications where large enough databases are hardly available for all the types of noise and speech signals that can actually be encountered [6], [7]. This is the case in assisted listening for The associate editor coordinating the review of this manuscript and approving it for publication was Md.…”
Section: Introduction a Context And Motivationmentioning
confidence: 99%