Enhancement and Noise Statistics Estimation for Non-Stationary Voiced Speech

Nørholm, Sidsel Marie; Jensen, Jesper Rindom; Christensen, Mads Græsbøll

doi:10.1109/taslp.2016.2514492

Cited by 15 publications

(21 citation statements)

References 35 publications

“…where (27) is due to the Cauchy-Schwarz inequality. Then, we make use of Nyquist's sampling theorem to express $\hat{F}_{\Delta\mu}$ as…”

Section: Error Analysis For Reconstruction Without Pre-filtering (mentioning)

confidence: 99%

“…holds true for all k ∈ N, we conclude (31) from the dominated convergence theorem. The inequality (27) and the equality (31) suggest that the aliasing error…”

Section: Error Analysis For Reconstruction Without Pre-filtering (mentioning)

confidence: 99%

“…As for the study of non-stationary signals, we can only mention a small sample of recent papers [23]-[27]. Reference [28] is a special issue dedicated to the subject.…”

Section: Introduction (mentioning)

confidence: 99%

Choice of Sampling Interval and Extent for Finite-Energy Fields

Gulcu, Özaktaş. 2017. IEEE Trans. Signal Process.

“…The HCM has only very recently been used in [8] as an alternative to modelling non-stationary speech using amplitude modulation models [9]. The HCM was also used in the context of speech processing in [10,11], but was considered in a more general framework in [12,13] in which animal sound signals were analysed. In all of these papers, the complex-valued HCM was used although we know of no application where such signals naturally occur.…”

Section: Introduction (mentioning)

confidence: 99%

Fast harmonic chirp summation

Nielsen, Jensen, et al. 2017. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Self-citation.

The harmonic chirp signal model has only very recently been introduced for modelling approximately periodic signals with a time-varying fundamental frequency. A number of estimators for the parameters of this model have already been proposed, but they are either inaccurate, non-robust to noise, or very computationally intensive. In this paper, we propose a fast algorithm for the harmonic chirp summation method, which has been demonstrated in the literature to be accurate and robust to noise. The proposed algorithm is orders of magnitude faster than previous algorithms, as is also demonstrated via timing studies.

“…In the literature, speech enhancement approaches are designed to mainly improve the speech quality in adverse conditions [19]-[22]. These solutions identify the distortion components based on temporal and spectral estimators [23] or using decomposition techniques such as the EMD (Empirical Mode Decomposition) [24]. The OMLSA (Optimally-Modified Log-Spectral Amplitude) [19] adopts an acoustic noise estimator to access the spectral noise power and reconstruct the speech signal based on the minimization of the log-spectral mean square error.…”

mentioning

confidence: 99%

Harmonic and non-Harmonic Based Noisy Reverberant Speech Enhancement in Time Domain

G., Coelho. 2021. Preprint.

This paper introduces the single-step time-domain method named HnH-NRSE, which is designed for simultaneous speech intelligibility and quality improvement under noisy-reverberant conditions. In this solution, harmonic and non-harmonic elements of speech are separated by applying zero-crossing and energy criteria. An objective evaluation of its non-stationarity degree is further used for an adaptive gain to treat masking components. No prior knowledge of speech statistics or room information is required for this technique. Additionally, two combined solutions, IRMO and IRMN, are proposed as composite methods for improvement on noisy-reverberant speech signals. The proposed and baseline methods are evaluated considering two intelligibility and three quality measures, applied for the objective prediction. The results show that the proposed scheme leads to a higher intelligibility and quality improvement when compared to competing methods in most scenarios. Additionally, a perceptual intelligibility listening test is performed, which corroborates these results. Furthermore, the proposed HnH-NRSE solution attains SRMR quality measure with similar results when compared to the composed IRMO and IRMN techniques.
