2016
DOI: 10.1109/taslp.2016.2514492
|View full text |Cite
|
Sign up to set email alerts
|

Enhancement and Noise Statistics Estimation for Non-Stationary Voiced Speech

Abstract: In this paper, single channel speech enhancement in the time domain is considered. We address the problem of modelling non-stationary speech by describing the voiced speech parts by a harmonic linear chirp model instead of using the traditional harmonic model. This means that the speech signal is not assumed stationary, instead the fundamental frequency can vary linearly within each frame. The linearly constrained minimum variance (LCMV) filter and the amplitude and phase estimation (APES) filter are derived i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
19
0
2

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 15 publications
(21 citation statements)
references
References 35 publications
0
19
0
2
Order By: Relevance
“…where (27) is due to the Cauchy-Schwarz inequality. Then, we make use of Nyquist's sampling theorem to expressF Δ μ aŝ…”
Section: Error Analysis For Reconstruction Without Pre-filteringmentioning
confidence: 99%
See 2 more Smart Citations
“…where (27) is due to the Cauchy-Schwarz inequality. Then, we make use of Nyquist's sampling theorem to expressF Δ μ aŝ…”
Section: Error Analysis For Reconstruction Without Pre-filteringmentioning
confidence: 99%
“…holds true for all k ∈ N, we conclude (31) from the dominated convergence theorem. The inequality (27) and the equality (31) suggest that the aliasing error…”
Section: Error Analysis For Reconstruction Without Pre-filteringmentioning
confidence: 99%
See 1 more Smart Citation
“…The HCM has only very recently been used in [8] as an alternative to modelling non-stationary speech using amplitude modulation models [9]. The HCM was also used in the context of speech processing in [10,11], but was considered in a more general framework in [12,13] in which animal sound signals were analysed. In all of these papers, the complex-valued HCM was used although we know of no application where such signals naturally occur.…”
Section: Introductionmentioning
confidence: 99%
“…In the literature, speech enhancement approaches are designed to mainly improve the speech quality in adverse conditions [19] [20][21] [22]. These solutions identify the distortion components based on temporal and spectral estimators [23] or using decomposition techniques such as the EMD (Empirical Mode Decomposition) [24]. The OMLSA (Optimally-Modified Log-Spectral Amplitude) [19] adopts an acoustic noise estimator to access the spectral noise power and reconstruct the speech signal based on the minimization of the logspectral mean square error.…”
mentioning
confidence: 99%