2002
DOI: 10.1109/tsa.2002.803420
|View full text |Cite
|
Sign up to set email alerts
|

Speech enhancement using a mixture-maximum model

Abstract: Abstract-We present a spectral domain, speech enhancement algorithm. The new algorithm is based on a mixture model for the short time spectrum of the clean speech signal, and on a maximum assumption in the production of the noisy speech spectrum. In the past this model was used in the context of noise robust speech recognition. In this paper we show that this model is also effective for improving the quality of speech signals corrupted by additive noise. The computational requirements of the algorithm can be s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
55
0

Year Published

2005
2005
2016
2016

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 83 publications
(58 citation statements)
references
References 20 publications
0
55
0
Order By: Relevance
“…The frame index t is omitted for simplicity. We rewrite the full model as (11) where (8) and p(s) is the mixture probability.…”
Section: Two Signal Estimation Approachesmentioning
confidence: 99%
See 3 more Smart Citations
“…The frame index t is omitted for simplicity. We rewrite the full model as (11) where (8) and p(s) is the mixture probability.…”
Section: Two Signal Estimation Approachesmentioning
confidence: 99%
“…SuperGaussian priors, including Gaussian, Laplacian, and Gamma densities, have been used to model the real part and imaginary part of the frequency components [10], and the MMSE estimator used for signal estimation. The log-spectra of speech has often been explicitly and accurately modeled by the Gaussian mixture model (GMM) [11]- [13]. The GMM clusters similar log-spectra together and represents them by a mixture component.…”
mentioning
confidence: 99%
See 2 more Smart Citations
“…The developed HMM with gain adaptation has been applied to the speech enhancement [14] and to the recognition of clean and noisy speech [15]. In contrast to the frequency-domain models [12]- [15], the density of log-spectral amplitudes is modeled by a Gaussian mixture model (GMM) with parameters trained on the clean signals [16]- [18]. Spectrally similar signals are clustered and represented by their mixture components.…”
mentioning
confidence: 99%