2019
DOI: 10.1109/access.2019.2929864
|View full text |Cite
|
Sign up to set email alerts
|

Speech Enhancement Algorithm Based on Super-Gaussian Modeling and Orthogonal Polynomials

Abstract: Different types of noise from the surrounding always interfere with speech and produce annoying signals for the human auditory system. To exchange speech information in a noisy environment, speech quality and intelligibility must be maintained, which is a challenging task. In most speech enhancement algorithms, the speech signal is characterized by Gaussian or super-Gaussian models, and noise is characterized by a Gaussian prior. However, these assumptions do not always hold in real-life situations, thereby ne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
34
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
10

Relationship

1
9

Authors

Journals

citations
Cited by 42 publications
(34 citation statements)
references
References 69 publications
(113 reference statements)
0
34
0
Order By: Relevance
“…Some scholars have adopted super-Gaussian functions to model speech signals because super-Gaussian priors allow for better reduction of noise between speech spectral harmonics than Gaussian estimators such as the Wiener filter [6]. Mahmmod et al propose an optimum low-distortion estimator with models that fit well with speech and noise signals to decrease the deviation of Gaussian or super-Gaussian models [7]. These algorithms are designed based on the complex statistical characteristics of the interaction between noise and clean speech, but they usually assume that the noise signal is relatively stable or changes slowly.…”
Section: In the 1980s Ephraim And Malah Proposed The Minimummentioning
confidence: 99%
“…Some scholars have adopted super-Gaussian functions to model speech signals because super-Gaussian priors allow for better reduction of noise between speech spectral harmonics than Gaussian estimators such as the Wiener filter [6]. Mahmmod et al propose an optimum low-distortion estimator with models that fit well with speech and noise signals to decrease the deviation of Gaussian or super-Gaussian models [7]. These algorithms are designed based on the complex statistical characteristics of the interaction between noise and clean speech, but they usually assume that the noise signal is relatively stable or changes slowly.…”
Section: In the 1980s Ephraim And Malah Proposed The Minimummentioning
confidence: 99%
“…A number of learning approaches have recently been proposed for estimating spectral-masks with confirmed notable results [17][18][19][20]. Few of the recent related work regarding supervised speech enhancement is available in [21][22][23].…”
Section: Introductionmentioning
confidence: 98%
“…Past speech enhancement methods fall into two categories: statistical-based approaches and data-driven approaches. Statistical-based methods are based on particular probabilistic models of noisy speech [4], [5], such as spectral subtraction methods [6], Wiener filtering [7], and minimum mean-square error (MMSE) of the spectral amplitude [8]. Recently, datadriven approaches use deep neural networks (DNNs) to estimate the ideal ratio mask (IRM) or ideal binary mask (IBM) and have demonstrated significant performance improvement over the conventional statistical-based methods [9], [10].…”
Section: Introductionmentioning
confidence: 99%