2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)
DOI: 10.1109/icassp.2000.862114
|View full text |Cite
|
Sign up to set email alerts
|

Narrowband to wideband conversion of speech using GMM based transformation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 94 publications
(5 citation statements)
references
References 3 publications
0
4
0
Order By: Relevance
“…The first attempts at music audio bandwidth extension used nonlinear devices [16] and spectral band replication [17]. Other approaches relied on data-driven techniques, such as Gaussian mixture models [18], [19], Hidden Markov Models [20], and shallow [21], [22] and deep neural networks [23]- [25]. Nevertheless, these methods often yielded suboptimal quality due to their limited modeling capabilities.…”
Section: A Audio Bandwidth Extension and Super-resolutionmentioning
confidence: 99%
“…The first attempts at music audio bandwidth extension used nonlinear devices [16] and spectral band replication [17]. Other approaches relied on data-driven techniques, such as Gaussian mixture models [18], [19], Hidden Markov Models [20], and shallow [21], [22] and deep neural networks [23]- [25]. Nevertheless, these methods often yielded suboptimal quality due to their limited modeling capabilities.…”
Section: A Audio Bandwidth Extension and Super-resolutionmentioning
confidence: 99%
“…Systems based on dictionary learning to map low-frequency patterns to high-frequency components have been proposed in [16,17]. Classic machine learning methods have also been explored for BWE, such as Gaussian mixture models (GMMs) [18], hidden Markov models (HMM) [19,20], or non-negative matrix factorization (NMF) [21,22].…”
Section: Signal Processing Approachesmentioning
confidence: 99%
“…T and is fed into the GMM-based Bayesian estimator [11] T under the MMSE criterion. The joint vector of the HF and LF vectors is referred to as…”
Section: Hf Spectral Envelope Estimator Based On Gaussian Mixture Modelmentioning
confidence: 99%
“…By using Mel-scale filters and cepstrum analysis, MFCC provides more certainty about the HF components, which is quantified as the ratio of mutual information between the HF and LF parameters to the discrete entropy of HF parameters. Then, the joint probability density function of the HF and LF feature vectors is approximated by a Gaussian mixture model (GMM), and the HF spectral envelope is estimated according to the minimum mean square error (MMSE) criterion [11,12]. The method based on MFCC and GMM can effectively reduce the spectral distortion of the extended speech compared to the method based on line spectral frequency parameters [10] and also achieves a good extension performance for audio signals.…”
Section: Introductionmentioning
confidence: 99%