2007
DOI: 10.1109/tasl.2006.885934
|View full text |Cite
|
Sign up to set email alerts
|

Neural Network-Based Artificial Bandwidth Expansion of Speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
20
0
1

Year Published

2012
2012
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 43 publications
(22 citation statements)
references
References 18 publications
0
20
0
1
Order By: Relevance
“…A few techniques, such as linear mapping [2], piecewise linear mapping [3,4], codebook mapping [5,6], neural networks [7,8], Gaussian mixture model [9,10], and hidden Markov model [11,12] and non-negative hidden Markov model [13], have been explored. Linear predictive coefficients (LPCs) or line spectral frequencies (LSFs) [14,15] are widely used to represent the spectral envelope, while the excitation can be found by inverse filtering the signal with LPCs, modulation techniques, non-linear processing, and the application of function generators [1].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…A few techniques, such as linear mapping [2], piecewise linear mapping [3,4], codebook mapping [5,6], neural networks [7,8], Gaussian mixture model [9,10], and hidden Markov model [11,12] and non-negative hidden Markov model [13], have been explored. Linear predictive coefficients (LPCs) or line spectral frequencies (LSFs) [14,15] are widely used to represent the spectral envelope, while the excitation can be found by inverse filtering the signal with LPCs, modulation techniques, non-linear processing, and the application of function generators [1].…”
Section: Introductionmentioning
confidence: 99%
“…However, there are still some studies, such as folded spectrum adjusting [8] and sparse probabilistic state mapping [16]. The former one folds the narrowband spectrum and adjusts the level of the wideband spectrum, attempting to estimate the spectral envelope in a different way.…”
Section: Introductionmentioning
confidence: 99%
“…The present authors have worked on ABE for several years and proposed algorithms that have been evaluated with subjective listening tests (e.g., Laaksonen et al, 2005;Kontio et al, 2007;Laaksonen et al, 2009;Pulakka et al, 2008;Pulakka and Alku, 2011) The results have been consistent and promising, indicating that ABE improves both the quality and intelligibility of narrowband speech. In addition, listening tests in three different languages (English, Mandarin Chinese, and Russian) did not show substantial differences in the ABE performance between the examined languages (Pulakka et al, 2008).…”
Section: Introductionmentioning
confidence: 55%
“…The highband envelope can be parameterized, for instance, by line-spectral frequencies (LSF) or by cepstral coefficients (Iser and Schmidt, 2005). The parameters for the highband can be estimated from narrowband signal features using, for example, codebooks (Kornagel, 2006), hidden Markov models (HMM) (Jax and Vary, 2003), Gaussian mixture models (GMM) (Park and Kim, 2000;Liu et al, 2009;Nour-Eldin and Kabal, 2011), or neural networks (Iser and Schmidt, 2005;Kontio et al, 2007).…”
Section: Introductionmentioning
confidence: 99%
“…For the purpose of high-quality speech synthesis, it is therefore important to compensate for the lost frequency band of the speech data. Several statistical BWE approaches have been proposed, which are based on codebooks [12], [13], Gaussian mixture models (GMMs) [14], [15], hidden Markov models (HMMs) [16], [17], neural networks [18], and deep neural networks [19], [20]. These methods basically achieve the BWE task by estimating lost high-band components from remained low-band ones.…”
Section: Introductionmentioning
confidence: 99%