2010 IEEE International Conference on Acoustics, Speech and Signal Processing 2010
DOI: 10.1109/icassp.2010.5495149
|View full text |Cite
|
Sign up to set email alerts
|

Voice source estimation for artificial bandwidth extension of telephone speech

Abstract: Artificial bandwidth extension (ABWE) of speech signals aims to estimate wideband speech (50 Hz -7 kHz) from narrowband signals (300 Hz -3.4 kHz). Applying the source-filter model of speech, many existing algorithms estimate vocal tract filter parameters independently of the source signal. However, many current methods for extending the narrowband voice source signal are limited to straightforward signal processing techniques which are only effective for high-band estimation. This paper presents a method for A… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 9 publications
0
5
0
Order By: Relevance
“…In [25], proposed estimating the wideband voice source signal from the narrowband signal to extend the excitation of voiced speech. The technique was found to be especially effective in the low frequency range.…”
Section: Voice Source Modelingmentioning
confidence: 99%
“…In [25], proposed estimating the wideband voice source signal from the narrowband signal to extend the excitation of voiced speech. The technique was found to be especially effective in the low frequency range.…”
Section: Voice Source Modelingmentioning
confidence: 99%
“…Furthermore, a residual signal can be obtained by harmonic modeling [14]. Another approach leading to an estimated LB residual signal is to perform GMMbased classification among pretrained WB residual signals and subsequent lowpass filtering (known as modified data-driven voice source modeling) [39].…”
Section: Introductionmentioning
confidence: 99%
“…In [37] the LB spectral envelope is modeled by an autoregressive movingaverage (ARMA)-filter. Employing statistical models, a GMMbased envelope estimation scheme was presented in [40] and furthermore an HMM estimating the temporal envelope and adapting the estimated residual signal by gain manipulation was presented in [39], implicitly also defining the resulting LB spectral envelope. Using a statistical model for regression, a conventional neural network for direct LB spectral envelope estimation was developed [36].…”
Section: Introductionmentioning
confidence: 99%
“…For a low-band extension, however, the pitch of the given speaker needs to be reconstructed accurately to avoid annoying artifacts [9]. Unfortunately, pitch estimation -particularly in noise -is still quite challenging [7,10]. Due to the small dimension of their loudspeakers, mobile devices cannot sufficiently represent the lower frequencies anyway.…”
Section: Introductionmentioning
confidence: 99%
“…Some ABWE techniques extend both spectral gaps, i.e., the upper and lower one [7,8]. For a low-band extension, however, the pitch of the given speaker needs to be reconstructed accurately to avoid annoying artifacts [9].…”
Section: Introductionmentioning
confidence: 99%