2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6854292
|View full text |Cite
|
Sign up to set email alerts
|

Modeling speech with sum-product networks: Application to bandwidth extension

Abstract: Sum-product networks (SPNs) are a recently proposed type of probabilistic graphical models allowing complex variable interactions while still granting efficient inference. In this paper we demonstrate the suitability of SPNs for modeling log-spectra of speech signals using the application of artificial bandwidth extension, i.e. artificially replacing the high-frequency content which is lost in telephone signals. We use SPNs as observation models in hidden Markov models (HMMs), which model the temporal evolutio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
36
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 44 publications
(36 citation statements)
references
References 16 publications
0
36
0
Order By: Relevance
“…For re-synthesizing time signals from the log-magnitude spectrogram reconstructions we use the same approach as in [25], using 100 iterations of the Griffin&Lim algorithm [76] to synthesize phase for the reconstructed time-frequency bins. We replace the predicted telephone band by the original telephone band in all models and initialize Griffin&Lim with the corresponding phase.…”
Section: A Setupmentioning
confidence: 99%
See 2 more Smart Citations
“…For re-synthesizing time signals from the log-magnitude spectrogram reconstructions we use the same approach as in [25], using 100 iterations of the Griffin&Lim algorithm [76] to synthesize phase for the reconstructed time-frequency bins. We replace the predicted telephone band by the original telephone band in all models and initialize Griffin&Lim with the corresponding phase.…”
Section: A Setupmentioning
confidence: 99%
“…For objective evaluation, we use the log-spectral distortion (LSD) in the high-band, similar as in [55], [25]. Using 9 order LPC analysis of each frame we get the spectral envelope as (2) where is the square-root of the variance of the LPC-analyzed signal and are the LPC coefficients.…”
Section: B Evaluation Objectivesmentioning
confidence: 99%
See 1 more Smart Citation
“…ABE approaches based on source-filter modelling estimate separate spectral envelope and excitation components [2,3]. Other approaches operate directly on complex short-term spectral estimates derived, e.g., using the Fourier transform (STFT) [4,5] or constant-Q transform [6]. Complementary to short-term spectral estimates, is some form of contextual information, or memory which can be harnessed to improve the reliability of HB component estimation.…”
Section: Introductionmentioning
confidence: 99%
“…The SPN is regarded as observation models in HMMs modeling. The algorithm is robust and computationally inexpensive [10]. Deep learning has emerged as a new area of machine learning research [11].…”
Section: Introductionmentioning
confidence: 99%