2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018
DOI: 10.1109/icassp.2018.8462362
|View full text |Cite
|
Sign up to set email alerts
|

A Simple Cepstral Domain DNN Approach to Artificial Speech Bandwidth Extension

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
7
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 15 publications
0
7
0
Order By: Relevance
“…In this work, a segmental signal‐to‐noise ratio (segSNR) [57], log spectral distance (LSD) [44], NB mean opinion score listening quality objective (MOS‐LQO) [57, 58], and WB MOS‐LQO [59, 60] as the objective measures are taken for examining the quality of artificially extended speech signals. Next, we convert the IIR filter Kopt into an approximate FIR filter by using the Taylor series truncation method.…”
Section: Experiments Analysis and Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…In this work, a segmental signal‐to‐noise ratio (segSNR) [57], log spectral distance (LSD) [44], NB mean opinion score listening quality objective (MOS‐LQO) [57, 58], and WB MOS‐LQO [59, 60] as the objective measures are taken for examining the quality of artificially extended speech signals. Next, we convert the IIR filter Kopt into an approximate FIR filter by using the Taylor series truncation method.…”
Section: Experiments Analysis and Resultsmentioning
confidence: 99%
“…Here, the proposed approach is compared with other existing approaches by maintaining the same experimental conditions such as LPF, HPF, the dimension of HB feature, DNN architecture (seven hidden layers and 256 neurons in each hidden layer), data set, and NB signal processing. Two recently reported current works such as the modulation technique [14] with a slight modification and a cepstral domain approach [44] are included for comparison. The gain for the modulation technique is calculated by following [15] and the cepstrum feature is used for representing the NB information as well as the HB spectral envelope information.…”
Section: Experiments Analysis and Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Furthermore, DNNs have been employed for estimating the UB spectral envelope directly (regression) [25]- [28]. Opposed to the source-filter model, UB spectral magnitudes and UB phases can be estimated right away using sum-product networks (SPMs) [29], DNNs [30], [31], or recurrent neural networks (RNNs) [32], which can then be transformed back to the time domain by an overlap-add (OLA) structure. In several studies, an increased speech quality when using ABE solutions was shown [18], [33].…”
Section: Introductionmentioning
confidence: 99%