2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016
DOI: 10.1109/icassp.2016.7472812
|View full text |Cite
|
Sign up to set email alerts
|

A subjective listening test of six different artificial bandwidth extension approaches in English, Chinese, German, and Korean

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 10 publications
0
4
0
Order By: Relevance
“…Most of the formant frequencies are still present in the AMR condition, however, with a missing LB, a spectral imbalance towards high frequencies results, which consequently affects female speakers stronger than male speakers. Of course, UB-ABE improves speech quality [28], [33], however, it does not sufficiently restore spectral balance over sounds, especially for female speakers. In [12] it was already stated, that only the simultaneous extension towards high and low frequencies leads to the maximum improvement possible, rather than the exclusive use of only one of the techniques.…”
Section: Subjective Assessmentmentioning
confidence: 95%
See 1 more Smart Citation
“…Most of the formant frequencies are still present in the AMR condition, however, with a missing LB, a spectral imbalance towards high frequencies results, which consequently affects female speakers stronger than male speakers. Of course, UB-ABE improves speech quality [28], [33], however, it does not sufficiently restore spectral balance over sounds, especially for female speakers. In [12] it was already stated, that only the simultaneous extension towards high and low frequencies leads to the maximum improvement possible, rather than the exclusive use of only one of the techniques.…”
Section: Subjective Assessmentmentioning
confidence: 95%
“…Opposed to the source-filter model, UB spectral magnitudes and UB phases can be estimated right away using sum-product networks (SPMs) [29], DNNs [30], [31], or recurrent neural networks (RNNs) [32], which can then be transformed back to the time domain by an overlap-add (OLA) structure. In several studies, an increased speech quality when using ABE solutions was shown [18], [33].…”
Section: Introductionmentioning
confidence: 99%
“…Early works used signal processing methods such as a source-filter model [23], [24], nonlinear devices [19], or spectral band replication [25]. Other approaches were based on data-driven techniques, such as Gaussian mixture models [26], [27], hidden Markov models [28], or shallow neural networks [29], [30].…”
Section: A Audio Bandwidth Extensionmentioning
confidence: 99%
“…Early works in audio bandwidth extension focused on speech signals and employed diverse signal processing methods, including source-filter models [13], [14], and codebook mapping [15]. The first attempts at music audio bandwidth extension used nonlinear devices [16] and spectral band replication [17].…”
Section: A Audio Bandwidth Extension and Super-resolutionmentioning
confidence: 99%