2009 IEEE International Conference on Acoustics, Speech and Signal Processing
DOI: 10.1109/icassp.2009.4959545

Scalable superwideband extension for wideband coding

Abstract: Recent trends in speech and audio codec standardization include scalability and extending the signal bandwidth beyond wideband (WB) to superwideband (SWB). In this paper we introduce a SWB extension for the ITU-T G.718 WB codec. In the SWB extension the high frequency content is generated utilizing the quantized MDCT domain coefficients of the WB core, which enables low additional delay. The proposed implementation is scalable with 4 kbps layers. In the first layer two different coding modes are used depending…

Cited by 10 publications (11 citation statements)
References 9 publications
“…Carrying out spectrum replication in the Modified Discrete Cosine Transform domain (MDCT) forms an efficient bandwidth extension algorithm as described in [18]. Using MDCT makes it relatively easy to detect similar shapes and patterns in the spectrum for high quality replication.…”
Section: Super Wideband and Fullband (mentioning)
confidence: 99%
“…With suitable scaling applied for each selected subband, the high-frequency half can be reproduced from the low-frequency half without the need to transmit the actual high-frequency half of the signal. In [18], the search for the best match is done in two steps to facilitate an optimal match: first in the linear domain to match the spectral amplitude peaks and then in the logarithmic domain to provide a perceptually better match with the finer details of the spectral shape. Highly periodic tonal signals that have clear energy peaks would need an unrealistically high number of subbands for accurate spectrum replication.…”
Section: Super Wideband and Fullband (mentioning)
confidence: 99%
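
The two-step search in the statement above can be illustrated with a short sketch. This is not the cited paper's implementation: it assumes, for illustration only, that the linear-domain step shortlists low-band positions by normalized correlation and that the logarithmic-domain step picks among them by squared log-magnitude distance after energy matching. All function names, the `keep` parameter, and the energy-matching gain are hypothetical.

```python
import math


def _energy(x):
    """Sum of squared coefficients."""
    return sum(v * v for v in x)


def linear_shortlist(low, target, keep=3):
    """Step 1 (linear domain, assumed): rank low-band start positions by
    normalized correlation with the high-band subband, keep the best few."""
    n = len(target)
    scored = []
    for lag in range(len(low) - n + 1):
        seg = low[lag:lag + n]
        denom = math.sqrt(_energy(seg) * _energy(target)) or 1.0
        corr = sum(a * b for a, b in zip(seg, target)) / denom
        scored.append((corr, lag))
    scored.sort(reverse=True)
    return [lag for _, lag in scored[:keep]]


def _gain(seg, target, eps=1e-9):
    """Energy-matching scale factor (an assumption, not from the source)."""
    return math.sqrt(_energy(target) / (_energy(seg) + eps))


def log_refine(low, target, lags, eps=1e-9):
    """Step 2 (log domain, assumed): among shortlisted positions, choose the
    one with the smallest squared log-magnitude distance after scaling."""
    n = len(target)

    def log_dist(lag):
        seg = low[lag:lag + n]
        g = _gain(seg, target, eps)
        return sum(
            (math.log10(abs(t) + eps) - math.log10(abs(g * s) + eps)) ** 2
            for s, t in zip(seg, target)
        )

    return min(lags, key=log_dist)


def encode_subband(low, target, keep=3):
    """Return (position, gain): the only side information in this sketch."""
    lag = log_refine(low, target, linear_shortlist(low, target, keep))
    return lag, _gain(low[lag:lag + len(target)], target)


def decode_subband(low, lag, gain, n):
    """Rebuild the high-band subband by copying and scaling low-band data."""
    return [gain * v for v in low[lag:lag + n]]
```

In this sketch only the position and the gain would be transmitted per subband; the decoder copies coefficients it already has from the low band, which matches the statement's point that the high-frequency half itself need not be sent.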
“…However, for general signals, using the MDCT for signal analysis [5], [14], such as evaluating energy, delay, and correlations, is still not straightforward.…”
Section: Modified Discrete Cosine Transform (mentioning)
confidence: 99%
“…SWB references were produced with the G.729.1 Annex E at 36 kbit/s. The core layer of G.729.1 Annex E with the bit rate of 32 kbit/s is fully identical to ITU-T G.729.1 to reproduce the WB audio signals, and the SWB extension layer with the additional bit rate of 4 kbit/s adopts a two-mode BWE method [34] in the modified discrete cosine transform (MDCT) domain to extend the bandwidth of the reproduced audio to 14 kHz. The mode selection is done by estimating the tonality of the input audio signals.…”
Section: Test Data and Reference Methods (mentioning)
confidence: 99%