2004 IEEE International Conference on Acoustics, Speech, and Signal Processing
DOI: 10.1109/icassp.2004.1325975
|View full text |Cite
|
Sign up to set email alerts
|

A scalable speech and audio coding scheme with continuous bitrate flexibility

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 3 publications
0
5
0
Order By: Relevance
“…The scalable codec uses an efficient frame erasure concealment algorithm [18] that can correct the effect of missing frames even in severe packet loss conditions. A more detailed description of the codec can be found in [19]. Coming back to the scalability issue, which is of interest for us in this paper, one of the main advantages of this codec is its high granularity.…”
Section: Scalable Audio Codingmentioning
confidence: 98%
“…The scalable codec uses an efficient frame erasure concealment algorithm [18] that can correct the effect of missing frames even in severe packet loss conditions. A more detailed description of the codec can be found in [19]. Coming back to the scalability issue, which is of interest for us in this paper, one of the main advantages of this codec is its high granularity.…”
Section: Scalable Audio Codingmentioning
confidence: 98%
“…The enhancement layers in the proposed scheme employ a hybrid harmonic/CELP scheme for highband coding. Since audible artifacts appear in high-band signals using only the CELP technique due to aliasing distortion (Kataoka et al 1989;Javovo et al 1997;Jung et al 2004), at present high-band coding generally employs transform-based methods (Ramprashad 1999;Kovesi et al 2004;You et al 2005). However, audible artifacts may sometimes appear due to frequency distortion, especially at lower bit rates.…”
Section: Introductionmentioning
confidence: 96%
“…Efforts to construct a wideband speech codec with an embedded structure based on applying a standard narrowband codec to a base core can be divided into two main approaches, according to the way the target signal is fed into the enhancement layer. In the first, the enhancement layer encodes the coding error in the base layer (Ramprashad 1998;Koishida et al 2000;Kim et al 2002;Dong and Gibson 2006), while in the second the previously split high-band signals are encoded (Kataoka et al 1989;Javovo et al 1997;Jung et al 2004;Ramprashad 1999;Kovesi et al 2004;You et al 2005). Scalability with the latter type of improvement is referred to as bandwidth scalable codec or sub-band codec.…”
Section: Introductionmentioning
confidence: 97%
“…The MDCT coefficients in each subband are vector quantized, using trained spherical codebooks that are embedded in size and composed of an union of permutation codes. The subbands are transmitted by order of perceptual importance as in [7].…”
Section: Predictive Mdct Coding (Layers Above 14 Kbit/s)mentioning
confidence: 99%
“…It also maps signals into a weighted domain similar to the CELP target domain used at 8 and 12 kbit/s. The MDCT encoder is a variant of the "TDAC coder" described in [6,7]. The difference y(k) = y1(k) − y2(k) is computed in the 0-3.4 Hz band, while the weighted original signal is taken (i.e.…”
Section: Predictive Mdct Coding (Layers Above 14 Kbit/s)mentioning
confidence: 99%