1990
DOI: 10.1109/26.61370

Vector quantization of pitch information in Mandarin speech

Abstract: By taking advantage of the simple tone structure of pitch contours in Mandarin speech, pitch information is orthogonally transformed and vector quantized. An average bit rate of 0.78 bits/frame (34.67 bits/s) for voiced sounds was achieved.

Citations: cited by 76 publications (3 citation statements)
References: references 4 publications (2 reference statements)
“…is the observed log-F0 contour of the n-th syllable of an N-syllable word and is represented by the first four orthogonally-transformed parameters [4]; -1 -1…”
Section: Syllable F0 Contour Model
Citation type: mentioning
confidence: 99%
“…The example features used include the average value of the pitch within the syllable, the average of the absolute value of the pitch slope within the syllable, the range of the pitch within the syllable, the pitch reset across the boundary, and so on. In order to represent the shape of the pitch contour within a syllable, we also used the first four coefficients of the Legendre discrete polynomial expansion of the contour [8], for which the zero-th order coefficient represents the level of the contour, and the other three coefficients represent the key characteristics of the contour shape. A total of 16 pitch-related attributes were used here for each syllable boundary.…”
Section: Prosodic Features
Citation type: mentioning
confidence: 99%
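
The statement above describes representing a syllable's pitch contour by the first four coefficients of a discrete orthogonal polynomial expansion. A minimal sketch of that idea follows. It assumes a basis obtained by orthonormalizing the monomials 1, x, x^2, x^3 on evenly spaced sample points (via QR decomposition), which may differ in scaling from the basis used in the cited papers. The function names `discrete_orthonormal_basis`, `contour_to_coeffs`, and `coeffs_to_contour` are hypothetical and chosen only for illustration.

```python
import numpy as np

def discrete_orthonormal_basis(num_points, order=3):
    """Orthonormal polynomial basis (degrees 0..order) on evenly spaced points.

    Assumption: the basis is built by QR-orthonormalizing a Vandermonde
    matrix; the cited papers may use a different normalization.
    """
    x = np.linspace(0.0, 1.0, num_points)
    vander = np.vander(x, order + 1, increasing=True)  # columns: 1, x, x^2, ...
    q, r = np.linalg.qr(vander)
    # Flip signs so every basis polynomial has a positive leading coefficient.
    signs = np.sign(np.diag(r))
    return q * signs  # shape: (num_points, order + 1)

def contour_to_coeffs(log_f0, order=3):
    """Project a sampled log-F0 contour onto the first order+1 basis vectors."""
    log_f0 = np.asarray(log_f0, dtype=float)
    basis = discrete_orthonormal_basis(len(log_f0), order)
    return basis.T @ log_f0

def coeffs_to_contour(coeffs, num_points):
    """Rebuild a smoothed contour of num_points samples from the coefficients."""
    coeffs = np.asarray(coeffs, dtype=float)
    basis = discrete_orthonormal_basis(num_points, len(coeffs) - 1)
    return basis @ coeffs

if __name__ == "__main__":
    # Hypothetical rising contour (values in log Hz), for illustration only.
    contour = np.log(np.linspace(120.0, 180.0, 20))
    coeffs = contour_to_coeffs(contour)
    approx = coeffs_to_contour(coeffs, len(contour))
    print("coefficients:", coeffs)
    print("max reconstruction error:", np.max(np.abs(contour - approx)))
```

With this normalization the zeroth coefficient equals the contour mean times the square root of the number of samples, so it tracks the level of the contour, while the higher-order coefficients capture slope and curvature.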
“…For example, predictive quantization approach was used in [1], and in [2] the pitch values were coded using a shaped lattice quantizer. In [3], the exploitation of redundancies was taken one step further: the pitch information was orthogonally transformed and vector quantized by taking into account the tonal nature of Mandarin speech.…”
Section: Introduction
Citation type: mentioning
confidence: 99%
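
To illustrate the vector quantization step mentioned in the statement above, here is a minimal sketch of nearest-codeword quantization of transform-coefficient vectors, with a simple k-means style codebook trainer. The codebook size, the squared-error distortion measure, and the training procedure are all assumptions made for illustration; none of them is taken from the paper, and the names `quantize` and `train_codebook` are hypothetical.

```python
import numpy as np

def quantize(vectors, codebook):
    """Return the index of the nearest codeword (squared error) for each vector."""
    vectors = np.asarray(vectors, dtype=float)
    codebook = np.asarray(codebook, dtype=float)
    diffs = vectors[:, None, :] - codebook[None, :, :]
    dists = (diffs ** 2).sum(axis=2)  # shape: (num_vectors, codebook_size)
    return dists.argmin(axis=1)

def train_codebook(vectors, codebook_size, iterations=20, seed=0):
    """Train a codebook with plain k-means (an assumed stand-in procedure)."""
    vectors = np.asarray(vectors, dtype=float)
    rng = np.random.default_rng(seed)
    start = rng.choice(len(vectors), size=codebook_size, replace=False)
    codebook = vectors[start].copy()
    for _ in range(iterations):
        labels = quantize(vectors, codebook)
        for k in range(codebook_size):
            members = vectors[labels == k]
            if len(members) > 0:  # keep the old codeword if its cell is empty
                codebook[k] = members.mean(axis=0)
    return codebook

if __name__ == "__main__":
    # Synthetic 4-dimensional coefficient vectors, for illustration only.
    rng = np.random.default_rng(1)
    data = rng.normal(size=(200, 4))
    codebook = train_codebook(data, codebook_size=8)
    indices = quantize(data, codebook)
    # Each transmitted index costs log2(codebook_size) bits.
    print("bits per vector:", np.log2(len(codebook)))
    print("first indices:", indices[:10])
```

Only the codeword index is transmitted for each vector, so a codebook of size K costs log2(K) bits per vector before any further entropy coding.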