[Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing 1991
DOI: 10.1109/icassp.1991.150451

A segment-based approach to voice conversion

Citations: cited by 26 publications (14 citation statements)
References: 8 publications
“…In the technique, the phone units are used as the segments, and a mapping table is generated between triphones of the source and target speakers. Although this unit-selection-based approach significantly improved the conventional VQ-based frame-by-frame mapping one [11], it was also pointed out that a large amount of speech data of the source and target speakers must be prepared to achieve high conversion performance [10].…”
Section: Introduction (mentioning)
confidence: 99%
“…As a result, the conversion performance is not always satisfactory, and highly depends on the combination of the source and target speakers. To alleviate this problem, a segment-based voice conversion using unit selection is one of the effective approaches where the dynamic characteristics of speaker individuality is converted as well as the static ones [10]. In the technique, the phone units are used as the segments, and a mapping table is generated between triphones of the source and target speakers.…”
Section: Introduction (mentioning)
confidence: 99%
“…It has been documented that spectral conversion is a feasible technique for modifying articulation-related parameters of speech (Shikano et al, 1986;Abe et al, 1988;Nakamura and Shikano, 1989;Abe et al, 1990;Abe, 1991;Shikano et al, 1991;Valbret et al, 1992). Spectral conversion was originally used for speaker adaptation in speech recognition systems, where the spectral information is described by a codebook.…”
Section: Introduction (mentioning)
confidence: 99%
“…The technique of spectral conversion was also used in normal voice conversion systems (Abe et al, 1988;Abe et al, 1990;Abe, 1991). To accomplish the voice conversion, the spectral spaces of an input speaker and a target speaker were reduced to, and represented by two codebooks that were obtained using vector quantization (VQ) techniques.…”
Section: Introductionmentioning
confidence: 99%
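
The last statement describes representing each speaker's spectral space by a VQ codebook. The sketch below illustrates one generic way such a pair of codebooks can be turned into a frame-by-frame mapping; it is not taken from the cited papers. It assumes the source and target frames are already time-aligned one-to-one (the alignment step, for example by DTW, is omitted), uses plain k-means as a stand-in for the codebook design, and all function names are hypothetical.

```python
import numpy as np


def quantize(frames, codebook):
    """Return the index of the nearest codevector (Euclidean) for each frame."""
    dists = np.linalg.norm(frames[:, None, :] - codebook[None, :, :], axis=2)
    return dists.argmin(axis=1)


def train_codebook(frames, size, iters=20, seed=0):
    """Plain k-means codebook; an assumed stand-in for any VQ design method."""
    rng = np.random.default_rng(seed)
    codebook = frames[rng.choice(len(frames), size, replace=False)].copy()
    for _ in range(iters):
        labels = quantize(frames, codebook)
        for k in range(size):
            members = frames[labels == k]
            if len(members):  # leave an empty cell's codevector unchanged
                codebook[k] = members.mean(axis=0)
    return codebook


def mapping_codebook(src_frames, tgt_frames, src_cb, tgt_cb):
    """For each source codeword, average the target codevectors it co-occurs with.

    Assumes src_frames[i] and tgt_frames[i] are already time-aligned.
    """
    src_idx = quantize(src_frames, src_cb)
    tgt_idx = quantize(tgt_frames, tgt_cb)
    # Co-occurrence histogram between source and target codeword indices.
    hist = np.zeros((len(src_cb), len(tgt_cb)))
    np.add.at(hist, (src_idx, tgt_idx), 1.0)
    # Normalize each row to weights; rows with no counts stay all-zero.
    row_sums = hist.sum(axis=1, keepdims=True)
    weights = np.divide(hist, row_sums, out=np.zeros_like(hist), where=row_sums > 0)
    return weights @ tgt_cb


def convert(frames, src_cb, mapped_cb):
    """Replace each source frame by the mapped codevector of its nearest codeword."""
    return mapped_cb[quantize(frames, src_cb)]
```

Because every frame is replaced by one of a small number of codevectors independently of its neighbours, a mapping of this kind handles only static spectral characteristics, which is the limitation the segment-based statements above contrast against.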