First European Conference on Speech Communication and Technology (Eurospeech 1989)
DOI: 10.21437/eurospeech.1989-172

Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones

Cited by 99 publications
(14 citation statements)
“…A phrase-level concatenative synthesizer was designed for the PEGASUS in much the same way as was done for the WHEELS system. For example, flight numbers can be synthesized from smaller constituents (e.g., 4695 can be synthesized from 3[4]7, 7[6]41, 8[9]2, and 56[5].) Multiple carrier phrases were designed for the speaking of estimated arrival and departure times, for example.…”
Section: Full Sentence Experiments; citation type: mentioning
confidence: 99%
“…This can be corrected after the fact by prosodic modification algorithms. An example of an algorithm which happens to operate in the time domain is the Time-Domain Pitch-Synchronous Overlap-and-Add algorithm (TD-PSOLA) [5,14].…”
Section: Introduction; citation type: mentioning
confidence: 99%
“…In the other end of the synthesizer continua we have the PSOLA type of method (Carpentier & Moulines, 1989). The algorithms are based on a pitch-synchronous overlap-add approach for modifying the speech prosody and concatenating diphone waveforms.…”
Section: Synthesizers and Control Parameters; citation type: mentioning
confidence: 99%
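
The pitch-synchronous overlap-add idea mentioned in the statement above can be illustrated with a minimal sketch. It is not the cited paper's implementation. It assumes a fully voiced signal with a constant pitch period and pitch marks that are already known; the names `hann` and `psola_resynthesize` are hypothetical. Two-period Hann-windowed segments centred on the analysis marks are overlap-added at re-spaced synthesis marks, which changes the pitch while roughly preserving duration.

```python
import math


def hann(n):
    """Periodic Hann window of length n (assumed window shape for this sketch)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def psola_resynthesize(signal, marks, period, factor):
    """Overlap-add two-period segments centred on pitch marks.

    signal: list of samples; marks: sorted sample indices of pitch marks;
    period: constant pitch period in samples (a simplifying assumption);
    factor: pitch scaling (2.0 raises pitch by an octave, 1.0 leaves it as is).
    """
    new_period = max(1, round(period / factor))
    window = hann(2 * period)
    out = [0.0] * len(signal)
    t = marks[0]
    while t <= marks[-1]:
        # Pick the analysis mark closest to the current synthesis mark.
        src = min(marks, key=lambda m: abs(m - t))
        for k in range(2 * period):
            si = src - period + k   # index into the input segment
            oi = t - period + k     # index into the output
            if 0 <= si < len(signal) and 0 <= oi < len(out):
                out[oi] += signal[si] * window[k]
        t += new_period
    return out
```

With `factor` equal to 1.0 and evenly spaced marks, the overlapping Hann windows sum to one between the first and last mark, so the input is reproduced there. Practical systems also need pitch-mark estimation, time-varying periods, and handling of unvoiced speech, none of which is shown here.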
“…This is because a previous study reported that PSOLA conducted directly on speech waveforms sometimes causes spectral distortion and leads to the speech quality degradation [1]. And this spectral distortion is thought to be able to be suppressed by doing the re-arrangement on source waveforms [4]. In our previous study [3], source waveforms were obtained by using LMA (Log Magnitude Approximation) inverse filter, which was designed only by using cepstrum coefficients and could precisely approximate magnitude characteristics of vocal tract in a logarithmic scale [5].…”
Section: Introduction; citation type: mentioning
confidence: 99%