International Conference on Acoustics, Speech, and Signal Processing
DOI: 10.1109/icassp.1989.266409
|View full text |Cite
|
Sign up to set email alerts
|

A diphone synthesis system based on time-domain prosodic modifications of speech

Abstract: This paper p-t s a new timedomain algorithm for text-to-speech synthesis using diphone concatenation. The algorithm is based on the pitch-synchronous overlap-add (PSO-LA) approach, and is capable of good quatity prosodic modifications of natural speech. The algorithm can be seen as a simplif'icatio~~ of a p i o u s algorithm combining the PSOLA approach and fquencydomain uansfonnations. On the other hand, it appears as a gcnemlhtion of previous timedomain methods that perform pitch-synchronous cut-and-splice o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 84 publications
(34 citation statements)
references
References 9 publications
0
33
0
Order By: Relevance
“…The coef®cients of the ®lter were used mainly to derive the noisy residual signal by inverse ®ltering. Retaining the waveform bells around the glottal closure produces good quality speech as was demonstrated in PSOLA based Text-to-Speech system (TTS) (Hamon et al, 1989).…”
Section: Nature Of Lp Residual Signalmentioning
confidence: 99%
“…The coef®cients of the ®lter were used mainly to derive the noisy residual signal by inverse ®ltering. Retaining the waveform bells around the glottal closure produces good quality speech as was demonstrated in PSOLA based Text-to-Speech system (TTS) (Hamon et al, 1989).…”
Section: Nature Of Lp Residual Signalmentioning
confidence: 99%
“…With the knowledge of the epochs, it is possible to determine the characteristics of the voice source by a careful analysis of the signal within a glottal pulse. The epochs can be used as pitch markers for prosody manipulation, which is useful in applications like text-to-speech synthesis, voice conversion and speech rate conversion [3], [4]. Knowledge of the epoch locations may be used for estimating the time-delay between speech signals collected over a pair of spatially distributed microphones [5].…”
Section: A Significance Of Epochs In Speech Analysismentioning
confidence: 99%
“…This glottal pulse shape has also a position in time c i . Speech processing techniques often define c i as glottal closure instants [48,49], or as energy local maxima of a residual signal [50,51], or as pitch pulse onsets [12,14,27] for centering windows and to synchronize instantaneous phase parameters. Even though such a definition is necessary for many approaches, we will show below that it is not necessary when using the Relative Phase Shift (RPS) [24,33] or PD, which avoids an extra estimation procedure and its potential misestimation errors.…”
Section: Theoretical Model Of the Instantaneous Phasementioning
confidence: 99%