1999
DOI: 10.1121/1.425343
|View full text |Cite
|
Sign up to set email alerts
|

A robust unit selection system for speech synthesis

Abstract: There has been interest for many years in diphone-based speech synthesis and, recently, a rapidly increasing interest in unit selection-based synthesis (as illustrated by interest in the CHATR system). The limits of both systems are well known. While intelligibility is generally very high for diphone-based systems, the resulting signals do not sound completely natural. This happens for several reasons, amongst them the limited number of phone variants present in a typical system, and the cost of concatenating … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
23
0

Year Published

2001
2001
2020
2020

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 45 publications
(26 citation statements)
references
References 5 publications
0
23
0
Order By: Relevance
“…Many TTS systems proposed by [1][2][3][4][5] have been implemented by using concatenative method based on different speech units and they can generate high quality synthesized speech. For Myanmar language, there has been considerable effort on speech processing in Myanmar natural language processing.…”
Section: Related Workmentioning
confidence: 99%
“…Many TTS systems proposed by [1][2][3][4][5] have been implemented by using concatenative method based on different speech units and they can generate high quality synthesized speech. For Myanmar language, there has been considerable effort on speech processing in Myanmar natural language processing.…”
Section: Related Workmentioning
confidence: 99%
“…[1]- [4] attempts to avoid or reduce spectral discontinuities in formants and spectral tilt by choosing the acoustic units from a large inventory. The selection is based, among other things, on the minimization of the distance between magnitude spectra (usually represented by cepstrum coefficients, or line spectrum frequencies) from the concatenated acoustic units.…”
mentioning
confidence: 99%
“…The general search method (Hunt and Black, 1996) has been refined (e.g. Conkie, 1999;Taylor, 2000;Bulyko and Ostendorf, 2001) and complemented by other procedures for specific tasks such as limited domain speech synthesis (Black and Lenzo, 2000).…”
Section: Introductionmentioning
confidence: 99%