1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings
DOI: 10.1109/icassp.1996.541110
|View full text |Cite
|
Sign up to set email alerts
|

Unit selection in a concatenative speech synthesis system using a large speech database

Abstract: One approach to the generation of natural-sounding synthesized speech waveforms is to select and concatenate units from a large speech database. Units (in the current work, phonemes) are selected to produce a natural realisation of a target phoneme sequence predicted from text which is annotated with prosodic and phonetic context information. We propose that the units in a synthesis database can be considered as a state transition network in which the state occupancy cost is the distance between a database uni… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
474
0
26

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 779 publications
(501 citation statements)
references
References 4 publications
1
474
0
26
Order By: Relevance
“…Currently, unit selection [7] is still the underlying technique in most commercial systems. Since this requires large resources of well-recorded and labelled speech data to ensure the optimal unit coverage, its use is becoming less common for obtaining cheap and fast language portability, especially in the case of an under-resourced language.…”
Section: Related Workmentioning
confidence: 99%
“…Currently, unit selection [7] is still the underlying technique in most commercial systems. Since this requires large resources of well-recorded and labelled speech data to ensure the optimal unit coverage, its use is becoming less common for obtaining cheap and fast language portability, especially in the case of an under-resourced language.…”
Section: Related Workmentioning
confidence: 99%
“…A comparison between the Multisyn engine and two of the first unit selection implementations -CHATR (Hunt and Black, 1996) and Festival's "clunits" method (Black and Taylor, 1997)-is useful to clarify how Multisyn differs from other techniques.…”
Section: Comparison With Other Methodsmentioning
confidence: 99%
“…A full tutorial on unit selection speech synthesis is beyond the scope of this paper; we refer the reader to Hunt and Black (1996). However, we will define the terminology to be used in the rest of this paper.…”
Section: Unit Selection Speech Synthesismentioning
confidence: 99%
See 2 more Smart Citations