1997
DOI: 10.1121/1.420883
|View full text |Cite
|
Sign up to set email alerts
|

Voice selection for speech synthesis

Abstract: A TTS voice quality experiment was conducted to select a speaker and to evaluate synthesis techniques. Small-scale TTS diphone inventories using six professional female speakers who were pre-selected in an audition were recorded. Two types of inventories were recorded for each speaker: a series of nonsense words and a series of English sentences. Using these 12 inventories, two synthesis methods were compared: PSOLA [Charpentier and Moulines, Eurospeech ’89] and Harmonic Plus Noise (HNM) [Stylianou et al., Eur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

1999
1999
2005
2005

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…Formal listening tests were conducted throughout the research and development phase of Next-Gen TTS. We believe that selecting the voice for rendering the many hours of inventory speech was the most critical decision [7]. We also have identified acoustic correlates of listener ratings relevant to speaker selection [8].…”
Section: Discussionmentioning
confidence: 99%
“…Formal listening tests were conducted throughout the research and development phase of Next-Gen TTS. We believe that selecting the voice for rendering the many hours of inventory speech was the most critical decision [7]. We also have identified acoustic correlates of listener ratings relevant to speaker selection [8].…”
Section: Discussionmentioning
confidence: 99%