2022
DOI: 10.1121/10.0010274
|View full text |Cite
|
Sign up to set email alerts
|

The clear speech intelligibility benefit for text-to-speech voices: Effects of speaking style and visual guise

Abstract: This study examined how speaking style and guise influence the intelligibility of text-to-speech (TTS) and naturally produced human voices. Results showed that TTS voices were less intelligible overall. Although using a clear speech style improved intelligibility for both human and TTS voices (using “newscaster” neural TTS), the clear speech effect was stronger for TTS voices. Finally, a visual device guise decreased intelligibility, regardless of voice type. The results suggest that both speaking style and vi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

1
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 17 publications
(20 citation statements)
references
References 20 publications
1
11
0
Order By: Relevance
“…This availability of tools to leverage AI speech could facilitate new advances in research and clinical disciplines, including hearing science, audiology, and speech pathology, in which creating and norming human-spoken speech materials for diagnostic and rehabilitative purposes is an ongoing challenge, and could potentially be substituted by AI speech synthesis. However, research thus far has focused on technology development towards human-like voices and the degree to which AI speech is perceived by young, normal-hearing listeners (e.g., Govender et al, 2019b;Govender et al, 2019a;Aoki et al, 2022). How older adults perceive and experience AI speech is unclear and topic of the current study.…”
Section: Introduction Introduction Introduction Introductionmentioning
confidence: 95%
See 4 more Smart Citations
“…This availability of tools to leverage AI speech could facilitate new advances in research and clinical disciplines, including hearing science, audiology, and speech pathology, in which creating and norming human-spoken speech materials for diagnostic and rehabilitative purposes is an ongoing challenge, and could potentially be substituted by AI speech synthesis. However, research thus far has focused on technology development towards human-like voices and the degree to which AI speech is perceived by young, normal-hearing listeners (e.g., Govender et al, 2019b;Govender et al, 2019a;Aoki et al, 2022). How older adults perceive and experience AI speech is unclear and topic of the current study.…”
Section: Introduction Introduction Introduction Introductionmentioning
confidence: 95%
“…A few recent studies have compared perception of human speech to AI-based synthesized speech in a variety of contexts Aoki et al, 2022). For example, studies have investigated the experienced valence of human vs AI speech or how humans produce speech when talking to voice-AI systems compared to humans Zellou et al, 2021;Cohn et al, 2022).…”
Section: Introduction Introduction Introduction Introductionmentioning
confidence: 99%
See 3 more Smart Citations