Speech synthesis applications have become ubiquitous, in navigation systems, digital assistants, or as screen readers and audio book readers. Despite their impact on the acceptability of the systems in which they are embedded, and despite the fact that different applications probably need different types of TTS voices, TTS evaluation is still largely treated as an isolated problem. Even though there is strong agreement among researchers that the mainstream approaches to Text-to-Speech (TTS) evaluation are often insufficient and may even be misleading, there exist few clear-cut suggestions as to (1) how TTS evaluations may be realistically improved on a large scale, and (2) how such improvements may lead to informed feedback for system developers and, ultimately, to better systems relying on TTS. This paper reviews the current state of the art in TTS evaluation and suggests a novel user-centered research program for this area.
Conversational spoken dialogue systems that interact with the user rather than merely reading out text can be equipped with hesitations to manage dialogue flow and user attention. Based on a series of empirical studies, we elaborated a hesitation synthesis strategy for dialogue systems that inserts hesitations of a scalable extent wherever needed in the ongoing utterance. Previous evaluations of hesitation systems have shown that hesitations negatively affect synthesis quality, but that they improve interaction quality. We argue that, due to its conversational nature, hesitation synthesis needs interactive evaluation rather than traditional mean opinion score (MOS)-based questionnaires. To validate this claim, we evaluate our system's speech synthesis component in two ways: on the one hand, linked to the dialogue system evaluation, and on the other hand, in a traditional MOS study. We are thus able to analyze and discuss differences that arise due to the evaluation methodology. Our results suggest that MOS scales are not sufficient to assess speech synthesis quality, leading to implications for future research that are discussed in this paper. Furthermore, our results indicate that synthetic hesitations can increase task performance and that an elaborated hesitation strategy is necessary to avoid likability issues.
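As an illustration of the MOS-based side of such an evaluation, the sketch below aggregates 5-point listener ratings into per-condition mean opinion scores with approximate 95% confidence intervals. The condition names and ratings are hypothetical placeholders, not data from the study.

```python
# Minimal sketch (not the authors' code): aggregating 5-point listener ratings
# into per-condition MOS values with approximate 95% confidence intervals.
# Condition names and ratings below are hypothetical placeholders.
import math
import statistics

ratings = {
    "baseline_no_hesitation": [4, 4, 3, 5, 4, 3, 4],
    "with_hesitations":       [3, 3, 4, 2, 3, 4, 3],
}

for condition, scores in ratings.items():
    mos = statistics.mean(scores)
    # Standard error of the mean; 1.96 approximates the 95% normal quantile.
    sem = statistics.stdev(scores) / math.sqrt(len(scores))
    print(f"{condition}: MOS = {mos:.2f} ± {1.96 * sem:.2f}")
```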
In order to model hesitations for technical applications such as conversational speech synthesis, it is desirable to understand interactions between individual hesitation markers. In this study, we explore two markers that have been the subject of much discussion: silences and fillers. While it is generally acknowledged that fillers occur in two distinct forms, um and uh, there is no agreement on whether these forms systematically influence the length of associated silences. This question is investigated on a small dataset of spontaneous English speech, and a measure of the distance between filler and silence is introduced into the analysis. Results suggest that filler type systematically influences associated silence duration and that silences tend to gravitate towards fillers in utterances, exhibiting systematically shorter durations when preceding them. These results provide valuable insights for improving existing hesitation models.
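A minimal sketch of the kind of analysis described above: grouping silence durations by filler type (um vs. uh) and by whether the silence precedes or follows the filler. The toy observations and record layout are hypothetical; the study's actual corpus and distance measure are not reproduced here.

```python
# Illustrative sketch only: comparing durations of silences adjacent to
# "um" vs. "uh". Records are (filler_type, silence_position, duration_in_seconds);
# all values here are invented toy data.
from statistics import mean

observations = [
    ("um", "preceding", 0.12), ("um", "following", 0.48),
    ("uh", "preceding", 0.09), ("uh", "following", 0.21),
    ("um", "following", 0.55), ("uh", "preceding", 0.07),
]

for filler in ("um", "uh"):
    for position in ("preceding", "following"):
        durations = [d for f, p, d in observations if f == filler and p == position]
        if durations:
            print(f"{filler}, {position} silence: mean {mean(durations):.2f} s (n={len(durations)})")
```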
In this paper we present the implementation of a robot that hesitates dynamically based on the attention of its human interaction partner. To this end, we outline requirements for a real-time interaction scenario, describe the realization of a disfluency insertion strategy, and present observations from first tests of the system.
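A rough, hypothetical sketch of an attention-driven hesitation loop in the spirit of this idea: while delivering an utterance chunk by chunk, the system checks a listener-attention estimate and inserts a filler plus a short silence when attention drops. get_user_attention(), speak(), and the threshold value are placeholder assumptions, not the robot's actual interfaces.

```python
# Hypothetical sketch of an attention-driven hesitation strategy.
# get_user_attention() and speak() are placeholder interfaces, not a real robot API.
import time

ATTENTION_THRESHOLD = 0.5  # below this, the listener is assumed inattentive

def get_user_attention() -> float:
    """Placeholder: return estimated attention in [0, 1], e.g. from gaze tracking."""
    return 0.8

def speak(text: str) -> None:
    """Placeholder: hand the text to a TTS component."""
    print(f"[TTS] {text}")

def deliver_utterance(chunks: list[str]) -> None:
    for chunk in chunks:
        if get_user_attention() < ATTENTION_THRESHOLD:
            # Insert a scalable hesitation: a filler followed by a short silence,
            # giving the listener time to re-attend before the utterance continues.
            speak("um")
            time.sleep(0.5)
        speak(chunk)

deliver_utterance(["Please take", "the second door", "on the left."])
```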