2004
DOI: 10.1023/b:ijst.0000037070.31146.f9
|View full text |Cite
|
Sign up to set email alerts
|

Automatic User-Adaptive Speaking Rate Selection

Abstract: Today there are many services which provide information over the phone using a prerecorded or synthesized voice. These voices are invariant in speed. Humans giving information over the telephone, however, tend to adapt the speed of their presentation to suit the needs of the listener. This paper presents a preliminary model of this adaptation.In a corpus of simulated directory assistance dialogs the operator's speed in number-giving correlates with the speed of the user's initial response and with the user's s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0
1

Year Published

2004
2004
2014
2014

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(13 citation statements)
references
References 21 publications
(12 reference statements)
0
12
0
1
Order By: Relevance
“…Users' amicability for a machine increases when it adapts to their prosody (Suzuki and Katagiri, 2007). Ward and Nakagawa (2002) found for instance that a telephony system that adapts its speech rate with the users' is rated more favourably than those that do not.…”
Section: Functional Role In Social Interactionmentioning
confidence: 99%
See 1 more Smart Citation
“…Users' amicability for a machine increases when it adapts to their prosody (Suzuki and Katagiri, 2007). Ward and Nakagawa (2002) found for instance that a telephony system that adapts its speech rate with the users' is rated more favourably than those that do not.…”
Section: Functional Role In Social Interactionmentioning
confidence: 99%
“…These language technologies have use in a diverse range of fields including mobile communications (Ward and Nakagawa, 2002;Lu et al, 2011;Agarwal et al, 2011) internet search engines (Google, 2011;Apple Inc, 2011), games and assistive technologies developed for the elderly (Kleinberger et al, 2007) or communicatively impaired (Zhou et al, 2012). While these interactive systems can process the linguistic aspects of human communication, they are not yet capable of processing the important suprasegmental social information that is a pervasive part of human social interaction.…”
Section: Introductionmentioning
confidence: 99%
“…When people engage in conversation, they tailor their utterances to their conversational partners, whether these partners are other humans or computational systems (Brennan, 1991;Schober, 1998). This tailoring, or adaptation to the partner, has been shown to take place in all facets of human language use, including speaking rate and response delay Ward & Nakagawa, 2002), amplitude and prosodic range (Coulston, Oviatt, & Darves, 2002;McLemore, 1992), lexical and syntactic choice (Brennan, 1996;Kempen & Hoenkamp, 1987;Levelt & Kelter, 1982), choice and modality of referring expressions (Bell, Boye, Gustafson, & Wirn, 2000;Brennan & Clark, 1996;Garrod & Anderson, 1987;Schober, 1998) and in higher level discourse processes such as the selection of content and form for persuasive arguments and negotiation (Joshi, 1982;Joshi, Webber, & Weischedel, 1984;Mayberry & Golden, 1996;McGuire, 1968;Walker, 1996;Webber & Joshi, 1982). This adaptive behavior is based on a mental model or a user model of the conversational partner (Brennan & Clark, 1996;Levelt, 1989;Wahlster & Kobsa, 1989;Zukerman & Litman, 2001).…”
Section: Introductionmentioning
confidence: 99%
“…To reproduce an interaction with speech rate similarity, studies have focused on mechanisms to measure user speech rates (Takamaru, Hiroshige, Araki & Tochinai 2000) and to pace synthesized voices to match user speech rates (Iwase & Ward 1998;Ward & Nakagawa 2002). Another study argues that task structure rather than the partner's speech rate is the dominant factor in human conversation and determines the speaking rate (Ward & Mamidipally 2008).…”
Section: B Related Research In Spoken Dialogsmentioning
confidence: 99%