2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)
DOI: 10.1109/asru.2003.1318442

Issues in the evaluation of spoken dialogue systems using objective and subjective measures

Abstract: This paper presents results and conclusions about the current evaluation methodologies for Spoken Dialogue Systems (SDS). The PARADISE paradigm, used for evaluation in the DARPA Communicator project, is briefly introduced and discussed through its application to the OVID home banking dialogue system. It is shown to provide results consistent with those obtained by the DARPA community, but a number of problems and limitations are pointed out. The issue of user attitude measures through questionnaires is discussed…

Citations: cited by 19 publications (14 citation statements)
References: 2 publications
“…confirming a purchase with a yes/no question), however this is not always possible and user responses can be noisy [8] which results in slower learning.…”
Section: Introduction (mentioning)
confidence: 96%
“…Recently, PARADISE was the subject of several investigations, among others (Aguilera et al, 2004; Larsen, 2003b; Paek, 2001; Whittaker, Terveen, & Nardi, 2000). The main limitation found was that tasks have to be clearly defined so that they can be described by an AVM.…”
Section: Discussion (mentioning)
confidence: 95%
“…These aspects can be used as the basis for usability evaluation strategies. Many frameworks and methodologies have been developed and used for evaluation of spoken dialogue systems in recent works [8,9,13,15,17,18,21,24,30].…”
Section: 12 (mentioning)
confidence: 99%