Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics - 1999
DOI: 10.3115/1034678.1034729
|View full text |Cite
|
Sign up to set email alerts
|

Automatic detection of poor speech recognition at the dialogue level

Abstract: The dialogue strategies used by a spoken dialogue system strongly influence performance and user satisfaction. An ideal system would not use a single fixed strategy, but would adapt to the circumstances at hand. To do so, a system must be able to identify dialogue properties that suggest adaptation. This paper focuses on identifying situations where the speech recognizer is performing poorly. We adopt a machine learning approach to learn rules from a dialogue corpus for identifying these situations. Our result… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2002
2002
2017
2017

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 42 publications
(23 citation statements)
references
References 14 publications
0
23
0
Order By: Relevance
“…Other studies (Hirschberg, Litman, and Swerts, 1999;Litman, Walker, and Kearns, 1999) reported that the ASR string is highly relevant in predicting recognition errors (which partly corresponds to our CTP task). In the study of (Hirschberg, Litman, and Swerts, 1999) the recognized string was the best performing isolated feature, yielding an error rate of 14.4% (85.6% accuracy).…”
Section: Non-prosodic Features In the Error Detection Taskmentioning
confidence: 99%
See 2 more Smart Citations
“…Other studies (Hirschberg, Litman, and Swerts, 1999;Litman, Walker, and Kearns, 1999) reported that the ASR string is highly relevant in predicting recognition errors (which partly corresponds to our CTP task). In the study of (Hirschberg, Litman, and Swerts, 1999) the recognized string was the best performing isolated feature, yielding an error rate of 14.4% (85.6% accuracy).…”
Section: Non-prosodic Features In the Error Detection Taskmentioning
confidence: 99%
“…Our experiments have indeed shown this discrepancy. (Litman, Walker, and Kearns, 1999) employ the ASR text feature as a setvalued lexical feature in RIPPER where it also turns out to be the most predictive feature in isolation (72% accuracy) for detecting poor speech recognition. It is noteworthy that for our task the ASR string feature is less beneficial.…”
Section: Non-prosodic Features In the Error Detection Taskmentioning
confidence: 99%
See 1 more Smart Citation
“…Strategies for correcting, rejecting, or changing misrecognized hypotheses have been proposed [29,30,31]. Prosodic features such as F0 perturbation, duration, and loudness were shown to significantly characterize failed recognition runs in terms of word-accuracy and conception-accuracy [32].…”
Section: Speech Correctionmentioning
confidence: 99%
“…To improve dialog performance, much effort has also been put on techniques to automatically detect errors during interaction. It has shown that during human machine dialog, there are sufficient cues for machines to automatically identify error conditions (Levow, 1998;Litman et al, 1999;Hirschberg et al, 2001;Walker et al, 2002). The awareness of erroneous situations can help systems make intelligent decisions about how to best guide human partners through the conversation and accomplish the tasks.…”
Section: Related Workmentioning
confidence: 99%