2005
DOI: 10.1121/1.1843171
|View full text |Cite
|
Sign up to set email alerts
|

Use of acoustic prior information for confidence measure in ASR (automatic speech recognition) applications

Abstract: Abstract:In this paper, a new acoustic confidence measure of automatic speech recognition hypothesis is proposed and it is compared to approaches proposed in the literature. This approach takes into account prior information on the acoustic model performance specific to each phoneme. The new method is tested on two types of recognition errors: the out-of-vocabulary words and the errors due to additive noise. An efficient way to interpret the raw confidence measure as a correctness prior probability is also pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
4
0

Year Published

2006
2006
2021
2021

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 13 publications
(8 reference statements)
1
4
0
Order By: Relevance
“…In This result is quite similar to that obtained using the average CM (6). The advantage of this technique, however, is that the threshold has a meaningful interpretation and is easily adjustable according to the application requirements.…”
Section: Normalized Confidence Measuresupporting
confidence: 77%
“…In This result is quite similar to that obtained using the average CM (6). The advantage of this technique, however, is that the threshold has a meaningful interpretation and is easily adjustable according to the application requirements.…”
Section: Normalized Confidence Measuresupporting
confidence: 77%
“…To avoid substantial variance in the DNN output, we use the confidence measurement [ 34 ] to analyze the output of the DNN-based NC. Based on the confidence measurement score, a threshold is used to determine the classification results.…”
Section: Methodsmentioning
confidence: 99%
“…Typical spoken language systems consist of two main sub-systems: an ASR (automatic speech recognition) front-end, which generates a recognition hypothesis (or N-best list of recognition hypotheses) for each input utterance, and an NLP (natural language processing) back-end, which performs semantic understanding, dialogue management, and response generation. While conventional approaches [8]- [15] generate confidence measures based on the information obtained during decoding in the ASR front-end, this paper focuses on the incorporation of "high-level " knowledge sources from the back-end system.…”
Section: Proposed Confidence Scoring Frameworkmentioning
confidence: 99%
“…Explicit model-based schemes [10]- [12] compare the candidate model to a competing model (an anti-model, background model or set of cohort models) via a likelihood ratio test. Posterior probability-based approaches, including [13]- [15], estimate the posterior probability of a recognized entity (word or utterance) considering all competing hypotheses (typically in an N-best list or word graph).…”
Section: Introductionmentioning
confidence: 99%