The thoughtful elephant: strategies for spoken dialog systems

Souvignier, B.; Kellner, Aaron; Rueber, B.; Schramm, Hauke; Seide, Frank

doi:10.1109/89.817453

Cited by 47 publications

(19 citation statements)

References 25 publications

(28 reference statements)

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Although it has turned out to be a rather difficult task to beat the (almost) standard class/word n-grams (typically Ò ¾ or ¿), there has been a great deal of interest in grammar based language models [1]. A promising approach for limited domain applications is the use of semantically motivated phrase level stochastic context free grammars (SCFGs) to parse a sentence into a sequence of semantic tags which are further modeled using Ò-grams [2,9,10,3].…”

Section: Introductionmentioning

confidence: 99%

On Combining Language Models: Oracle Approach

Hacıoğlu¹,

Ward²

2001

View full text Add to dashboard Cite

In this paper, we address the problem of combining several language models (LMs). We find that simple interpolation methods, like log-linear and linear interpolation, improve the performance but fall short of the performance of an oracle. The oracle knows the reference word string and selects the word string with the best performance (typically, word or semantic error rate) from a list of word strings, where each word string has been obtained by using a different LM. Actually, the oracle acts like a dynamic combiner with hard decisions using the reference. We provide experimental results that clearly show the need for a dynamic language model combination to improve the performance further. We suggest a method that mimics the behavior of the oracle using a neural network or a decision tree. The method amounts to tagging LMs with confidence measures and picking the best hypothesis corresponding to the LM with the best confidence.

show abstract

Section: Introductionmentioning

confidence: 99%

On Combining Language Models: Oracle Approach

Hacıoğlu¹,

Ward²

2001

View full text Add to dashboard Cite

show abstract

“…The idea behind this approach is that well-understood parts of a sentence occur in most hypotheses of an N -best list, whereas for misrecognitions different candidates usually appear in different hypotheses. Thus, the effect of a recognition error is distributed over several competing hypotheses and does not result in a strong error reinforcement (Souvignier et al, 2000). A variation on the same concept was recently presented in (Gretter and Riccardi, 2001), based on exploiting sausages rather than N -best lists.…”

Section: Adaptation Datamentioning

confidence: 97%

Statistical language model adaptation: review and perspectives

Bellegarda

2004

Speech Communication

222

119

View full text Add to dashboard Cite

“…For example, sub-dialogues can be used for the confirmation of the understood concepts, error recovery, reduction or expansion of the scope of the user's request, clarification of the ambiguities, etc. [105]- [107]. The response to the user is finally formulated as sentences and produced as speech signals to be transmitted to the user.…”

Section: Spoken Dialoguementioning

confidence: 99%

Voice access of global information for broad-band wireless: technologies of today and challenges of tomorrow

Lee

2001

Proc. IEEE

View full text Add to dashboard Cite

The rapid development of the Internet and the World Wide Web has created a global network that will soon become a physical embodiment of the entire human knowledge and a complete integration of the global information activities. The traditional approach to access the network is through a computer physically tied to the network. As broad-band wireless takes off, the traditional tethered approach will gradually become obsolete. It is believed that one of the most natural and user-friendly approaches for accessing the network will be via human voice, and the integration of spoke language processing technologies with broad-band wireless technologies will be a key to the evolution of a broad-band wireless information community. This paper offers a vision of the above concept. Technical considerations and some typical example applications of accessing the information and services using voice in a broad-band wireless environment are discussed. Fundamentals of spoken language processing technologies that are crucial in such a broad-band wireless environment are briefly reviewed. Technical challenges caused by the unique nature of wireless mobile communications are also presented along with some possible solutions.

show abstract

The thoughtful elephant: strategies for spoken dialog systems

Cited by 47 publications

References 25 publications

On Combining Language Models: Oracle Approach

On Combining Language Models: Oracle Approach

Statistical language model adaptation: review and perspectives

Voice access of global information for broad-band wireless: technologies of today and challenges of tomorrow

Contact Info

Product

Resources

About