Bayesian update of dialogue state for robust dialogue systems

Thomson, Blaise; Schatzmann, Jost; Young, Steve

doi:10.1109/icassp.2008.4518765

Cited by 35 publications

(26 citation statements)

References 7 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This highlights the importance of effective and efficient mechanisms for dialogue history tracking. Future research can incorporate beliefs into the knowledge rich-states of the proposed framework with ideas from approaches such as regression methods , POMDPs (Williams, 2006), or Bayesian updates (Thomson et al, 2008).…”

Section: Discussionmentioning

confidence: 99%

Evaluation of a hierarchical reinforcement learning spoken dialogue system

Cuayáhuitl

Renals

Lemon

et al. 2010

Computer Speech & Language

View full text Add to dashboard Cite

Section: Discussionmentioning

confidence: 99%

Evaluation of a hierarchical reinforcement learning spoken dialogue system

Cuayáhuitl

Renals

Lemon

et al. 2010

Computer Speech & Language

View full text Add to dashboard Cite

“…All q f (τ j ) approximations are constrained to the Dirichlet distribution, with the parameters denoted by α f,j . The approximations for the other factors are fixed and the cavity distributions for the variables are defined as per (8). In the case of the discrete variables g t and g t−1 , the cavity distributions are computed by multiplying all factor approximations except forf .…”

Section: Expectation Propagationmentioning

confidence: 99%

“…Details can be found in [19]. The full algorithm operates by repeatedly choosing a factor to update, computing the cavity distributions in terms of the current approximations (8) and (9) and then updating the current approximating functions as per (10), (11) and (19). Similar to belief propagation, the process is repeated until changes in the approximating functions fall below a threshold.…”

Section: Expectation Propagationmentioning

confidence: 99%

“…Section 4 explains how a more general form of inference called Expectation Propagation(EP), which can be viewed as a form of expectation-maximisation, can be used for both belief tracking and parameter optimisation [5,6]. Section 5 explains how natural actor-critic reinforcement learning can be used to optimise the policy parameters P [7,8], and how with a simple extension, it can also be used to optimise the dialogue model parameters M [9]. Finally, section 6 addresses the problem of fast on-line policy optimisation using Gaussian processes as a non-parametric policy model [10,11,12].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Optimisation for POMDP-Based Spoken Dialogue Systems

Gašić

Jurčíček

Thomson

et al. 2012

Data-Driven Methods for Adaptive Spoken Dialogue Systems

Self Cite

View full text Add to dashboard Cite

“…In the Bayesian Update of Dialogue State (BUDS) system, the user's goal is further factored into conditionally independent slots. The resulting system is then modelled as a dynamic Bayesian network (Thomson et al, 2008). A similar approach is also developed in (Bui et al, 2007a;Bui et al, 2007b).…”

Section: Introductionmentioning

confidence: 99%

Training and evaluation of the HIS POMDP dialogue system in noise

Gašić

Keizer

Mairesse

et al. 2008

Proceedings of the 9th SIGdial Workshop on Discourse and Dialogue - SIGdial '08

View full text Add to dashboard Cite

This paper investigates the claim that a dialogue manager modelled as a Partially Observable Markov Decision Process (POMDP) can achieve improved robustness to noise compared to conventional state-based dialogue managers. Using the Hidden Information State (HIS) POMDP dialogue manager as an exemplar, and an MDP-based dialogue manager as a baseline, evaluation results are presented for both simulated and real dialogues in a Tourist Information Domain. The results on the simulated data show that the inherent ability to model uncertainty, allows the POMDP model to exploit alternative hypotheses from the speech understanding system. The results obtained from a user trial show that the HIS system with a trained policy performed significantly better than the MDP baseline.

show abstract

Bayesian update of dialogue state for robust dialogue systems

Cited by 35 publications

References 7 publications

Evaluation of a hierarchical reinforcement learning spoken dialogue system

Evaluation of a hierarchical reinforcement learning spoken dialogue system

Optimisation for POMDP-Based Spoken Dialogue Systems

Training and evaluation of the HIS POMDP dialogue system in noise

Contact Info

Product

Resources

About