Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL) 2014
DOI: 10.3115/v1/W14-4342

Extrinsic Evaluation of Dialog State Tracking and Predictive Metrics for Dialog Policy Optimization

Abstract: During the recent Dialog State Tracking Challenge (DSTC), a fundamental question was raised: “Would better performance in dialog state tracking translate to better performance of the optimized policy by reinforcement learning?” Also, during the challenge system evaluation, another nontrivial question arose: “Which evaluation metric and schedule would best predict improvement in overall dialog performance?” This paper aims to answer these questions by applying an off-policy reinforcement learning method to the …
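As a rough, hypothetical illustration of the off-policy evaluation idea the abstract invokes, the sketch below estimates a target dialog policy's expected return from a corpus of dialogs collected under a different behavior policy, using per-decision importance sampling. This is a generic textbook estimator, not the paper's specific method; `episodes`, `pi_target`, and `pi_behavior` are invented names.

```python
# Per-decision importance sampling: a standard off-policy estimator, shown
# here only to illustrate evaluating a dialog policy on a fixed corpus.
import numpy as np

def is_return(episodes, pi_target, pi_behavior, gamma=0.99):
    """episodes: list of trajectories, each a list of (state, action, reward).

    pi_target(s, a) and pi_behavior(s, a) return action probabilities.
    """
    estimates = []
    for ep in episodes:
        rho, g = 1.0, 0.0
        for t, (s, a, r) in enumerate(ep):
            rho *= pi_target(s, a) / pi_behavior(s, a)  # cumulative importance ratio
            g += (gamma ** t) * rho * r                 # reweighted discounted reward
        estimates.append(g)
    return float(np.mean(estimates))
```

Averaged over a held-out corpus, this yields an estimate of the reward the target policy would earn, without deploying it with real users.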

Cited by 12 publications (10 citation statements). References 12 publications.
“…Second, in DSTC2, the question of what to measure was posed differently, as "Which evaluation metric and schedule would best predict improvement in overall dialog performance?" (Lee, 2014). The author uses the data to optimize a reinforcement learning-based dialog manager, then runs a regression analysis to see which metrics are the best predictors of end-to-end dialog performance.…”
Section: Challenge Entries and Results (mentioning, confidence: 99%)
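For readers unfamiliar with the regression step described in this citation, a minimal synthetic sketch follows: fit a linear model from per-tracker metric scores to end-to-end dialog reward, then read off which metric carries the most predictive weight. The data, feature names, and coefficients here are fabricated purely for illustration.

```python
# Hypothetical regression of dialog performance on tracker evaluation metrics.
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
# Rows: candidate trackers; columns: synthetic metric scores,
# e.g. [accuracy, 1 - L2, ROC-based score].
X = rng.uniform(size=(20, 3))
y = 2.0 * X[:, 1] + 0.3 * X[:, 0] + rng.normal(scale=0.1, size=20)

model = LinearRegression().fit(X, y)
print("coefficients:", model.coef_)  # larger |coef| => stronger predictor
print("R^2:", model.score(X, y))
```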
“…This has had unforeseen benefits: first, the DSTC data now forms a sort of benchmark for the field, with groups continuing to report results on it after the challenge proper (Lee, 2013; Ma and Fosler-Lussier, 2014b; Zilka and Jurčíček, 2015; Fix and Frezza-Buet, 2015). In addition, the DSTC1-3 corpora have been used to examine which state tracking evaluation metrics correlate with dialog success (Lee, 2014), to perform detailed error analyses of state trackers (Smith, 2014), and for dialog act classification and SLU experimentation (Ma and Fosler-Lussier, 2014a; Ferreira et al., 2015). We encourage future challenges to continue this tradition.…”
Section: Features (mentioning, confidence: 99%)
“…In this paper, we use the L2 metric as the loss function since it is found to be the most influential for dialog system performance (Lee, 2014). The model is hence optimized to minimize the L2 loss function…”
Section: Optimization (mentioning, confidence: 99%)
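As a minimal sketch of the L2 metric this quote refers to, assuming it denotes the squared distance between the tracker's belief distribution over hypotheses and the one-hot ground truth (an assumption, not code from the cited work):

```python
# L2 loss between a predicted belief distribution and a one-hot label.
import numpy as np

def l2_loss(belief, true_idx):
    """belief: predicted distribution over hypotheses; true_idx: correct one."""
    target = np.zeros_like(belief)
    target[true_idx] = 1.0
    return float(np.sum((belief - target) ** 2))

print(l2_loss(np.array([0.7, 0.2, 0.1]), 0))  # 0.09 + 0.04 + 0.01 = 0.14
```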
“…A held-out dialog corpus is used as the testing set, and the estimated cumulative reward for the testing dialogs when following the target DM policy is used as the metric for performance. A similar approach has been taken in evaluating the effect of different dialog state trackers on the end-to-end performance of a DM [15]. The estimation of the Q-function is similar to Algorithm 2.…”
Section: On-Corpus DM Evaluation (mentioning, confidence: 99%)
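The Q-function estimation this citation mentions is not reproduced here; as a stand-in, the following is a tabular fitted-Q-evaluation sketch under stated assumptions (integer state and action ids, a deterministic target policy `pi`), not Algorithm 2 from the cited work.

```python
# Assumed sketch: tabular fitted Q evaluation on a fixed transition corpus.
import numpy as np

def fitted_q_evaluation(transitions, pi, n_states, n_actions,
                        gamma=0.99, iters=100):
    """transitions: list of (s, a, r, s_next, done); pi(s) -> action id."""
    q = np.zeros((n_states, n_actions))
    for _ in range(iters):
        q_new = np.zeros_like(q)
        counts = np.zeros_like(q)
        for s, a, r, s2, done in transitions:
            # Bootstrapped target: immediate reward plus the discounted value
            # of the action the target policy would take next.
            target = r if done else r + gamma * q[s2, pi(s2)]
            q_new[s, a] += target
            counts[s, a] += 1
        # Average targets where data exists; keep old values elsewhere.
        q = np.where(counts > 0, q_new / np.maximum(counts, 1), q)
    return q
```

Averaging `q[s0, pi(s0)]` over the initial states of the held-out dialogs then gives the estimated cumulative reward under the target policy.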