2012
DOI: 10.1109/jstsp.2012.2229965
|View full text |Cite
|
Sign up to set email alerts
|

Predicting User Satisfaction in Spoken Dialog System Evaluation With Collaborative Filtering

Abstract: Abstract-We propose a collaborative filtering (CF) model to predict user satisfaction in SDS evaluation. Inspired by the use of CF in recommendation systems, where a user's preference for a new item is assume to resemble that for similar items rated previously, we adapt the idea to predict user evaluations of unrated dialogs based on the ratings received by similar dialogs. Ratings of dialogs are gathered by crowdsourcing through Amazon Mechanical Turk. A reference baseline is provided by a linear regression m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0
1

Year Published

2014
2014
2020
2020

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(16 citation statements)
references
References 26 publications
(35 reference statements)
0
14
0
1
Order By: Relevance
“…Research into these questions is growing though [30], and will continue to given the natural progression towards multi-domain SDS [31,32,33]. A lot of work has looked at methods and metrics for evaluating SDS [34,35,36]. These have generally been considered as aids to system developers to experiment with design choices and recognise those that are leading to certain measures of good performance.…”
Section: Related Workmentioning
confidence: 99%
“…Research into these questions is growing though [30], and will continue to given the natural progression towards multi-domain SDS [31,32,33]. A lot of work has looked at methods and metrics for evaluating SDS [34,35,36]. These have generally been considered as aids to system developers to experiment with design choices and recognise those that are leading to certain measures of good performance.…”
Section: Related Workmentioning
confidence: 99%
“…One method for reward estimation is off-line learning with annotated data [121]. By taking the dialog utterances and intermediate annotations as input features, reward learning can be formulated as a supervised regression or classification task.…”
Section: User Goal Estimationmentioning
confidence: 99%
“…然而当系统真正与人进行交互的时候, 任务完成的程度是很难界定的, 不仅如此, 生成模型理论 上的有效性等一系列问题使得 PARADISE 的评价效果不尽如人意 [4] . 因此基于标注语料的数据驱动 型对话评价模型成为了一个被广泛讨论的方向: 2012 年有研究者提出用协同过滤的方法来实现对用 户反馈的表示 [5] ; 利用重塑反馈函数也可以起到加速对话策略学习的目的 [6] ; Ultes 与 Minker 等人 [7] 的研究发现专家满意度对于对话系统的回复成功率有很大的影响. 所有的这些方法和尝试都表明, 优 质的训练数据对于对话系统的生成结果是至关重要的.…”
Section: 任务型对话系统评价方法unclassified