Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen 2019
DOI: 10.18653/v1/d19-3014
|View full text |Cite
|
Sign up to set email alerts
|

Gunrock: A Social Bot for Complex and Engaging Long Conversations

Abstract: Gunrock is the winner of the 2018 Amazon Alexa Prize, as evaluated by coherence and engagement from both real users and Amazonselected expert conversationalists. We focus on understanding complex sentences and having in-depth conversations in open domains. In this paper, we introduce some innovative system designs and related validation analysis. Overall, we found that users produce longer sentences to Gunrock, which are directly related to users' engagement (e.g., ratings, number of turns). Additionally, user… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
15
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 15 publications
(17 citation statements)
references
References 9 publications
0
15
0
Order By: Relevance
“…Different from previous empirical observations, we conduct a large-scale quantitative and qualitative data analysis of Likert score based ratings. To address the issue of Likert scores, the Alexa team proposed a rule-based ensemble of turn-granularity expert ratings (Yi et al, 2019), and automatic metrics like topical diversity and conversational breadth. ACUTE-EVAL ) makes a small-scale attempt to use multi-turn pair-wise comparison to rank different chatbots.…”
Section: Related Workmentioning
confidence: 99%
“…Different from previous empirical observations, we conduct a large-scale quantitative and qualitative data analysis of Likert score based ratings. To address the issue of Likert scores, the Alexa team proposed a rule-based ensemble of turn-granularity expert ratings (Yi et al, 2019), and automatic metrics like topical diversity and conversational breadth. ACUTE-EVAL ) makes a small-scale attempt to use multi-turn pair-wise comparison to rank different chatbots.…”
Section: Related Workmentioning
confidence: 99%
“…Spoken Language Understanding (SLU) 1 is at the front-end of many modern intelligent home devices, virtual assistants, and socialbots [1,2]: given a spoken command, an SLU engine should extract relevant semantics 2 from spoken commands for the appropriate downstream tasks. Since SLU tasks such as the Airline Travel Information System (ATIS) [4], the field has progressed from knowledge-based [5] to data-driven approaches, notably those based on neural networks.…”
Section: Introductionmentioning
confidence: 99%
“…The fourth author contributed to the work before joining Amazon. 1 SLU typically consists of Automatic Speech Recognition (ASR) and Natural Language Understanding (NLU). ASR maps audio to text, and NLU maps text to semantics.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Building on the idea of attention-based seq2seq models (Vaswani et al, 2017), recent language models such as BERT (Devlin et al, 2019) and GPT-2 (Radford et al, 2019) enable neural conversational models to generate responses that appear human-like and engaging (Yu et al, 2019). A closer look, however, reveals that the lack of long-term memory to represent consistent (world) knowledge and personality over multiple speaker turns can lead to incoherent content being generated (Li et al, 2016;Serban et al, 2017).…”
Section: Introductionmentioning
confidence: 99%