2018
DOI: 10.1007/978-3-319-94042-7_3
|View full text |Cite
|
Sign up to set email alerts
|

ConvAI Dataset of Topic-Oriented Human-to-Chatbot Dialogues

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(11 citation statements)
references
References 2 publications
0
11
0
Order By: Relevance
“…The dataset comprises 8,650 unique and unconstrained conversations conducted between June and October 2021 with English-speaking users in the US. With a total of 346,554 turns and an average of 44 turns per conversation, the dataset is almost twice the size of the existing human chat corpus Con-vAI (Logacheva et al, 2018). Further, with a ratio of 1.1 conversations per user, the corpus significantly exceeds the number of unique users, compared to similar previous studies (Völkel et al, 2021;Porcheron et al, 2018;Völkel et al, 2020).…”
Section: Datasetmentioning
confidence: 95%
“…The dataset comprises 8,650 unique and unconstrained conversations conducted between June and October 2021 with English-speaking users in the US. With a total of 346,554 turns and an average of 44 turns per conversation, the dataset is almost twice the size of the existing human chat corpus Con-vAI (Logacheva et al, 2018). Further, with a ratio of 1.1 conversations per user, the corpus significantly exceeds the number of unique users, compared to similar previous studies (Völkel et al, 2021;Porcheron et al, 2018;Völkel et al, 2020).…”
Section: Datasetmentioning
confidence: 95%
“…Ideally, chatbots would be interactively evaluated, but due to the high cost, next utterance simulation is used as a surrogate. Although next utterance generation is a more artificial task, Logacheva et al (2018) observed a Pearson correlation of 0.6 between conversation-level and utterance-level ratings.…”
Section: Chatbot Evaluationmentioning
confidence: 99%
“…For simplicity, we refer to the conversation history of all chatbots nurtured on the LightBlue platform as the LightBlue Corpus. As shown in [65] 0.051 0.012 0.233 Twitter [66] 0.038 0.028 0.734 Cornell movie dialogues [67] 0.019 0.017 0.896…”
Section: Social Bondmentioning
confidence: 99%