2019
DOI: 10.48550/arxiv.1901.05415
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning from Dialogue after Deployment: Feed Yourself, Chatbot!

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
43
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(43 citation statements)
references
References 0 publications
0
43
0
Order By: Relevance
“…We also build on the growing body of work that fine-tunes models with human feedback. This has been applied in many domains including summarization (Böhm et al, 2019;Ziegler et al, 2019;Stiennon et al, 2020), dialogue (Jaques et al, 2019;Yi et al, 2019;Hancock et al, 2019), translation (Kreutzer et al, 2018;Bahdanau et al, 2016), semantic parsing (Lawrence and Riezler, 2018), story generation (Zhou and Xu, 2020), review generation (Cho et al, 2018), and evidence extraction (Perez et al, 2019), and agents in simulated environments (Christiano et al, 2017;Ibarz et al, 2018).…”
Section: Related Workmentioning
confidence: 99%
“…We also build on the growing body of work that fine-tunes models with human feedback. This has been applied in many domains including summarization (Böhm et al, 2019;Ziegler et al, 2019;Stiennon et al, 2020), dialogue (Jaques et al, 2019;Yi et al, 2019;Hancock et al, 2019), translation (Kreutzer et al, 2018;Bahdanau et al, 2016), semantic parsing (Lawrence and Riezler, 2018), story generation (Zhou and Xu, 2020), review generation (Cho et al, 2018), and evidence extraction (Perez et al, 2019), and agents in simulated environments (Christiano et al, 2017;Ibarz et al, 2018).…”
Section: Related Workmentioning
confidence: 99%
“…For example, Liu et al (2018) collect dialogue corrections from users during deployment, while Li et al (2017) collect both binary explicit feedback and implicit natural language feedback. Also, Hancock et al (2019) propose a lifetime learning framework to improve chatbot performance. The chatbot is trained not only to generate dialogues but also to predict user satisfactions.…”
Section: Dialogue and Question Answeringmentioning
confidence: 99%
“…For example, for text classification, HITL improves classification accuracy (Smith et al, 2018;Jandot et al, 2016). Similarly, dialogue and question answering systems have higher ranking metric hits after adapting a HITL approach (Hancock et al, 2019;Brown et al, 2020). Researchers also find HITL improves model's robustness and generalization on different data (Stiennon et al, 2020;Jandot et al, 2016).…”
Section: Dialogue and Question Answeringmentioning
confidence: 99%
See 2 more Smart Citations