2022
DOI: 10.48550/arxiv.2208.03270
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback

Abstract: Frozen models trained to mimic static datasets can never improve their performance. Models that can employ internet-retrieval for up-todate information and obtain feedback from humans during deployment provide the promise of both adapting to new information, and improving their performance. In this work we study how to improve internet-driven conversational skills in such a learning framework. We collect deployment data, which we make publicly available, of human interactions, and collect various types of huma… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(16 citation statements)
references
References 17 publications
(21 reference statements)
0
16
0
Order By: Relevance
“…In a companion paper (Xu et al, 2022b) a study is conducted of how to improve dialogue models that employ internet-retrieval through the use of human feedback. Obtaining feedback from humans during deployment provides the promise of both improved input distributions that match user's requirements, and corrections to model predictions for those inputs.…”
Section: What's the Best Methods To Learn From Feedback?mentioning
confidence: 99%
See 4 more Smart Citations
“…In a companion paper (Xu et al, 2022b) a study is conducted of how to improve dialogue models that employ internet-retrieval through the use of human feedback. Obtaining feedback from humans during deployment provides the promise of both improved input distributions that match user's requirements, and corrections to model predictions for those inputs.…”
Section: What's the Best Methods To Learn From Feedback?mentioning
confidence: 99%
“…Generate internet search query We use the WizInt dataset which contains human-authored search queries during crowdsourced dialogue turns to directly train the internet search query generation module in a supervised fashion. We also use the newly collected Feedback on Interactive Talk & Search (FITS) dataset 2 (Xu et al, 2022b) of internet-augmented conversational tasks in a similar manner.…”
Section: Fine-tuningmentioning
confidence: 99%
See 3 more Smart Citations