2021
DOI: 10.48550/arxiv.2111.01398
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Integrating Pretrained Language Model for Dialogue Policy Learning

Abstract: Reinforcement Learning (RL) has been witnessed its potential for training a dialogue policy agent towards maximizing the accumulated rewards given from users. However, the reward can be very sparse for it is usually only provided at the end of a dialog session, which causes unaffordable interaction requirements for an acceptable dialog agent. Distinguished from many efforts dedicated to optimizing the policy and recovering the reward alternatively which suffers from easily getting stuck in local optima and mod… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 18 publications
(33 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?