2024
DOI: 10.1109/access.2024.3376418
|View full text |Cite
|
Sign up to set email alerts
|

Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning

Xuecheng Niu,
Akinori Ito,
Takashi Nose

Abstract: Training task-oriented dialog agents based on reinforcement learning is time-consuming and requires a large number of interactions with real users. How to grasp dialog policy within limited dialog experiences remains an obstacle that makes the agent training process less efficient. In addition, most previous frameworks start training by randomly choosing training samples, which differs from the human learning method and hurts the efficiency and stability of training. Therefore, we propose Scheduled Curiosity-D… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 35 publications
(45 reference statements)
0
0
0
Order By: Relevance