Learning Dialog Policies from Weak Demonstrations

Gordon-Hall, Gabriel; Gorinski, Philip John; Cohen, Shay B.

doi:10.18653/v1/2020.acl-main.129

Cited by 13 publications

(13 citation statements)

References 23 publications

(13 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…They train the proposed SDS using a network of DQN agents, which is similar to hierarchical DRL but with more flexibility for transitioning across dialogues domains. Another work-related to faster training is proposed by Gordon-Hall et al (2020), where the behaviour of RL agents is guided by expert demonstrations.…”

Section: Spoken Dialogue Systems (Sdss)mentioning

confidence: 99%

A survey on deep reinforcement learning for audio-based applications

Latif

Cuayáhuitl

Pervez³

et al. 2022

Artif Intell Rev

View full text Add to dashboard Cite

Deep reinforcement learning (DRL) is poised to revolutionise the field of artificial intelligence (AI) by endowing autonomous systems with high levels of understanding of the real world. Currently, deep learning (DL) is enabling DRL to effectively solve various intractable problems in various fields including computer vision, natural language processing, healthcare, robotics, to name a few. Most importantly, DRL algorithms are also being employed in audio signal processing to learn directly from speech, music and other sound signals in order to create audio-based autonomous systems that have many promising applications in the real world. In this article, we conduct a comprehensive survey on the progress of DRL in the audio domain by bringing together research studies across different but related areas in speech and music. We begin with an introduction to the general field of DL and reinforcement learning (RL), then progress to the main DRL methods and their applications in the audio domain. We conclude by presenting important challenges faced by audio-based DRL agents and by highlighting open areas for future research and investigation. The findings of this paper will guide researchers interested in DRL for the audio domain.

show abstract

Section: Spoken Dialogue Systems (Sdss)mentioning

confidence: 99%

A survey on deep reinforcement learning for audio-based applications

Latif

Cuayáhuitl

Pervez³

et al. 2022

Artif Intell Rev

View full text Add to dashboard Cite

show abstract

“…[Nishimoto and Reali Costa 2019] extended the first work by showing that a good balance in exploration and exploitation during training can significantly improve the performance. Some other recent works also used the classical DQN algorithm to train the policy [Gordon-Hall et al 2020, Wang et al 2020, showing that despite simple, this algorithm can provide good results [Mo et al 2018] and [Weisz et al 2018] tried out other RL algorithms to model the DM, such as SARSA and actor-critic, respectively. Finally, [Saha et al 2020] proposed a hierarchical deep reinforcement learning approach to deal with more complex dialogue systems and [Takanobu et al 2019] proposed a method to learn the reward and optimize the policy jointly.…”

Section: Related Workmentioning

confidence: 99%

“…However, it is more complicated and needs a lot of labeled data collected from experts. [Gordon-Hall et al 2020] proposed the Deep Q-learning from Demonstrations (DQfD), which uses expert demonstrators in a weakly supervised fashion.…”

Section: Related Workmentioning

confidence: 99%

“…In order to compare the results to a baseline, we adopt the same parameters' values from [Nishimoto and Reali Costa 2019] work. Although its a simple algorithm, there are still many works that use DQN (or some variant) to train the DM [Li et al 2017, Nishimoto and Reali Costa 2019, Gordon-Hall et al 2020, Wang et al 2020. For this reason we used the DQN algorithm and focused in improving the warm-up phase.…”

Section: Reinforcement Learning In Dialogue Systemsmentioning

confidence: 99%

“…End-to-End User Many works focus in the pipeline architecture, specially in the policy component of the DM module. In brief, they employ rule-based DM, supervised learning [Vlasov et al 2019, Hosseini-Asl et al 2020; and reinforcement learning [Saha et al 2020, Wang et al 2020, Gordon-Hall et al 2020.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Enhancing Designer Knowledge to Dialogue Management: A Comparison between Supervised and Reinforcement Learning Approaches

Nishimoto¹,

Cristo²,

Mansano³

et al. 2022

Anais Do XIX Encontro Nacional De Inteligência Artificial E Computacional (ENIAC 2022)

View full text Add to dashboard Cite

Task-oriented dialogue systems are complex natural language applications employed in various fields such as health care, sales assistance, and digital customer servicing. Although the literature suggests several approaches to managing this type of dialogue system, only a few of them compares the performance of different techniques. From this perspective, in this paper we present a comparison between supervised learning, using the transformer architecture, and reinforcement learning using two flavors of Deep Q-Learning (DQN) algorithms. Our experiments use the MultiWOZ dataset and a real-world digital customer service dataset, from which we show that integrating expert pre-defined rules with DQN allows outperforming supervised approaches. Additionally, we also propose a method to make better usage of the designer knowledge by improving how interactions collected in warm-up are used in training phase. Our results indicate a reduction in training time by preserving the designer’s knowledge, expressed as pre-defined rules in memory during the initial steps of the DQN training procedure.

show abstract

Slot Sharing Mechanism in Multi-domain Dialogue Systems

Nishimoto

Costa

2021

Intelligent Systems

View full text Add to dashboard Cite

Learning Dialog Policies from Weak Demonstrations

Cited by 13 publications

References 23 publications

A survey on deep reinforcement learning for audio-based applications

A survey on deep reinforcement learning for audio-based applications

Enhancing Designer Knowledge to Dialogue Management: A Comparison between Supervised and Reinforcement Learning Approaches

Slot Sharing Mechanism in Multi-domain Dialogue Systems

Contact Info

Product

Resources

About