Sparse-reward, long-horizon tasks are a major challenge for deep reinforcement learning algorithms. One of the key barriers is data inefficiency: even in simulation, training an agent usually takes weeks. In this study, a data-efficient training framework is proposed, in which a curriculum is designed for the agent in the simulation scenario. Different initial-state distributions are set at different stages of training so that the agent receives more informative rewards throughout the training process. To bridge the sim-to-real gap, the parameters of the output layer of the value-function network are fine-tuned. An experiment on UAV maneuver control is conducted within the proposed framework to verify that the method is more data-efficient. We demonstrate that the same data yields different data efficiency at different training stages.
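As a minimal sketch of the two ideas above, the snippet below illustrates (a) an initial-state curriculum whose sampling distribution widens with the training stage, and (b) freezing all but the output layer of a value network for sim-to-real fine-tuning. All names, the 4-D state, the layer sizes, and the linear widening schedule are hypothetical placeholders, not details taken from the paper.

```python
import numpy as np
import torch
import torch.nn as nn

rng = np.random.default_rng(0)

def sample_initial_state(stage: int, num_stages: int = 5) -> np.ndarray:
    """Sample an episode's initial state for the given curriculum stage.

    Early stages draw states from a narrow distribution (e.g., near the
    goal region), so the sparse reward is reached often; later stages
    widen the distribution toward the full task.
    """
    spread = (stage + 1) / num_stages            # grows linearly to 1.0
    return rng.uniform(-spread, spread, size=4)  # 4-D state is illustrative

# Hypothetical value network trained in simulation.
value_net = nn.Sequential(
    nn.Linear(4, 64), nn.ReLU(),
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, 1),  # output layer: the only part fine-tuned on real data
)

# Sim-to-real step: freeze everything except the output layer, then
# fine-tune only those parameters on real-world transitions.
for p in value_net.parameters():
    p.requires_grad = False
for p in value_net[-1].parameters():
    p.requires_grad = True

optimizer = torch.optim.Adam(
    (p for p in value_net.parameters() if p.requires_grad), lr=1e-4
)
```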