Auto-Lambda: Disentangling Dynamic Task Relationships

Liu, Shikun; James, Stephen; Davison, Andrew J.; Johns, Edward J.

doi:10.48550/arxiv.2202.03091

Cited by 1 publication

(2 citation statements)

References 22 publications

“…To show the capability of incorporating prior knowledge for CAMRL, we perform extra experiments that formulate a new differentiable ranking loss for tasks where the relative magnitudes of the task difficulty are readily apparent in part. Specifically, we follow [36], [36], [26], [37], and [21] to obtain the public performance of existing state-of-the-arts methods for MT10, MT50, Atari, Ravens, and RLBench, respectively. Then we use the relative ranking of the public performance to formulate a new tanh-based differentiable ranking loss and incorporate it with Eq.…”

Section: Results On Gym-minigrid As Shown In Table (mentioning)

confidence: 99%

Curriculum-Based Asymmetric Multi-Task Reinforcement Learning

Huang, Shen, et al. 2023

IEEE Trans. Pattern Anal. Mach. Intell.

We introduce CAMRL, the first curriculum-based asymmetric multi-task learning (AMTL) algorithm for dealing with multiple reinforcement learning (RL) tasks together. To mitigate the negative influence of customizing the one-off training order in curriculum-based AMTL, CAMRL switches its training mode between parallel single-task RL and asymmetric multi-task RL (MTRL), according to an indicator regarding the training time, the overall performance, and the performance gap among tasks. To leverage the multi-sourced prior knowledge flexibly and to reduce negative transfer in AMTL, we customize a composite loss with multiple differentiable ranking functions and optimize the loss through alternating optimization and the Frank-Wolfe algorithm. The uncertainty-based automatic adjustment of hyper-parameters is also applied to eliminate the need for laborious hyper-parameter analysis during optimization. By optimizing the composite loss, CAMRL predicts the next training task and continuously revisits the transfer matrix and network weights. We have conducted experiments on a wide range of benchmarks in multi-task RL, covering Gym-minigrid, Meta-world, Atari video games, vision-based PyBullet tasks, and RLBench, to show the improvements of CAMRL over the corresponding single-task RL algorithm and state-of-the-art MTRL algorithms. The code is available at: https://github.com/huanghanchi/CAMRL.

“…RLBench: RLBench is a large-scale environment designed to speed up vision-guided manipulation research. We follow [21]…”

Section: Environments (mentioning)

confidence: 99%