2022
DOI: 10.48550/arxiv.2202.03091
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Auto-Lambda: Disentangling Dynamic Task Relationships

Abstract: Understanding the structure of multiple related tasks allows for multi-task learning to improve the generalisation ability of one or all of them. However, it usually requires training each pairwise combination of tasks together in order to capture task relationships, at an extremely high computational cost. In this work, we learn task relationships via an automated weighting framework, named Auto-λ. Unlike previous methods where task relationships are assumed to be fixed, Auto-λ is a gradient-based meta learni… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 22 publications
0
2
0
Order By: Relevance
“…To show the capability of incorporating prior knowledge for CAMRL, we perform extra experiments that formulate a new differentiable ranking loss for tasks where the relative magnitudes of the task difficulty are readily apparent in part. Specifically, we follow [36], [36], [26], [37], and [21] to obtain the public performance of existing state-of-the-arts methods for MT10, MT50, Atari, Ravens, and RLBench, respectively. Then we use the relative ranking of the public performance to formulate a new tanh-based differentiable ranking loss and incorporate it with Eq.…”
Section: Results On Gym-minigrid As Shown In Tablementioning
confidence: 99%
See 1 more Smart Citation
“…To show the capability of incorporating prior knowledge for CAMRL, we perform extra experiments that formulate a new differentiable ranking loss for tasks where the relative magnitudes of the task difficulty are readily apparent in part. Specifically, we follow [36], [36], [26], [37], and [21] to obtain the public performance of existing state-of-the-arts methods for MT10, MT50, Atari, Ravens, and RLBench, respectively. Then we use the relative ranking of the public performance to formulate a new tanh-based differentiable ranking loss and incorporate it with Eq.…”
Section: Results On Gym-minigrid As Shown In Tablementioning
confidence: 99%
“…RLBench: RLBench is a large-scale environment designed to speed up vision-guided manipulation research. We follow [21]…”
Section: Environmentsmentioning
confidence: 99%