2022
DOI: 10.48550/arxiv.2204.08125
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

FedKL: Tackling Data Heterogeneity in Federated Reinforcement Learning by Penalizing KL Divergence

Abstract: As a distributed learning paradigm, Federated Learning (FL) faces the communication bottleneck issue due to many rounds of model synchronization and aggregation. Heterogeneous data further deteriorates the situation by causing slow convergence. Although the impact of data heterogeneity on supervised FL has been widely studied, the related investigation for Federated Reinforcement Learning (FRL) is still in its infancy. In this paper, we first define the type and level of data heterogeneity for policy gradient … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…New insights for protecting against the non-uniformity introduced by data heterogeneity in FL as a solution for backdooring attacks were shown in [145]. FedKL [146] used federated reinforcement learning to tackle heterogeneity issues. Data problems and resource heterogeneity in FL were discussed in [147].…”
Section: ) Research On Handling Corrupted or Noisy Clients In Flmentioning
confidence: 99%
“…New insights for protecting against the non-uniformity introduced by data heterogeneity in FL as a solution for backdooring attacks were shown in [145]. FedKL [146] used federated reinforcement learning to tackle heterogeneity issues. Data problems and resource heterogeneity in FL were discussed in [147].…”
Section: ) Research On Handling Corrupted or Noisy Clients In Flmentioning
confidence: 99%
“…The authors define heterogeneity as different state transition functions among siloed clients in the federated system, while for each client the environment is homogeneous. In another more recent study by Xie & Song (2022), heterogeneity of initial state distribution and heterogeneity of environment dynamics are both taken into consideration. Our problem is also closely related to Hidden Parameter Markov Decision Processes (HiP-MDPs) (Doshi-Velez & Konidaris, 2016) which consider a set of closely related MDPs which can be fully specified with a bounded number of latent parameters.…”
Section: Related Workmentioning
confidence: 99%