CoCo-FL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization

Pfeiffer, Kilian; Rapp, Martin; Khalili, Ramin; Henkel, Jörg

doi:10.48550/arxiv.2203.05468

Cited by 1 publication

(3 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In ZeroFL [79], dropout masks in combination with sparse convolutions are used to lower the computational complexity in training (FLOPS) and reduce the communication volume, although special hardware and software support is required to enable real-world gains. Lastly, in CoCoFL [77], a technique is presented that does not use subsets of an NN for training. Instead, only for some layers per round gradients get calculated while the remainder of the layers are frozen.…”

Section: Othersmentioning

confidence: 99%

“…The attributes scale and granularity are often neglected, are hidden behind the technique, and lack discussion in the papers. The reported scale in the resources supported by the techniques ranges from 4× − 25× [12,41,52,61,71,77,79,80,85,101] up to 100× − 250× [25,87], yet it remains unclear whether training at such high scales is still effective. Hence, while all approaches show the effectiveness of their solution in certain scenarios, it often remains unclear whether devices with low resources or stale devices can make a meaningful contribution that advances the global model.…”

Section: Open Problems and Future Directionsmentioning

confidence: 99%

“…the resource model. This is especially the case in techniques using subsets, where some model resources in terms of power usage [104], while others count the number of parameters [2,25,41,79], the number of multiplyaccumulate-operations [41,80], or the required training time [77]. Therefore, the supported scale of heterogeneity by the techniques is not comparable.…”

Section: Open Problems and Future Directionsmentioning

confidence: 99%

See 2 more Smart Citations

Federated Learning for Computationally Constrained Heterogeneous Devices: A Survey

et al. 2023

View full text Add to dashboard Cite

With an increasing number of smart devices like internet of things (IoT) devices deployed in the field, offloading training of neural networks (NNs) to a central server becomes more and more infeasible. Recent efforts to improve users’ privacy have led to on-device learning emerging as an alternative. However, a model trained only on a single device, using only local data, is unlikely to reach a high accuracy. Federated learning (FL) has been introduced as a solution, offering a privacy-preserving trade-off between communication overhead and model accuracy by sharing knowledge between devices but disclosing the devices’ private data. The applicability and the benefit of applying baseline FL are, however, limited in many relevant use cases due to the heterogeneity present in such environments. In this survey, we outline the heterogeneity challenges FL has to overcome to be widely applicable in real-world applications. We especially focus on the aspect of computation heterogeneity among the participating devices and provide a comprehensive overview of recent works on heterogeneity-aware FL. We discuss two groups: works that adapt the NN architecture and works that approach heterogeneity on a system level, covering Federated Averaging (FedAvg), distillation, and split learning-based approaches, as well as synchronous and asynchronous aggregation schemes.

show abstract

Section: Othersmentioning

confidence: 99%