2021
DOI: 10.48550/arxiv.2111.14655
Preprint

FedHM: Efficient Federated Learning for Heterogeneous Models via Low-rank Factorization

Abstract: The underlying assumption of recent federated learning (FL) paradigms is that local models usually share the same network architecture as the global model, which becomes impractical for mobile and IoT devices with different setups of hardware and infrastructure. A scalable federated learning framework should address heterogeneous clients equipped with different computation and communication capabilities. To this end, this paper proposes FedHM, a novel federated model compression framework that distributes the …

Cited by 6 publications (11 citation statements)
References 15 publications

Citation statements (ordered by relevance):
“…The major difference to its use for inference is that here, the low-rank NN is updated during training, and the low-rank updates are applied to the full model on the server. Yao et al [101] present FedHM, where they create low-complexity submodels on the server by doing a low-rank factorization of the full model. Layer parameters with dimensions 𝑚 × 𝑛 are decomposed into two matrices with dimensions 𝑚 × 𝑟 and 𝑟 × 𝑛.…”
Section: NN Architecture Heterogeneity Based on FedAvg
confidence: 99%
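The quoted passage describes the core mechanism: each m × n layer is replaced by two factors of sizes m × r and r × n, so the parameter count drops from m·n to r·(m + n). As an illustration only, the sketch below uses truncated SVD to obtain such factors; the function name and the choice of SVD are assumptions made for this example, not details taken from the FedHM paper.

    import numpy as np

    def low_rank_factorize(W, r):
        """Approximate an m x n matrix W by U (m x r) @ V (r x n).

        Truncated SVD is one common way to build such factors; the
        concrete scheme used by FedHM may differ (illustrative sketch).
        """
        U, s, Vt = np.linalg.svd(W, full_matrices=False)
        U_r = U[:, :r] * np.sqrt(s[:r])          # shape (m, r)
        V_r = np.sqrt(s[:r])[:, None] * Vt[:r]   # shape (r, n)
        return U_r, V_r

    # Example: a 512 x 256 layer compressed with rank r = 32.
    m, n, r = 512, 256, 32
    W = np.random.randn(m, n).astype(np.float32)
    U_r, V_r = low_rank_factorize(W, r)

    print(m * n)                          # 131072 full-rank parameters
    print(r * (m + n))                    # 24576 low-rank parameters (~5.3x fewer)
    print(np.linalg.norm(W - U_r @ V_r))  # reconstruction error

Because the server keeps the full-rank model, it can rebuild U_r @ V_r from a client's update and fold it back in, consistent with the statement above that low-rank updates are applied to the full model on the server.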
“…The attributes scale and granularity are often neglected, are hidden behind the technique, and lack discussion in the papers. The reported scale in the resources supported by the techniques ranges from 4× − 25× [12,41,52,61,71,77,79,80,85,101] up to 100× − 250× [25,87], yet it remains unclear whether training at such high scales is still effective. Hence, while all approaches show the effectiveness of their solution in certain scenarios, it often remains unclear whether devices with low resources or stale devices can make a meaningful contribution that advances the global model.…”
Section: Open Problems and Future Directions
confidence: 99%
“…Such edge devices are typically resource-constrained, e.g., the computing, communication, and memory capacities are limited. Several research efforts have been conducted to enhance the computation and communication efficiency of cross-device FL via model updates sparsification, quantization, and low-rank factorization [25,51,56,21]. Training deep neural networks requires high memory consumption [47].…”
Section: Related Work
confidence: 99%
“…As such, lately, there has been a line of work focusing on this very problem, where the discrepancy between the dynamics of different clients affects the convergence rate or fairness of the deployed system. Specifically, such solutions draw from efficient ML and attempt to dynamically alter the footprint of local models by means of structured (PruneFL [37]), unstructured (Adaptive Federated Dropout [13]) or importance-based pruning (FjORD [33]), quantisation (AQFL [2]), low-rank factorisation (FedHM [76]), sparsity-inducing training (ZeroFL [63]) or distillation (GKT [31]). However, each approach has limitations, either because they involve extra training overhead [31] and residence of multiple DNN copies in memory [2], or because they require specialised hardware for performance gains ([63,13]).…”
Section: Related Work
confidence: 99%
“…However, each approach has limitations, either because they involve extra training overhead [31] and residence of multiple DNN copies in memory [2], or because they require specialised hardware for performance gains ( [63,13]). Last, some of the architectural changes proposed may not offer the degrees of freedom that NAS exposes [33,76].…”
Section: Related Work
confidence: 99%