Abstract
Both Byzantine resilience and communication efficiency have attracted tremendous attention recently for their significance in edge federated learning. However, most existing algorithms may fail when dealing with real-world irregular data that behaves in a heavy-tailed manner. To address this issue, we study the stochastic convex and non-convex optimization problem for federated learning at edge and show how to handle heavy-tailed data while retaining the Byzantine resilience, communication efficiency and the optimal statistical error rates simultaneously. Specifically, we first present a Byzantine-resilient distributed gradient descent algorithm that can handle the heavy-tailed data and meanwhile converge under the standard assumptions. To reduce the communication overhead, we further propose another algorithm that incorporates gradient compression techniques to save communication costs during the learning process. Theoretical analysis shows that our algorithms achieve order-optimal statistical error rate in presence of Byzantine devices. Finally, we conduct extensive experiments on both synthetic and real-world datasets to verify the efficacy of our algorithms. © 2023 IEEE.
| Original language | English |
|---|---|
| Pages (from-to) | 2600-2614 |
| Journal | IEEE Transactions on Computers |
| Volume | 72 |
| Issue number | 9 |
| Online published | 15 Mar 2023 |
| DOIs | |
| Publication status | Published - Sept 2023 |
Funding
This work was supported in part by the National Key Research and Development Program of China Grant 2020YFB1005900, and in part by the National Natural Science Foundation of China (NSFC) under Grant 62122042.
Research Keywords
- Byzantine resilience
- communication efficiency
- Distributed databases
- edge intelligent systems
- Error analysis
- federated learning
- Heavily-tailed distribution
- Resilience
- Servers
- Training