Abstract
This paper addresses the distributed online bandit linear regression problem with privacy protection, in which the training data are distributed across a multi-agent network. Each node selects a linear predictor to fit the training data and incurs a square loss in each round. The goal is to minimize the regret, defined as the difference between the accumulated loss of the online linear predictor and that of the optimal offline linear predictor. In addition, a differential privacy strategy is adopted to prevent an adversary from inferring the parameter vector of any node. Two efficient differentially private distributed online regression algorithms are developed, one for one-point and one for two-point bandit feedback. Our analysis shows that the developed algorithms achieve ϵ-differential privacy and attain regret upper bounds of O(K^{3/4}) and O(√K) for one-point and two-point bandit feedback, respectively, where K is the time horizon. We also show that there is a tradeoff between the privacy level of our algorithms and their convergence. Finally, the performance of the proposed algorithms is validated by a numerical example. © 2023 The Franklin Institute.
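The abstract does not spell out the update rules, so the following is only a minimal single-node sketch of the two ingredients it names: a two-point bandit gradient estimate of the square loss, and noise added to the parameter vector before it is shared. It assumes the textbook two-point estimator and Laplace perturbation, which may differ from the paper's actual construction. All function names and parameters (`two_point_gradient`, `perturb`, `delta`, `scale`) are hypothetical.

```python
import math
import random


def square_loss(w, x, y):
    """Square loss of the linear predictor w on one sample (x, y)."""
    pred = sum(wi * xi for wi, xi in zip(w, x))
    return (pred - y) ** 2


def random_unit_vector(d, rng):
    """Draw a direction uniformly from the unit sphere in R^d."""
    while True:
        g = [rng.gauss(0.0, 1.0) for _ in range(d)]
        norm = math.sqrt(sum(gi * gi for gi in g))
        if norm > 0.0:
            return [gi / norm for gi in g]


def two_point_gradient(w, x, y, delta, rng):
    """Estimate the loss gradient from two loss evaluations only.

    Assumed standard estimator (not taken from the paper):
    (d / (2 * delta)) * (f(w + delta*u) - f(w - delta*u)) * u.
    """
    d = len(w)
    u = random_unit_vector(d, rng)
    w_plus = [wi + delta * ui for wi, ui in zip(w, u)]
    w_minus = [wi - delta * ui for wi, ui in zip(w, u)]
    diff = square_loss(w_plus, x, y) - square_loss(w_minus, x, y)
    scale = d * diff / (2.0 * delta)
    return [scale * ui for ui in u]


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def perturb(w, scale, rng):
    """Add independent Laplace noise to each coordinate before sharing w.

    Assumption: Laplace perturbation is used here only as an illustration
    of a differentially private message; the noise scale is hypothetical.
    """
    return [wi + laplace_noise(scale, rng) for wi in w]
```

In this sketch a larger noise `scale` hides the shared vector better but distorts it more, which is one way to picture the privacy and convergence tradeoff mentioned in the abstract.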
| Original language | English |
|---|---|
| Pages (from-to) | 11736-11759 |
| Journal | Journal of the Franklin Institute |
| Volume | 360 |
| Issue number | 16 |
| Online published | 6 Sept 2023 |
| DOIs | |
| Publication status | Published - Nov 2023 |
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Distributed online bandit linear regressions with differential privacy'. Together they form a unique fingerprint.
- GRF: Distributed Mirror Descent Algorithm over Multi-agent Networks with Imperfect Communication
  HO, W. C. D. (Principal Investigator / Project Coordinator)
  1/01/24 → …
  Project: Research
- GRF: Distributed Optimization over Multi-agent Networks
  HO, W. C. D. (Principal Investigator / Project Coordinator)
  1/01/22 → 6/11/25
  Project: Research