Privacy-Preserving Communication-Efficient Federated Multi-Armed Bandits

Tan Li*, Linqi Song*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

27 Citations (Scopus)

Abstract

Communication bottleneck and data privacy are two critical concerns in federated multi-armed bandit (MAB) problems, such as situations in decision-making and recommendations of connected vehicles via wireless. In this paper, we design the privacy-preserving communication-efficient algorithm in such problems and study the interactions among privacy, communication and learning performance in terms of the regret. To be specific, we design privacy-preserving learning algorithms and communication protocols and derive the learning regret when networked private agents are performing online bandit learning in a master-worker, a decentralized and a hybrid structure. Our bandit learning algorithms are based on epoch-wise sub-optimal arm eliminations at each agent and agents exchange learning knowledge with the server/each other at the end of each epoch. Furthermore, we adopt the differential privacy (DP) approach to protect the data privacy at each agent when exchanging information; and we curtail communication costs by making less frequent communications with fewer agents participation. By analyzing the regret of our proposed algorithmic framework in the master-worker, decentralized and hybrid structures, we theoretically show trade-offs between regret and communication costs/privacy. Finally, we empirically show these trade-offs which are consistent with our theoretical analysis.
Original languageEnglish
Pages (from-to)773-787
Number of pages15
JournalIEEE Journal on Selected Areas in Communications
Volume40
Issue number3
Online published12 Jan 2022
DOIs
Publication statusPublished - Mar 2022

Funding

This work was supported in part by the Hong Kong Research Grants Council (RGC) under Grant ECS 21212419; in part by the Technological Breakthrough Project of Science, Technology and Innovation Commission of Shenzhen Municipality, under Grant JSGG20201102162000001; in part by the Guangdong Basic and Applied Basic Research Foundation under Key Project under Grant 2019B1515120032; in part by the Shenzhen Science and Technology Funding Fundamental Research Program under Project 2021Szvup126; and in part by the Hong Kong Laboratory for AI-Powered Financial Technologies.

Research Keywords

  • communication efficient learning
  • Costs
  • Data models
  • differential privacy
  • Federated learning
  • Knowledge engineering
  • multi-armed bandit
  • Privacy
  • Protocols
  • Servers
  • Wireless communication

Fingerprint

Dive into the research topics of 'Privacy-Preserving Communication-Efficient Federated Multi-Armed Bandits'. Together they form a unique fingerprint.

Cite this