Skip to main navigation Skip to search Skip to main content

Stochastic Bandits Robust to Adversarial Attacks

Xuchuang Wang, Maoli Liu, Jinhang Zuo, Xutong Liu*, John C.S. Lui, Mohammad Hajiesmaili

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

This paper investigates stochastic bandit algorithms that are robust to adversarial attacks, where an attacker can first observe the learner's action and then alter their reward observation. We study two cases of this model, with or without the knowledge of an attack budget C, defined as an upper bound of the summation of the difference between the actual and altered rewards. For both cases, we devise two types of algorithms with regret bounds having additive or multiplicative C dependence terms. For the known attack budget case, we prove our algorithms achieve the regret bound of O((K/∆) log T + KC) and Õ(√KTC) for the additive and multiplicative C terms, respectively, where K is the number of arms, T is the time horizon, ∆ is the gap between the expected rewards of the optimal arm and the second-best arm, and Õ hides the logarithmic factors. For the unknown case, we prove our algorithms achieve the regret bound of Õ(√KT + KC2) and Õ(KC√T) for the additive and multiplicative C terms, respectively. In addition to these upper bound results, we provide several lower bounds showing the tightness of our bounds and the optimality of our algorithms. These results delineate an intrinsic separation between the bandits with attacks and corruption models.
Original languageEnglish
Title of host publication13th International Conference on Learning Representations, ICLR 2025
EditorsY. Yue, A. Garg, N. Peng, F. Sha, R. Yu
PublisherInternational Conference on Learning Representations, ICLR
Pages89797-89816
Number of pages20
ISBN (Electronic)9798331320850
Publication statusPublished - Jun 2025
Event13th International Conference on Learning Representations (ICLR 2025) - Singapore EXPO, Singapore, Singapore
Duration: 24 Apr 202528 Apr 2025
https://iclr.cc/Conferences/2025

Publication series

NameICLR Proceedings

Conference

Conference13th International Conference on Learning Representations (ICLR 2025)
Abbreviated titleICLR 2025
PlaceSingapore
CitySingapore
Period24/04/2528/04/25
Internet address

Bibliographical note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Funding

The authors want to thank Thodoris Lykouris for suggesting this topic in an AIM’s SQuaREs meeting and for his invaluable feedback and insightful discussions throughout the research. The work of Jinhang Zuo was supported by CityU 9610706. The work of John C.S. Lui was supported in part by the RGC GRF14202923. The work of Mohammad Hajiesmaili was supported by NSF CPS2136199, CAREER-2045641, and CNS-2325956. The work of Xutong Liu was supported in part by a fellowship award from the Research Grants Council of the Hong Kong Special Administrative Region, China (CUHK PDFS2324-4S04). Xutong Liu is the corresponding author.

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Stochastic Bandits Robust to Adversarial Attacks'. Together they form a unique fingerprint.

Cite this