Contextual Combinatorial Bandits with Probabilistically Triggered Arms

Xutong Liu*, Jinhang Zuo, Siwei Wang, John C.S. Lui*, Mohammad Hajiesmaili, Adam Wierman, Wei Chen*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

1 Citation (Scopus)

Abstract

We study contextual combinatorial bandits with probabilistically triggered arms (C2MAB-T) under a variety of smoothness conditions that capture a wide range of applications, such as contextual cascading bandits and contextual influence maximization bandits. Under the triggering probability modulated (TPM) condition, we devise the C2-UCB-T algorithm and propose a novel analysis that achieves an Õ(d√KT) regret bound, removing a potentially exponentially large factor O(1/pmin), where d is the dimension of contexts, pmin is the minimum positive probability that any arm can be triggered, and batch-size K is the maximum number of arms that can be triggered per round. Under the variance modulated (VM) or triggering probability and variance modulated (TPVM) conditions, we propose a new variance-adaptive algorithm VAC2-UCB and derive a regret bound Õ(d√T), which is independent of the batch-size K. As a valuable by-product, our analysis technique and variance-adaptive algorithm can be applied to the CMAB-T and C2MAB setting, improving existing results there as well. We also include experiments that demonstrate the improved performance of our algorithms compared with benchmark algorithms on synthetic and real-world datasets.

© 2023 by the author(s)
Original languageEnglish
Title of host publicationProceedings of the 40th International Conference on Machine Learning
PublisherML Research Press
Publication statusPublished - Jul 2023
Externally publishedYes
Event40th International Conference on Machine Learning (ICML 2023) - Hawaii Convention Center, Honolulu, United States
Duration: 23 Jul 202329 Jul 2023
https://icml.cc/

Publication series

NameProceedings of Machine Learning Research
Volume202
ISSN (Print)2640-3498

Conference

Conference40th International Conference on Machine Learning (ICML 2023)
Abbreviated titleICML'23
PlaceUnited States
CityHonolulu
Period23/07/2329/07/23
Internet address

Funding

The work of John C.S. Lui was supported in part by RGC’s GRF 14215722. The work of Mohammad Hajiesmaili is supported by NSF CAREER-2045641, CPS-2136199, CNS2106299, and CNS-2102963. Wierman is supported by NSF grants CNS-2146814, CPS-2136197, CNS-2106403, and NGSDI-2105648.

Fingerprint

Dive into the research topics of 'Contextual Combinatorial Bandits with Probabilistically Triggered Arms'. Together they form a unique fingerprint.

Cite this