Abstract
We study contextual combinatorial bandits with probabilistically triggered arms (C2MAB-T) under a variety of smoothness conditions that capture a wide range of applications, such as contextual cascading bandits and contextual influence maximization bandits. Under the triggering probability modulated (TPM) condition, we devise the C2-UCB-T algorithm and propose a novel analysis that achieves an Õ(d√KT) regret bound, removing a potentially exponentially large factor O(1/pmin), where d is the dimension of contexts, pmin is the minimum positive probability that any arm can be triggered, and batch-size K is the maximum number of arms that can be triggered per round. Under the variance modulated (VM) or triggering probability and variance modulated (TPVM) conditions, we propose a new variance-adaptive algorithm VAC2-UCB and derive a regret bound Õ(d√T), which is independent of the batch-size K. As a valuable by-product, our analysis technique and variance-adaptive algorithm can be applied to the CMAB-T and C2MAB setting, improving existing results there as well. We also include experiments that demonstrate the improved performance of our algorithms compared with benchmark algorithms on synthetic and real-world datasets.
© 2023 by the author(s)
© 2023 by the author(s)
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 40th International Conference on Machine Learning |
| Publisher | ML Research Press |
| Publication status | Published - Jul 2023 |
| Externally published | Yes |
| Event | 40th International Conference on Machine Learning (ICML 2023) - Hawaii Convention Center, Honolulu, United States Duration: 23 Jul 2023 → 29 Jul 2023 https://icml.cc/ |
Publication series
| Name | Proceedings of Machine Learning Research |
|---|---|
| Volume | 202 |
| ISSN (Print) | 2640-3498 |
Conference
| Conference | 40th International Conference on Machine Learning (ICML 2023) |
|---|---|
| Abbreviated title | ICML'23 |
| Place | United States |
| City | Honolulu |
| Period | 23/07/23 → 29/07/23 |
| Internet address |
Funding
The work of John C.S. Lui was supported in part by RGC’s GRF 14215722. The work of Mohammad Hajiesmaili is supported by NSF CAREER-2045641, CPS-2136199, CNS2106299, and CNS-2102963. Wierman is supported by NSF grants CNS-2146814, CPS-2136197, CNS-2106403, and NGSDI-2105648.