Optimal Allocation Strategies in a Discrete-Time Exponential Bandit Problem

Research output: Conference PapersRGC 32 - Refereed conference paper (without host publication)peer-review

View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Number of pages28
Publication statusPublished - Aug 2024

Conference

Title76th Econometric Society European Summer Meeting (ESEM 2024)
LocationERASMUS University
PlaceNetherlands
CityRotterdam
Period26 - 30 August 2024

Abstract

This study addresses a theoretic-bandit problem involving a "safe" and a "risky" arm across countable periods. Departing from the "either-or" binary choices in previous studies, we explore smooth allocation strategies using the first-order approach. Modelling both the action and the posterior as state variables, we obtain clear characterizations of the optimal allocation strategies and comparative statics. The optimal plan significantly enhances the binary strategies, yielding a higher probability of breakthrough and a higher expected payoff. The Goldilocks principle emerges in that the incentives for exploring the risky arm peak at a level that is neither too difficult nor too easy.

Research Area(s)

  • two-armed bandit, first-order approach, discrete time, exponential distribution, Goldilocks principle

Citation Format(s)

Optimal Allocation Strategies in a Discrete-Time Exponential Bandit Problem. / Hu, Audrey; Zou, Liang.
2024. Paper presented at 76th Econometric Society European Summer Meeting (ESEM 2024)
, Rotterdam, Netherlands.

Research output: Conference PapersRGC 32 - Refereed conference paper (without host publication)peer-review