Technical note: Finite-time regret analysis of Kiefer-Wolfowitz stochastic approximation algorithm and nonparametric multi-product dynamic pricing with unknown demand

L. Jeff Hong*, Chenghuai Li, Jun Luo

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

6 Citations (Scopus)

Abstract

We consider the problem of nonparametric multi-product dynamic pricing with unknown demand and show that the problem may be formulated as an online model-free stochastic program, which can be solved by the classical Kiefer-Wolfowitz stochastic approximation (KWSA) algorithm. We prove that the expected cumulative regret of the KWSA algorithm is bounded above by κ1√T κwhere κ1, κ2 are positive constants and T is the number of periods for any T = 1, 2, …. Therefore, the regret of the KWSA algorithm grows in the order of √T , which achieves the lower bounds known for parametric dynamic pricing problems and shows that the nonparametric problems are not necessarily more difficult to solve than the parametric ones. Numerical experiments further demonstrate the effectiveness and efficiency of our proposed KW pricing policy by comparing with some pricing policies in the literature.

Original languageEnglish
Pages (from-to)368-379
JournalNaval Research Logistics
Volume67
Issue number5
Online published24 Apr 2020
DOIs
Publication statusPublished - Aug 2020
Externally publishedYes

Funding

National Natural Science Foundation of China, 71531010; 71722006; 71991473; The Hong Kong Research Grants Council, GRF 11504017 Funding information The authors would like to thank the Department Editor, Prof Rene Caldentey, the Associate Editor, and two reviewers for their insightful and detailed comments that have significantly improved this paper. This research of the first author was supported in part by the Hong Kong Research Grants Council (GRF 11504017) and the Natural Science Foundation of China (Grant 71991473), and this research of the third author was supported in part by the Natural Science Foundation of China (Grants 71722006 and 71531010). information National Natural Science Foundation of China, 71531010; 71722006; 71991473; The Hong Kong Research Grants Council, GRF 11504017The authors would like to thank the Department Editor, Prof Rene Caldentey, the Associate Editor, and two reviewers for their insightful and detailed comments that have significantly improved this paper. This research of the first author was supported in part by the Hong Kong Research Grants Council (GRF 11504017) and the Natural Science Foundation of China (Grant 71991473), and this research of the third author was supported in part by the Natural Science Foundation of China (Grants 71722006 and 71531010).

Research Keywords

  • dynamic pricing and learning
  • Kiefer-Wolfowitz algorithm
  • nonparametric pricing policy
  • revenue management
  • stochastic approximation

Fingerprint

Dive into the research topics of 'Technical note: Finite-time regret analysis of Kiefer-Wolfowitz stochastic approximation algorithm and nonparametric multi-product dynamic pricing with unknown demand'. Together they form a unique fingerprint.

Cite this