Fast Bellman Updates for Wasserstein Distributionally Robust MDPs

Research output: Chapters, Conference Papers, Creative and Literary Works; RGC 32 - Refereed conference paper (with host publication); peer-review

5 Citations (Scopus)

Abstract

Markov decision processes (MDPs) often suffer from the sensitivity issue under model ambiguity. In recent years, robust MDPs have emerged as an effective framework to overcome this challenge. Distributionally robust MDPs extend the robust MDP framework by incorporating distributional information of the uncertain model parameters to alleviate the conservative nature of robust MDPs. This paper proposes a computationally efficient solution framework for solving distributionally robust MDPs with Wasserstein ambiguity sets. By exploiting the specific problem structure, the proposed framework decomposes the optimization problems associated with distributionally robust Bellman updates into smaller subproblems, which can be solved efficiently. The overall complexity of the proposed algorithm is quasi-linear in both the numbers of states and actions when the distance metric of the Wasserstein distance is chosen to be the L1, L2, or L∞ norm, and so the computational cost of distributional robustness is substantially reduced. Our numerical experiments demonstrate that the proposed algorithms outperform other state-of-the-art solution methods. © 2023 Neural Information Processing Systems Foundation. All rights reserved.
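
As a rough illustration of what a robust Bellman update looks like, the sketch below uses a deliberately simpler setting than the paper: a plain robust MDP in which each transition vector may deviate from a nominal one by at most kappa in L1 distance. This is a standard greedy construction, not the paper's Wasserstein algorithm, and the names worst_case_l1, robust_bellman_update, and kappa are hypothetical. The sort is the dominant cost, so each inner problem takes quasi-linear time in the number of states.

```python
import numpy as np

def worst_case_l1(p, v, kappa):
    """Minimize q @ v over distributions q with ||q - p||_1 <= kappa.

    Assumption: a plain L1 ball around the nominal vector p (illustrative only).
    """
    q = np.asarray(p, dtype=float).copy()
    v = np.asarray(v, dtype=float)
    i_min = int(np.argmin(v))
    # Mass moved to the lowest-value state; it changes the L1 norm by 2 * eps.
    eps = min(kappa / 2.0, 1.0 - q[i_min])
    q[i_min] += eps
    # Remove the same mass from the highest-value states first.
    for i in np.argsort(v)[::-1]:
        if eps <= 0:
            break
        if i == i_min:
            continue
        take = min(eps, q[i])
        q[i] -= take
        eps -= take
    return q, float(q @ v)

def robust_bellman_update(P, r, v, gamma, kappa):
    """One robust value update; P has shape (S, A, S) and r has shape (S, A)."""
    S, A = r.shape
    v_new = np.empty(S)
    for s in range(S):
        v_new[s] = max(
            r[s, a] + gamma * worst_case_l1(P[s, a], v, kappa)[1]
            for a in range(A)
        )
    return v_new
```

Calling robust_bellman_update repeatedly until the values stop changing gives robust value iteration under these assumptions; setting kappa to 0 recovers the ordinary Bellman update.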
Original language: English
Title of host publication: NeurIPS Proceedings
Subtitle of host publication: Advances in Neural Information Processing Systems 36 (NeurIPS 2023)
Editors: A. Oh, T. Naumann, G. Globerson, K. Saenko, M. Hardt, S. Levine
Publisher: Neural Information Processing Systems (NeurIPS)
Volume: 36
Publication status: Published - Dec 2023
Event: 37th Conference on Neural Information Processing Systems (NeurIPS 2023) - New Orleans Ernest N. Morial Convention Center, New Orleans, United States
Duration: 10 Dec 2023 to 16 Dec 2023
https://papers.nips.cc/paper_files/paper/2023
https://nips.cc/Conferences/2023

Publication series

Name: Advances in Neural Information Processing Systems
ISSN (Print): 1049-5258

Conference

Conference: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Abbreviated title: NIPS '23
Country/Territory: United States
City: New Orleans
Period: 10/12/23 to 16/12/23

Funding

We thank the anonymous reviewers for their supportive comments. This work was supported, in part, by the CityU Start-Up Grant (Project No. 9610481), the National Natural Science Foundation of China (Project No. 72032005), and the Chow Sang Sang Group Research Fund sponsored by Chow Sang Sang Holdings International Limited (Project No. 9229076).
