Abstract
Markov decision processes (MDPs) often suffer from the sensitivity issue under model ambiguity. In recent years, robust MDPs have emerged as an effective framework to overcome this challenge. Distributionally robust MDPs extend the robust MDP framework by incorporating distributional information of the uncertain model parameters to alleviate the conservative nature of robust MDPs. This paper proposes a computationally efficient solution framework for solving distributionally robust MDPs with Wasserstein ambiguity sets. By exploiting the specific problem structure, the proposed framework decomposes the optimization problems associated with distributionally robust Bellman updates into smaller subproblems, which can be solved efficiently. The overall complexity of the proposed algorithm is quasi-linear in both the numbers of states and actions when the distance metric of the Wasserstein distance is chosen to be L1, L2, or L∞ norm, and so the computational cost of distributional robustness is substantially reduced. Our numerical experiments demonstrate that the proposed algorithms outperform other state-of-the-art solution methods. © 2023 Neural information processing systems foundation. All rights reserved.
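As a rough illustration of what a robust Bellman update looks like, and of why sorting makes the inner worst-case problem quasi-linear in the number of states, the sketch below uses a deliberately simpler, swapped-in setting: a classic robust MDP whose ambiguity set is an L1 ball of radius kappa around a nominal transition vector. It is not the paper's Wasserstein algorithm, and the names `worst_case_l1`, `robust_bellman_update`, `p_bar`, and `kappa` are hypothetical.

```python
import numpy as np


def worst_case_l1(z, p_bar, kappa):
    """Minimize p @ z over the simplex subject to ||p - p_bar||_1 <= kappa.

    Assumption: an L1-ball ambiguity set, used only as a stand-in for the
    Wasserstein sets studied in the paper.
    """
    z = np.asarray(z, dtype=float)
    p = np.array(p_bar, dtype=float)
    i_min = int(np.argmin(z))
    # At most kappa / 2 probability mass can be moved within the L1 budget.
    eps = min(kappa / 2.0, 1.0 - p[i_min])
    p[i_min] += eps
    # Take the same mass away from the highest-value states first.
    # The sort dominates the cost: O(S log S) for S states.
    for i in np.argsort(-z):
        if eps <= 0.0:
            break
        if i == i_min:
            continue
        take = min(eps, p[i])
        p[i] -= take
        eps -= take
    return float(p @ z), p


def robust_bellman_update(v, r, P_bar, kappa, gamma):
    """One robust Bellman update: max over actions of the worst-case value.

    r has shape (S, A); P_bar has shape (S, A, S) and holds the nominal
    transition probabilities (hypothetical layout).
    """
    n_states, n_actions = r.shape
    v_new = np.empty(n_states)
    for s in range(n_states):
        q = [
            r[s, a] + gamma * worst_case_l1(v, P_bar[s, a], kappa)[0]
            for a in range(n_actions)
        ]
        v_new[s] = max(q)
    return v_new
```

With kappa = 0 the update reduces to the ordinary (nominal) Bellman update, and increasing kappa can only lower the resulting values, which is the conservatism that distributional information is meant to temper.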
Original language | English |
---|---|
Title of host publication | NeurIPS Proceedings |
Subtitle of host publication | Advances in Neural Information Processing Systems 36 (NeurIPS 2023) |
Editors | A. Oh, T. Naumann, G. Globerson, K. Saenko, M. Hardt, S. Levine |
Publisher | Neural Information Processing Systems (NeurIPS) |
Volume | 36 |
Publication status | Published - Dec 2023 |
Event | 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans Ernest N. Morial Convention Center, New Orleans, United States; Duration: 10 Dec 2023 → 16 Dec 2023; https://papers.nips.cc/paper_files/paper/2023, https://nips.cc/Conferences/2023 |
Publication series
Name | Advances in Neural Information Processing Systems |
---|---|
ISSN (Print) | 1049-5258 |
Conference
Conference | 37th Conference on Neural Information Processing Systems (NeurIPS 2023) |
---|---|
Abbreviated title | NIPS '23 |
Country/Territory | United States |
City | New Orleans |
Period | 10/12/23 → 16/12/23 |
Funding
We thank the anonymous reviewers for their supportive comments. This work was supported, in part, by the CityU Start-Up Grant (Project No. 9610481), the National Natural Science Foundation of China (Project No. 72032005), and Chow Sang Sang Group Research Fund sponsored by Chow Sang Sang Holdings International Limited (Project No. 9229076).
Fingerprint
Dive into the research topics of 'Fast Bellman Updates for Wasserstein Distributionally Robust MDPs'. Together they form a unique fingerprint.
Projects
- 1 Active
- DON_RMG: Artificial Intelligence with Imperfect Models and Data - RMGS
  HO, C. P. (Principal Investigator / Project Coordinator)
  1/07/21 → …
  Project: Research