Coresets for Wasserstein Distributionally Robust Optimization Problems

Ruomin Huang, Jiawei Huang, Wenjie Liu, Hu Ding*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

5 Citations (Scopus)

Abstract

Wasserstein distributionally robust optimization (WDRO) is a popular model to enhance the robustness of machine learning with ambiguous data. However, the complexity of WDRO can be prohibitive in practice since solving its “minimax” formulation requires a great amount of computation. Recently, several fast WDRO training algorithms for some specific machine learning tasks (e.g., logistic regression) have been developed. However, the research on designing efficient algorithms for general large-scale WDROs is still quite limited, to the best of our knowledge. Coreset is an important tool for compressing large dataset, and thus it has been widely applied to reduce the computational complexities for many optimization problems. In this paper, we introduce a unified framework to construct the ϵ-coreset for the general WDRO problems. Though it is challenging to obtain a conventional coreset for WDRO due to the uncertainty issue of ambiguous data, we show that we can compute a “dual coreset” by using the strong duality property of WDRO. Also, the error introduced by the dual coreset can be theoretically guaranteed for the original WDRO objective. To construct the dual coreset, we propose a novel grid sampling approach that is particularly suitable for the dual formulation of WDRO. Finally, we implement our coreset approach and illustrate its effectiveness for several WDRO problems in the experiments. See arXiv:2210.04260 for the full version of this paper. The code is available at https://github.com/h305142/WDRO_coreset. © 2022 Neural information processing systems foundation. All rights reserved.
Original languageEnglish
Title of host publicationThirty-Sixth Conference on Neural Information Processing Systems, NeurIPS 2022
EditorsS. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, A. Oh
PublisherNeural Information Processing Systems (NeurIPS)
Number of pages14
ISBN (Print)9781713871088
Publication statusPublished - Nov 2022
Event36th Conference on Neural Information Processing Systems (NeurIPS 2022) - Hybrid, New Orleans Convention Center, New Orleans, United States
Duration: 28 Nov 20229 Dec 2022
https://neurips.cc/
https://nips.cc/Conferences/2022
https://proceedings.neurips.cc/paper_files/paper/2022

Publication series

NameAdvances in Neural Information Processing Systems
Volume35
ISSN (Print)1049-5258

Conference

Conference36th Conference on Neural Information Processing Systems (NeurIPS 2022)
Abbreviated titleNIPS '22
PlaceUnited States
CityNew Orleans
Period28/11/229/12/22
Internet address

Fingerprint

Dive into the research topics of 'Coresets for Wasserstein Distributionally Robust Optimization Problems'. Together they form a unique fingerprint.

Cite this