Abstract
Wasserstein distributionally robust optimization (WDRO) is a popular model to enhance the robustness of machine learning with ambiguous data. However, the complexity of WDRO can be prohibitive in practice since solving its “minimax” formulation requires a great amount of computation. Recently, several fast WDRO training algorithms for some specific machine learning tasks (e.g., logistic regression) have been developed. However, the research on designing efficient algorithms for general large-scale WDROs is still quite limited, to the best of our knowledge. Coreset is an important tool for compressing large dataset, and thus it has been widely applied to reduce the computational complexities for many optimization problems. In this paper, we introduce a unified framework to construct the ϵ-coreset for the general WDRO problems. Though it is challenging to obtain a conventional coreset for WDRO due to the uncertainty issue of ambiguous data, we show that we can compute a “dual coreset” by using the strong duality property of WDRO. Also, the error introduced by the dual coreset can be theoretically guaranteed for the original WDRO objective. To construct the dual coreset, we propose a novel grid sampling approach that is particularly suitable for the dual formulation of WDRO. Finally, we implement our coreset approach and illustrate its effectiveness for several WDRO problems in the experiments. See arXiv:2210.04260 for the full version of this paper. The code is available at https://github.com/h305142/WDRO_coreset. © 2022 Neural information processing systems foundation. All rights reserved.
| Original language | English |
|---|---|
| Title of host publication | Thirty-Sixth Conference on Neural Information Processing Systems, NeurIPS 2022 |
| Editors | S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, A. Oh |
| Publisher | Neural Information Processing Systems (NeurIPS) |
| Number of pages | 14 |
| ISBN (Print) | 9781713871088 |
| Publication status | Published - Nov 2022 |
| Event | 36th Conference on Neural Information Processing Systems (NeurIPS 2022) - Hybrid, New Orleans Convention Center, New Orleans, United States Duration: 28 Nov 2022 → 9 Dec 2022 https://neurips.cc/ https://nips.cc/Conferences/2022 https://proceedings.neurips.cc/paper_files/paper/2022 |
Publication series
| Name | Advances in Neural Information Processing Systems |
|---|---|
| Volume | 35 |
| ISSN (Print) | 1049-5258 |
Conference
| Conference | 36th Conference on Neural Information Processing Systems (NeurIPS 2022) |
|---|---|
| Abbreviated title | NIPS '22 |
| Place | United States |
| City | New Orleans |
| Period | 28/11/22 → 9/12/22 |
| Internet address |