Abstract
In safety-critical applications such as medical imaging and autonomous driving, where decisions have profound implications for patient health and road safety, it is imperative to maintain both high adversarial robustness to protect against potential adversarial attacks and reliable uncertainty quantification in decision-making. With extensive research focused on enhancing adversarial robustness through various forms of adversarial training (AT), a notable knowledge gap remains concerning the uncertainty inherent in adversarially trained models. To address this gap, this study investigates the uncertainty of deep learning models by examining the performance of conformal prediction (CP) in the context of standard adversarial attacks within the adversarial defense community. It is first unveiled that existing CP methods do not produce informative prediction sets under the commonly used l∞-norm bounded attack if the model is not adversarially trained, which underpins the importance of adversarial training for CP. Our paper next demonstrates that the prediction set size (PSS) of CP using adversarially trained models with AT variants is often worse than using standard AT, inspiring us to research into CP-efficient AT for improved PSS. We propose to optimize a Beta-weighting loss with an entropy minimization regularizer during AT to improve CP-efficiency, where the Beta-weighting loss is shown to be an upper bound of PSS at the population level by our theoretical analysis. Moreover, our empirical study on four image classification datasets across three popular AT baselines validates the effectiveness of the proposed Uncertainty-Reducing AT (AT-UR). Copyright 2024 by the author(s)
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 41st International Conference on Machine Learning |
| Editors | Ruslan Salakhutdinov, Zico Kolter, Katherine Heller |
| Publisher | ML Research Press |
| Pages | 30908-30928 |
| Publication status | Published - Jul 2024 |
| Event | 41st International Conference on Machine Learning (ICML 2024) - Messe Wien Exhibition Congress Center, Vienna, Austria Duration: 21 Jul 2024 → 27 Jul 2024 https://proceedings.mlr.press/v235/ https://icml.cc/ |
Publication series
| Name | Proceedings of Machine Learning Research |
|---|---|
| Volume | 235 |
| ISSN (Print) | 2640-3498 |
Conference
| Conference | 41st International Conference on Machine Learning (ICML 2024) |
|---|---|
| Place | Austria |
| City | Vienna |
| Period | 21/07/24 → 27/07/24 |
| Internet address |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Funding
This work was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. CityU 11215820).
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
-
SDG 11 Sustainable Cities and Communities
Fingerprint
Dive into the research topics of 'The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: Defending against Adversarial Examples in Deep Learning: New Regularization and Training Methods
CHAN, A. B. (Principal Investigator / Project Coordinator)
1/01/21 → 23/06/25
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver