On the design of black-box adversarial examples by leveraging gradient-free optimization and operator splitting method

Pu Zhao, Sijia Liu, Pin-Yu Chen, Trong Nghia Hoang, Kaidi Xu, Bhavya Kailkhura, Xue Lin

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Robust machine learning is currently one of the most prominent topics which could potentially help shaping a future of advanced AI platforms that not only perform well in average cases but also in worst cases or adverse situations. Despite the long-term vision, however, existing studies on black-box adversarial attacks are still restricted to very specific settings of threat models (e.g., single distortion metric and restrictive assumption on target model's feedback to queries) and/or suffer from prohibitively high query complexity. To push for further advances in this field, we introduce a general framework based on an operator splitting method, the alternating direction method of multipliers (ADMM) to devise efficient, robust black-box attacks that work with various distortion metrics and feedback settings without incurring high query complexity. Due to the black-box nature of the threat model, the proposed ADMM solution framework is integrated with zeroth-order (ZO) optimization and Bayesian optimization (BO), and thus is applicable to the gradient-free regime. This results in two new black-box adversarial attack generation methods, ZO-ADMM and BO-ADMM. Our empirical evaluations on image classification datasets show that our proposed approaches have much lower function query complexities compared to state-of-the-art attack methods, but achieve very competitive attack success rates. © 2019 IEEE.
Original languageEnglish
Title of host publicationProceedings - 2019 International Conference on Computer Vision
PublisherIEEE
Pages121-130
ISBN (Electronic)9781728148038
ISBN (Print)978-1-7281-4804-5
DOIs
Publication statusPublished - Oct 2019
Externally publishedYes
Event17th IEEE/CVF International Conference on Computer Vision, ICCV 2019 - Seoul, Korea, Republic of
Duration: 27 Oct 20192 Nov 2019
https://iccv2019.thecvf.com/

Publication series

NameProceedings of the IEEE International Conference on Computer Vision
ISSN (Print)1550-5499
ISSN (Electronic)2380-7504

Conference

Conference17th IEEE/CVF International Conference on Computer Vision, ICCV 2019
PlaceKorea, Republic of
CitySeoul
Period27/10/192/11/19
Internet address

Funding

This work is partly supported by the National Science Foundation CNS-1932351.

Fingerprint

Dive into the research topics of 'On the design of black-box adversarial examples by leveraging gradient-free optimization and operator splitting method'. Together they form a unique fingerprint.

Cite this