Downstream-agnostic Adversarial Examples

Ziqi Zhou, Shengshan Hu*, Ruizhi Zhao, Qian Wang, Leo Yu Zhang, Junhui Hou, Hai Jin

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

25 Citations (Scopus)

Abstract

Self-supervised learning usually uses a large amount of unlabeled data to pre-train an encoder which can be used as a general-purpose feature extractor, such that downstream users only need to perform fine-tuning operations to enjoy the benefit of "large model". Despite this promising prospect, the security of pre-trained encoder has not been thoroughly investigated yet, especially when the pre-trained encoder is publicly available for commercial use.
In this paper, we propose AdvEncoder, the first framework for generating downstream-agnostic universal adversarial examples based on the pre-trained encoder. AdvEncoder aims to construct a universal adversarial perturbation or patch for a set of natural images that can fool all the downstream tasks inheriting the victim pre-trained encoder. Unlike traditional adversarial example works, the pre-trained encoder only outputs feature vectors rather than classification labels. Therefore, we first exploit the high frequency component information of the image to guide the generation of adversarial examples. Then we design a generative attack framework to construct adversarial perturbations/patches by learning the distribution of the attack surrogate dataset to improve their attack success rates and transferability. Our results show that an attacker can successfully attack downstream tasks without knowing either the pre-training dataset or the downstream dataset. We also tailor four defences for pre-trained encoders, the results of which further prove the attack ability of AdvEncoder. Our codes are available at: https://github.com/CGCL-codes/AdvEncoder. ©2023 IEEE.
Original languageEnglish
Title of host publicationProceedings - 2023 IEEE/CVF International Conference on Computer Vision ICCV 2023
PublisherIEEE
Pages4322-4332
Number of pages11
ISBN (Electronic)979-8-3503-0718-4
ISBN (Print)979-8-3503-0719-1
DOIs
Publication statusPublished - Oct 2023
Event2023 IEEE/CVF International Conference on Computer Vision (ICCV 2023) - Paris Convention Center, Paris, France
Duration: 2 Oct 20236 Oct 2023
https://iccv2023.thecvf.com/

Publication series

NameProceedings of the IEEE International Conference on Computer Vision
ISSN (Print)1550-5499
ISSN (Electronic)2380-7504

Conference

Conference2023 IEEE/CVF International Conference on Computer Vision (ICCV 2023)
Abbreviated titleICCV23
PlaceFrance
CityParis
Period2/10/236/10/23
Internet address

Fingerprint

Dive into the research topics of 'Downstream-agnostic Adversarial Examples'. Together they form a unique fingerprint.

Cite this