Skip to main navigation Skip to search Skip to main content

Universal Adversarial Perturbations for Vision-Language Pre-trained Models

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Vision-language pre-trained (VLP) models have been the foundation of numerous vision-language tasks. Given their prevalence, it becomes imperative to assess their adversarial robustness, especially when deploying them in security-crucial real-world applications. Traditionally, adversarial perturbations generated for this assessment target specific VLP models, datasets, and/or downstream tasks. This practice suffers from low transferability and additional computation costs when transitioning to new scenarios. In this work, we thoroughly investigate whether VLP models are commonly sensitive to imperceptible perturbations of a specific pattern for the image modality. To this end, we propose a novel black-box method to generate Universal Adversarial Perturbations (UAPs), which is so called the Effective and Transferable Universal Adversarial Attack (ETU), aiming to mislead a variety of existing VLP models in a range of downstream tasks. The ETU comprehensively takes into account the characteristics of UAPs and the intrinsic cross-modal interactions to generate effective UAPs. Under this regime, the ETU encourages both global and local utilities of UAPs. This benefits the overall utility while reducing interactions between UAP units, improving the transferability. To further enhance the effectiveness and transferability of UAPs, we also design a novel data augmentation method named ScMix. ScMix consists of self-mix and cross-mix data transformations, which can effectively increase the multi-modal data diversity while preserving the semantics of the original data. Through comprehensive experiments on various downstream tasks, VLP models, and datasets, we demonstrate that the proposed method is able to achieve effective and transferrable universal adversarial attacks. © 2024 ACM.
Original languageEnglish
Title of host publicationSIGIR '24 - Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery
Pages862-871
Number of pages10
ISBN (Print)9798400704314
DOIs
Publication statusPublished - Jul 2024
Externally publishedYes
Event47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024) - Washington, United States
Duration: 14 Jul 202418 Jul 2024
https://sigir-2024.github.io/index.html

Publication series

NameSIGIR - Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval

Conference

Conference47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024)
Abbreviated titleACM SIGIR 24
PlaceUnited States
CityWashington
Period14/07/2418/07/24
Internet address

Funding

This work was partially supported by Australian Research Council Discovery Project (DP230101196, CE200100025).

Research Keywords

  • multi-modal learning
  • transferrable attack
  • universal adversarial perturbations
  • vision-language pre-training

Fingerprint

Dive into the research topics of 'Universal Adversarial Perturbations for Vision-Language Pre-trained Models'. Together they form a unique fingerprint.

Cite this