Skip to main navigation Skip to search Skip to main content

Improve Video Representation with Temporal Adversarial Augmentation

  • Jinhao Duan
  • , Quanfu Fan
  • , Hao Cheng
  • , Xiaoshuang Shi*
  • , Kaidi Xu*
  • *Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Recent works reveal that adversarial augmentation benefits the generalization of neural networks (NNs) if used in an appropriate manner. In this paper, we introduce Temporal Adversarial Augmentation (TA), a novel video augmentation technique that utilizes temporal attention. Unlike conventional adversarial augmentation, TA is specifically designed to shift the attention distributions of neural networks with respect to video clips by maximizing a temporal-related loss function. We demonstrate that TA will obtain diverse temporal views, which significantly affect the focus of neural networks. Training with these examples remedies the flaw of unbalanced temporal information perception and enhances the ability to defend against temporal shifts, ultimately leading to better generalization. To leverage TA, we propose Temporal Video Adversarial Fine-tuning (TAF) framework for improving video representations. TAF is a model-agnostic, generic, and interpretability-friendly training strategy. We evaluate TAF with four powerful models (TSM, GST, TAM, and TPN) over three challenging temporal-related benchmarks (Something-something V1&V2 and diving48). Experimental results demonstrate that TAF effectively improves the test accuracy of these models with notable margins without introducing additional parameters or computational costs. As a byproduct, TAF also improves the robustness under out-of-distribution (OOD) settings. Code is available at https://github.com/jinhaoduan/TAF. © 2023 International Joint Conferences on Artificial Intelligence. All rights reserved.
Original languageEnglish
Title of host publicationProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
EditorsEdith Elkind
PublisherInternational Joint Conferences on Artificial Intelligence
Pages708-716
ISBN (Electronic)9781956792034
DOIs
Publication statusPublished - Aug 2023
Externally publishedYes
Event32nd International Joint Conference on Artificial Intelligence (IJCAI 2023) - Sheraton Grand Macao, Macao, China
Duration: 19 Aug 202325 Aug 2023
https://ijcai-23.org/

Publication series

NameIJCAI International Joint Conference on Artificial Intelligence
ISSN (Print)1045-0823

Conference

Conference32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)
Abbreviated titleIJCAI-23
PlaceMacao, China
Period19/08/2325/08/23
Internet address

Fingerprint

Dive into the research topics of 'Improve Video Representation with Temporal Adversarial Augmentation'. Together they form a unique fingerprint.

Cite this