Concept-wise Fine-tuning Matters in Preventing Negative Transfer

Yunqiao Yang, Long-Kai Huang, Ying Wei*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

4 Citations (Scopus)

Abstract

A multitude of prevalent pre-trained models mark a major milestone in the development of artificial intelligence, while fine-tuning has been a common practice that enables pretrained models to figure prominently in a wide array of target datasets. Our empirical results reveal that off-the-shelf finetuning techniques are far from adequate to mitigate negative transfer caused by two types of underperforming features in a pre-trained model, including rare features and spuriously correlated features. Rooted in structural causal models of predictions after fine-tuning, we propose a Concept-wise fine-tuning (Concept-Tuning) approach which refines feature representations in the level of patches with each patch encoding a concept. Concept-Tuning minimizes the negative impacts of rare features and spuriously correlated features by (1) maximizing the mutual information between examples in the same category with regard to a slice of rare features (a patch) and (2) applying front-door adjustment via attention neural networks in channels and feature slices (patches). The proposed Concept-Tuning consistently and significantly (by up to 4.76%) improves prior state-of-the-art fine-tuning methods on eleven datasets, diverse pre-training strategies (supervised and self-supervised ones), various network architectures, and sample sizes in a target dataset. © 2023 IEEE.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023
Place of PublicationLos Alamitos, Calif.
PublisherIEEE
Pages18707-18717
Number of pages11
ISBN (Electronic)9798350307184
ISBN (Print)9798350307191
DOIs
Publication statusPublished - Oct 2023
Event2023 IEEE/CVF International Conference on Computer Vision (ICCV 2023) - Paris Convention Center, Paris, France
Duration: 2 Oct 20236 Oct 2023
https://iccv2023.thecvf.com/

Publication series

NameProceedings of the IEEE International Conference on Computer Vision
ISSN (Print)1550-5499
ISSN (Electronic)2380-7504

Conference

Conference2023 IEEE/CVF International Conference on Computer Vision (ICCV 2023)
Abbreviated titleICCV23
PlaceFrance
CityParis
Period2/10/236/10/23
Internet address

Fingerprint

Dive into the research topics of 'Concept-wise Fine-tuning Matters in Preventing Negative Transfer'. Together they form a unique fingerprint.

Cite this