Skip to main navigation Skip to search Skip to main content

Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification

  • Zequn Zeng (Co-first Author)
  • , Yudi Su (Co-first Author)
  • , Jianqiao Sun
  • , Tiansheng Wen
  • , Hao Zhang
  • , Zhengjue Wang
  • , Bo Chen*
  • , Hongwei Liu*
  • , Jiawei Ma
  • *Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Concept-based models can map black-box representations to human-understandable concepts, which makes the decision-making process more transparent and then allows users to understand the reason behind predictions. However, domain-specific concepts often impact the final predictions, which subsequently undermine the model generalization capabilities, and prevent the model from being used in high-stake applications. In this paper, we propose a novel Language-guided Concept-Erasing (LanCE) framework. In particular, we empirically demonstrate that pre-trained vision-language models (VLMs) can approximate distinct visual domain shifts via domain descriptors while prompting large Language Models (LLMs) can easily simulate a wide range of descriptors of unseen visual domains. Then, we introduce a novel plug-in domain descriptor orthogonality (DDO) regularizer to mitigate the impact of these domain-specific concepts on the final predictions. Notably, the DDO regularizer is agnostic to the design of conceptbased models and we integrate it into several prevailing models. Through evaluation of domain generalization on four standard benchmarks and three newly introduced benchmarks, we demonstrate that DDO can significantly improve the out-of-distribution (OOD) generalization over the previous state-of-the-art concept-based models. Our code is available at https://github. com/joeyz0z/LanCE. ©2025 IEEE.
Original languageEnglish
Title of host publicationProceedings - 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition CVPR 2025
PublisherIEEE
Pages9517-9526
Number of pages10
ISBN (Electronic)979-8-3315-4364-8
ISBN (Print)979-8-3315-4365-5
DOIs
Publication statusPublished - 2025
Event2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025) - Music City Center, Nashville, United States
Duration: 11 Jun 202515 Jun 2025
https://cvpr.thecvf.com/Conferences/2025
https://cvpr.thecvf.com/

Publication series

NameProceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISSN (Print)1063-6919
ISSN (Electronic)2575-7075

Conference

Conference2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025)
Abbreviated titleCVPR2025
PlaceUnited States
CityNashville
Period11/06/2515/06/25
Internet address

Bibliographical note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant U21B2006; in part by Shaanxi Youth Innovation Team Project; in part by the Fundamental Research Funds for the Central Universities QTZX24003 and QTZX22160; in part by the 111 Project under Grant B18039; Hao Zhang acknowledges the support of NSFC (62301384); Excellent Young Scientists Fund (Overseas); Foundation of National Key Laboratory of Radar Signal Processing under Grant JKW202308. Zhengjue Wang acknowledges the support of NSFC (62301407).

Fingerprint

Dive into the research topics of 'Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification'. Together they form a unique fingerprint.

Cite this