Abstract
Concept-based models can map black-box representations to human-understandable concepts, which makes the decision-making process more transparent and then allows users to understand the reason behind predictions. However, domain-specific concepts often impact the final predictions, which subsequently undermine the model generalization capabilities, and prevent the model from being used in high-stake applications. In this paper, we propose a novel Language-guided Concept-Erasing (LanCE) framework. In particular, we empirically demonstrate that pre-trained vision-language models (VLMs) can approximate distinct visual domain shifts via domain descriptors while prompting large Language Models (LLMs) can easily simulate a wide range of descriptors of unseen visual domains. Then, we introduce a novel plug-in domain descriptor orthogonality (DDO) regularizer to mitigate the impact of these domain-specific concepts on the final predictions. Notably, the DDO regularizer is agnostic to the design of conceptbased models and we integrate it into several prevailing models. Through evaluation of domain generalization on four standard benchmarks and three newly introduced benchmarks, we demonstrate that DDO can significantly improve the out-of-distribution (OOD) generalization over the previous state-of-the-art concept-based models. Our code is available at https://github. com/joeyz0z/LanCE. ©2025 IEEE.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition CVPR 2025 |
| Publisher | IEEE |
| Pages | 9517-9526 |
| Number of pages | 10 |
| ISBN (Electronic) | 979-8-3315-4364-8 |
| ISBN (Print) | 979-8-3315-4365-5 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025) - Music City Center, Nashville, United States Duration: 11 Jun 2025 → 15 Jun 2025 https://cvpr.thecvf.com/Conferences/2025 https://cvpr.thecvf.com/ |
Publication series
| Name | Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
|---|---|
| ISSN (Print) | 1063-6919 |
| ISSN (Electronic) | 2575-7075 |
Conference
| Conference | 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025) |
|---|---|
| Abbreviated title | CVPR2025 |
| Place | United States |
| City | Nashville |
| Period | 11/06/25 → 15/06/25 |
| Internet address |
Bibliographical note
Research Unit(s) information for this publication is provided by the author(s) concerned.Funding
This work was supported in part by the National Natural Science Foundation of China under Grant U21B2006; in part by Shaanxi Youth Innovation Team Project; in part by the Fundamental Research Funds for the Central Universities QTZX24003 and QTZX22160; in part by the 111 Project under Grant B18039; Hao Zhang acknowledges the support of NSFC (62301384); Excellent Young Scientists Fund (Overseas); Foundation of National Key Laboratory of Radar Signal Processing under Grant JKW202308. Zhengjue Wang acknowledges the support of NSFC (62301407).
Fingerprint
Dive into the research topics of 'Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver