Abstract
Model editing aims to data-efficiently correct predictive errors of large pre-trained models while ensuring generalization to neighboring failures and locality to minimize unintended effects on unrelated examples. While significant progress has been made in editing Transformer-based large language models, effective strategies for editing vision Transformers (ViTs) in computer vision remain largely untapped. In this paper, we take initial steps towards correcting predictive errors of ViTs, particularly those arising from subpopulation shifts. Taking a locate-then-edit approach, we first address the “where-to-edit” challenge by meta-learning a hypernetwork on CutMix-augmented data generated for editing reliability. This trained hypernetwork produces generalizable binary masks that identify a sparse subset of structured model parameters, responsive to real-world failure samples. Afterward, we solve the “how-to-edit” problem by simply fine-tuning the identified parameters using a variant of gradient descent to achieve successful edits. To validate our method, we construct an editing benchmark that introduces subpopulation shifts towards natural underrepresented images and AI-generated images, thereby revealing the limitations of pre-trained ViTs for object recognition. Our approach not only achieves superior performance on the proposed benchmark but also allows for adjustable trade-offs between generalization and locality. Our code is available at https://github.com/hustyyq/Where-to-Edit. © 2024 Neural information processing systems foundation. All rights reserved.
| Original language | English |
|---|---|
| Title of host publication | NeurIPS Proceedings |
| Subtitle of host publication | Advances in Neural Information Processing Systems 37 (NeurIPS 2024) |
| Editors | A. Globerson, L. Mackey, D. Belgrave, A. Fan, U. Paquet, J. Tomczak, C. Zhang |
| Publisher | Neural Information Processing Systems (NeurIPS) |
| Publication status | Published - 2024 |
| Event | 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024) - Vancouver Convention Center, Vancouver, Canada Duration: 10 Dec 2024 → 15 Dec 2024 https://neurips.cc/ https://proceedings.neurips.cc/ |
Publication series
| Name | Advances in Neural Information Processing Systems |
|---|---|
| Publisher | Neural information processing systems foundation |
| ISSN (Print) | 1049-5258 |
Conference
| Conference | 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024) |
|---|---|
| Abbreviated title | NeurIPS 2024 |
| Place | Canada |
| City | Vancouver |
| Period | 10/12/24 → 15/12/24 |
| Internet address |