Abstract
We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visual explanation technique for interpreting the predictions of object detectors. Utilizing the gradients of detector targets flowing into the intermediate feature maps, ODAM produces heat maps that show the influence of regions on the detector's decision for each predicted attribute. Compared to previous works on classification activation maps (CAM), ODAM generates instance-specific explanations rather than class-specific ones. We show that ODAM is applicable to one-stage, two-stage, and transformer-based detectors with different types of detector backbones and heads, and produces higher-quality visual explanations than the state-of-the-art in terms of both effectiveness and efficiency. We discuss two explanation tasks for object detection: 1) object specification: what is the important region for the prediction? 2) object discrimination: which object is detected? Aiming at these two aspects, we present a detailed analysis of the visual explanations of detectors and carry out extensive experiments to validate the effectiveness of the proposed ODAM. Furthermore, we investigate user trust on the explanation maps, how well the visual explanations of object detectors agrees with human explanations, as measured through human eye gaze, and whether this agreement is related with user trust. Finally, we also propose two applications, ODAM-KD and ODAM-NMS, based on these two abilities of ODAM. ODAM-KD utilizes the object specification of ODAM to generate top-down attention for key predictions and instruct the knowledge distillation of object detection. ODAM-NMS considers the location of the model's explanation for each prediction to distinguish the duplicate detected objects. A training scheme, ODAM-Train, is proposed to improve the quality on object discrimination, and help with ODAM-NMS.© 2024 IEEE.
| Original language | English |
|---|---|
| Pages (from-to) | 5967-5985 |
| Number of pages | 19 |
| Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Volume | 46 |
| Issue number | 9 |
| Online published | 22 Mar 2024 |
| DOIs | |
| Publication status | Published - Sept 2024 |
Bibliographical note
Research Unit(s) information for this publication is provided by the author(s) concerned.Funding
We thank Alice Yang for help with collecting the eye gaze data. This work was supported by grants from Collaborative Research Fund #C7129 − 20G, Research Grant Council Hong Kong.
Research Keywords
- Deep learning
- Detectors
- explainable AI
- explaining object detection
- gradient-based explanation
- Heat maps
- human eye gaze
- instance-level explanation
- knowledge distillation
- non-maximum suppression
- Object detection
- object discrimination
- object specification
- Predictive models
- Task analysis
- Transformers
- Visualization
Publisher's Copyright Statement
- COPYRIGHT TERMS OF DEPOSITED POSTPRINT FILE: © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. Zhao, C., Hsiao, J. H., & Chan, A. B. (2024). Gradient-Based Instance-Specific Visual Explanations for Object Specification and Object Discrimination. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(9), 5967 - 5985. https://doi.org/10.1109/TPAMI.2024.3380604
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Gradient-Based Instance-Specific Visual Explanations for Object Specification and Object Discrimination'. Together they form a unique fingerprint.Projects
- 1 Finished
-
CRF-ExtU-Lead: Impact of Mask-use and Social Distancing during COVID-19 and NID Pandemic on Face and Facial Expression Recognition
HSIAO, J.H.-W. (Main Project Coordinator [External]) & CHAN, A. B. (Principal Investigator / Project Coordinator)
30/06/21 → 29/06/24
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver