Abstract
Image Coding for Machine (ICM) aims to compress an image so that the reconstructed one can meet the requirements of both human vision and machine vision. Existing methods apply the constraint from the downstream models to improve machine analytics performance while compromising the visual quality. This paper proposes a novel adversarially augmented adaptation route that achieves a better trade-off between the utility of the human and machine perspectives by making slight changes to the image manifold. In detail, a targeted adversarial attack is employed to generate subtle image perturbations that are nearly imperceptible to humans but significantly improve machine analytic performance. These perturbed images would be subsequently employed as ground truth to guide training/fine-tuning of an end-to-end image compression network. Note that, our method is a plug-and-play framework that does not rely on any change in existing architecture or loss functions. Extensive experimental results demonstrate the superiority of the proposed scheme over conventional ICM frameworks and the effectiveness of our design. © 2024 IEEE.
| Original language | English |
|---|---|
| Title of host publication | 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings |
| Publisher | IEEE |
| Pages | 3605-3609 |
| ISBN (Electronic) | 979-8-3503-4485-1 |
| DOIs | |
| Publication status | Published - Apr 2024 |
| Event | 49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024) - COEX, Seoul, Korea, Republic of Duration: 14 Apr 2024 → 19 Apr 2024 https://2024.ieeeicassp.org/ |
Publication series
| Name | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
|---|---|
| ISSN (Print) | 1520-6149 |
Conference
| Conference | 49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024) |
|---|---|
| Place | Korea, Republic of |
| City | Seoul |
| Period | 14/04/24 → 19/04/24 |
| Internet address |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Research Keywords
- Image coding for machine
- Machine vision
- Machine vision coding
- Targeted adversarial attack