Projects per year
Abstract
Recent advancements in neural image compression have shown great potential in outperforming conventional standard codecs in terms of both rate-distortion and rate-analysis performance. However, there is an issue of divergent preferences in information preservation or reconstruction in the process of compression for humans and machines, respectively. Compression for humans tends to retain the signal fidelity or perceptual quality of visual appearance while compression for machines requires preserving critical semantic information, resulting in the limitation of the bitstream supporting only a single requirement during the compression. To bridge this gap, we propose a dynamic adaptation approach that generates a single bitstream serving both humans and machines. This approach aims to mitigate the domain gap among tasks, which facilitates maintaining the performance of out-of-scope tasks. Specifically, the proposed method concentrates on learning a dynamic adaptation process, i.e., optimizing the latent representation in the compressed domain in an end-to-end manner while adhering to the rate-performance constraint. Extensive results reveal that our paradigm significantly reduces the domain gap, surpassing existing codecs. © 2024 IEEE
| Original language | English |
|---|---|
| Title of host publication | 2024 IEEE International Conference on Image Processing (ICIP) - Proceedings |
| Publisher | IEEE |
| Pages | 1788-1794 |
| ISBN (Electronic) | 979-8-3503-4939-9 |
| DOIs | |
| Publication status | Published - Oct 2024 |
| Event | 31st IEEE International Conference on Image Processing (ICIP 2024): Trustworthy Visual Data Processing - Abu Dhabi, United Arab Emirates Duration: 27 Oct 2024 → 30 Oct 2024 https://2024.ieeeicip.org/ |
Publication series
| Name | Proceedings - International Conference on Image Processing, ICIP |
|---|---|
| ISSN (Print) | 1522-4880 |
Conference
| Conference | 31st IEEE International Conference on Image Processing (ICIP 2024) |
|---|---|
| Abbreviated title | IEEE ICIP 2024 |
| Place | United Arab Emirates |
| City | Abu Dhabi |
| Period | 27/10/24 → 30/10/24 |
| Internet address |
Funding
This work is supported in part by the Shenzhen Science and Technology Program under Project JCYJ20220530140816037, in part by the Hong Kong Research Grants Council General Research Fund 11203220, in part by the Innovation and Technology Fund Project GHP/044/21SZ, and in part by the (Guangdong Basic and Applied Basic Research Foundation) (2024A1515010454).
Research Keywords
- dynamic adaptation
- human-machine collaboration
- Learned image compression
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'LEARNED IMAGE COMPRESSION FOR BOTH HUMANS AND MACHINES VIA DYNAMIC ADAPTATION'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: Towards Smart Visual Sensor Data Representation with Intelligent Sensing in the Internet of Video Things
WANG, S. (Principal Investigator / Project Coordinator), Huang, T. (Co-Investigator) & XUE, C. J. (Co-Investigator)
1/01/21 → 23/06/25
Project: Research