Abstract
Current test- or compression-time adaptation image compression (TTA-IC) approaches, which leverage both latent and decoder refinements as a two-step adaptation scheme, have potentially enhanced the rate-distortion (R-D) performance of learned image compression models on cross-domain compression tasks, e.g., from natural to screen content images. However, compared with the emergence of various decoder refinement variants, the latent refinement, as an inseparable ingredient, is barely tailored to cross-domain scenarios. To this end, we are interested in developing an advanced latent refinement method by extending the effective hybrid latent refinement (HLR) method, which is designed for in-domain inference improvement but shows noticeable degradation of the rate cost in cross-domain tasks. Specifically, we first provide theoretical analyses, in a cue of marginalization approximation from in- to cross-domain scenarios, to uncover that the vanilla HLR suffers from an underlying mismatch between refined Gaussian conditional and hyperprior distributions, leading to deteriorated joint probability approximation of marginal distribution with increased rate consumption. To remedy this issue, we introduce a simple Bayesian approximation-endowed distribution regularization to encourage learning a better joint probability approximation in a plug-and-play manner. Extensive experiments on six in- and cross-domain datasets demonstrate that our proposed method not only improves the R-D performance compared with other latent refinement counterparts, but also can be flexibly integrated into existing TTA-IC methods with incremental benefits. Our code is available at https://tonyckc.github.io/TTA-IC-DR/.
| Original language | English |
|---|---|
| Title of host publication | The Thirteenth International Conference on Learning Representations (ICLR 2025) |
| Editors | Y. Yue, A. Garg, N. Peng, F. Sha, R. Yu |
| Publisher | International Conference on Learning Representations, ICLR |
| Pages | 23686-23703 |
| ISBN (Electronic) | 9798331320850 |
| ISBN (Print) | 9798331320850 |
| Publication status | Presented - 25 Apr 2025 |
| Event | 13th International Conference on Learning Representations (ICLR 2025) - Singapore EXPO, Singapore Duration: 24 Apr 2025 → 28 Apr 2025 https://iclr.cc/Conferences/2025 |
Publication series
| Name | 13th International Conference on Learning Representations, ICLR 2025 |
|---|
Conference
| Conference | 13th International Conference on Learning Representations (ICLR 2025) |
|---|---|
| Abbreviated title | ICLR 2025 |
| Place | Singapore |
| Period | 24/04/25 → 28/04/25 |
| Internet address |
Bibliographical note
Research Unit(s) information for this publication is provided by the author(s) concerned.Funding
This work was supported in part by the Hong Kong Innovation and Technology Commission (ITC) (InnoHK Project CIMDA), in part by the Institute of Digital Medicine of City University of Hong Kong (Project 9229503), in part by the Hong Kong Research Grants Council under Projects 21200522, 11200323 and 11203220, in part by Chow Sang Sang Donation and Matching Fund (Project 9229161), and in part by the Hong Kong Innovation and Technology Commission (Project GHP/044/21SZ).
Research Keywords
- test-time adaptation
- image compression
- entropy coding
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/
RGC Funding Information
- RGC-funded