Semantic-Aware Visual Decomposition for Image Coding

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

2 Scopus Citations
View graph of relations

Author(s)

  • Jianhui Chang
  • Jian Zhang
  • Jiguo Li
  • Qi Mao
  • Chuanmin Jia
  • Siwei Ma
  • Wen Gao

Related Research Unit(s)

Detail(s)

Original languageEnglish
Pages (from-to)2333–2355
Journal / PublicationInternational Journal of Computer Vision
Volume131
Issue number9
Online published2 Jun 2023
Publication statusPublished - Sept 2023

Abstract

In this paper, we propose a novel image coding framework with semantic-aware visual decomposition towards extremely low bitrate compression. In particular, an input image is analyzed into a semantic map as structural representation and semantic-wise texture representation and further compressed into bitstreams at the encoder side. On the decoder side, the received bitstreams of dual-layer representations are decoded and reconstructed for target image synthesis with generative models. Moreover, the attention mechanism is introduced into the model architecture for texture representation modeling and a coherency regularization is proposed to further optimize the texture representation space by aligning the representation space with the source pixel space for higher synthesis quality. Besides, we also propose a cross-channel entropy module and control the quantization scale to facilitate rate-distortion optimization. Upon compressing the decomposed components into the bitstream, the simple yet effective representation philosophy benefits image compression in many aspects. First, in terms of compression performance, compact representations, and high visual synthesis quality can bring remarkable advantages. Second, the proposed framework yields a physically explainable bitstream composed of the structural segment and semantic-wise texture segments. Third and most importantly, subsequent vision tasks (e.g., content manipulation) can receive fundamental support from the semantic-aware visual decomposition and synthesis mechanism. Extensive experimental results demonstrate the superiority of the proposed framework towards efficient visual representation learning, high efficiency image compression (< 0.1 bpp), and intelligent visual applications (e.g., manipulation and analysis). © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023

Research Area(s)

  • Coherency regularization, Extremely low bitrate, Image coding, Semantic-aware visual decomposition, Structure–texture

Citation Format(s)

Semantic-Aware Visual Decomposition for Image Coding. / Chang, Jianhui; Zhang, Jian; Li, Jiguo et al.
In: International Journal of Computer Vision, Vol. 131, No. 9, 09.2023, p. 2333–2355.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review