Towards Analysis-friendly Face Representation with Scalable Feature and Texture Compression

Research output: Journal Publications and Reviews (RGC: 21, 22, 62) > 21_Publication in refereed journal > peer-review

1 Scopus Citations

Author(s)

  • Shurun Wang
  • Shiqi Wang
  • Wenhan Yang
  • Xinfeng Zhang
  • Shanshe Wang
  • Siwei Ma
  • Wen Gao

Detail(s)

Original language: English
Pages (from-to): 3169-3181
Number of pages: 13
Journal / Publication: IEEE Transactions on Multimedia
Volume: 24
Online published: 2 Jul 2021
Publication status: Published - 2022

Abstract

Compactly representing visual information plays a fundamental role in optimizing the ultimate utility of myriad visual data-centered applications. Numerous approaches have been proposed to efficiently compress texture and visual features for human visual perception and machine intelligence, respectively; however, much less work has been dedicated to studying the interactions between them. Here, we investigate the integration of feature and texture compression and show that a universal and collaborative visual information representation can be achieved in a hierarchical way. In particular, we study feature and texture compression in a scalable coding framework, where the base layer carries the deep learning feature and the enhancement layer aims to perfectly reconstruct the texture. Leveraging the strong generative capability of deep neural networks, the gap between the base feature layer and the enhancement layer is filled with feature-level texture reconstruction, which constructs texture representations from the features. As such, the residuals between the original and reconstructed texture can then be conveyed in the enhancement layer. To improve the efficiency of the proposed framework, the base layer neural network is trained in a multitask manner such that the learned features support both high-quality reconstruction and high-accuracy analysis. The framework and optimization strategies are further applied to face image compression, and promising coding performance is achieved in terms of both rate-fidelity and rate-accuracy evaluations.
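
The layered structure described in the abstract can be illustrated with a toy sketch. This is not the authors' method: block averaging and sample repetition are hypothetical stand-ins for the deep feature extractor and the generative reconstruction network, and every function name here is an assumption made for illustration. The sketch only shows how a base layer, a feature-level reconstruction, and an enhancement-layer residual fit together.

```python
# Toy illustration only. Block averaging and sample repetition stand in for
# the deep networks of the paper, which this record does not describe.

def extract_base_feature(signal, block=2):
    """Base layer: a compact 'feature' (here, the mean of each block)."""
    chunks = [signal[i:i + block] for i in range(0, len(signal), block)]
    return [sum(chunk) / len(chunk) for chunk in chunks]

def reconstruct_from_feature(feature, length, block=2):
    """Feature-level texture reconstruction (here, repeat each block mean)."""
    return [feature[i // block] for i in range(length)]

def encode(signal, block=2):
    """Return (base layer, enhancement layer) for a 1-D 'texture'."""
    feature = extract_base_feature(signal, block)
    predicted = reconstruct_from_feature(feature, len(signal), block)
    # Enhancement layer: what the feature-based reconstruction missed.
    residual = [s - p for s, p in zip(signal, predicted)]
    return feature, residual

def decode(feature, length, residual=None, block=2):
    """Decode from the base layer alone, or refine it with the residual."""
    predicted = reconstruct_from_feature(feature, length, block)
    if residual is None:
        return predicted
    return [p + r for p, r in zip(predicted, residual)]

texture = [10, 12, 20, 26, 7]
feature, residual = encode(texture)
coarse = decode(feature, len(texture))           # base layer only
exact = decode(feature, len(texture), residual)  # base + enhancement layer
```

In this sketch, a decoder that receives only the base layer obtains the feature (usable for analysis) and a coarse reconstruction, while a decoder that also receives the enhancement layer recovers the original samples.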

Research Area(s)

  • Deep learning, Feature compression, Feature extraction, Image coding, Image reconstruction, multitask learning, scalable coding, Standards, texture compression, Transform coding, Visualization

Citation Format(s)

Towards Analysis-friendly Face Representation with Scalable Feature and Texture Compression. / Wang, Shurun; Wang, Shiqi; Yang, Wenhan; Zhang, Xinfeng; Wang, Shanshe; Ma, Siwei; Gao, Wen.

In: IEEE Transactions on Multimedia, Vol. 24, 2022, p. 3169-3181.
