Enabling Translatability of Generative Face Video Coding: A Unified Face Feature Transcoding Framework

Research output: Chapters, Conference Papers, Creative and Literary Works; RGC 32 - Refereed conference paper (with host publication); peer-review

7 Citations (Scopus)

Abstract

Generative face video coding (GFVC) can achieve high-quality visual face communication at ultra-low bit-rate ranges via strong facial prior learning and realistic generation. However, different kinds of feature representations hinder the interoperability of GFVC, as the bitstream generated from one type of feature representation can only be correctly understood by the corresponding decoder. In this paper, we make the first attempt to propose a face feature transcoding framework that enables translatability in GFVC. By integrating a face feature transcoder at the decoder side, received face features can be translated to decoder-specific ones for subsequent face reconstruction. Furthermore, the translation between different types of face features can be achieved using a unified transcoding framework, facilitating seamless interoperability between different facial representations and their associated decoders. Experimental results demonstrate that three mainstream GFVC codecs, each utilizing different face features, can be effectively adapted to one another while retaining promising coding performance, largely extending the generality of the GFVC system. The project page can be found at https://github.com/xyzysz/GFVC_Software-Decoder_Interoperability. © 2024 IEEE.
Original language: English
Title of host publication: Proceedings - DCC 2024, 2024 Data Compression Conference
Editors: Ali Bilgin, James E. Fowler, Joan Serra-Sagrista, Yan Ye, James A. Storer
Publisher: IEEE
Pages: 113-122
Number of pages: 10
ISBN (Electronic): 979-8-3503-8587-8
ISBN (Print): 979-8-3503-8588-5
Publication status: Published - 2024

Publication series

ISSN (Print): 1068-0314
ISSN (Electronic): 2375-0359

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 62022002, in part by the Hong Kong Research Grants Council General Research Fund 11203220, in part by the Innovation and Technology Fund Project GHP/044/21SZ, and in part by Alibaba Innovative Research.

Research Keywords

  • Video coding
  • Visualization
  • Codecs
  • Data compression
  • Transcoding
  • Standardization
  • Decoding
  • Generative coding
  • Face video
  • Decoding interoperability
