Projects per year
Abstract
Recently, deep generative models have greatly advanced the progress of face video coding towards promising rate-distortion performance and diverse application functionalities. Beyond traditional hybrid video coding paradigms, Generative Face Video Compression (GFVC) relying on the strong capabilities of deep generative models and the philosophy of early Model-Based Coding (MBC) can facilitate the compact representation and realistic reconstruction of visual face signal, thus achieving ultra-low bitrate face video communication. However, these GFVC algorithms are sometimes faced with unstable reconstruction quality and limited bitrate ranges. To address these problems, this paper proposes a novel Progressive Face Video Compression framework, namely PFVC, that utilizes adaptive visual tokens to realize exceptional trade-offs between reconstruction robustness and bandwidth intelligence. In particular, the encoder of the proposed PFVC projects the high-dimensional face signal into adaptive visual tokens in a progressive manner, whilst the decoder can further reconstruct these adaptive visual tokens for motion estimation and signal synthesis with different granularity levels. Experimental results demonstrate that the proposed PFVC framework can achieve better coding flexibility and superior rate-distortion performance in comparison with the latest Versatile Video Coding (VVC) codec and the state-of-the-art GFVC algorithms. The project page can be found at https://github.com/Berlin0610/PFVC.
©2025 IEEE
©2025 IEEE
Original language | English |
---|---|
Title of host publication | Proceedings 2025 Data Compression Conference |
Editors | Ali Bilgin, James E. Fowler, Joan Serra-Sagrista, Yan Ye , James A. Storer |
Publisher | IEEE Computer Society Conference Publishing Services (CPS) |
Pages | 163-172 |
Number of pages | 10 |
ISBN (Electronic) | 979-8-3315-3471-4 |
ISBN (Print) | 979-8-3315-3472-1 |
DOIs | |
Publication status | Published - 2025 |
Event | 2025 Data Compression Conference - Cliff Lodge convention center, Salt Lake City, United States Duration: 18 Mar 2025 → 21 Mar 2025 https://datacompressionconference.org/ |
Publication series
Name | |
---|---|
ISSN (Print) | 1068-0314 |
ISSN (Electronic) | 2375-0359 |
Conference
Conference | 2025 Data Compression Conference |
---|---|
Abbreviated title | DCC 2025 |
Country/Territory | United States |
City | Salt Lake City |
Period | 18/03/25 → 21/03/25 |
Internet address |
Bibliographical note
Research Unit(s) information for this publication is provided by the author(s) concerned.Funding
This work is supported in part by the Hong Kong Research Grants Council General Research Fund 11200323, in part by the Innovation and Technology Fund Project GHP/044/21SZ, and in part by the Alibaba Innovative Research.
Research Keywords
- Video coding
- Visualization
- Codecs
- Bit rate
- Rate-distortion
- Bandwidth
- Video compression
- Encoding
- Signal synthesis
- Faces
Fingerprint
Dive into the research topics of 'Beyond GFVC: Progressive Face Video Compression Framework with Adaptive Visual Tokens'. Together they form a unique fingerprint.Projects
- 2 Active
-
GRF: Semantic Visual Data Compression for Vehicular Communications in Intelligent Driving Systems
WANG, S. (Principal Investigator / Project Coordinator) & WU, D. (Co-Investigator)
1/01/24 → …
Project: Research
-
ITF: Compression, Transmission and National Standardization for Ultra-high-definition VR Videos
WANG, S. (Principal Investigator / Project Coordinator), CHEN, B. (Co-Investigator), KWONG, T. W. S. (Co-Investigator) & ZHU, L. (Co-Investigator)
1/07/23 → …
Project: Research