Abstract
We propose a talking face video compression framework by implicitly transforming the temporal evolution into compact feature representation. More specifically, the temporal evolution of faces, which is complex, non-linear and difficult to extrapolate, is modelled in an end-to-end inference framework based upon very compact features. This enables the high-quality rendering of the face videos, which benefits from the learning of dense motion map with compact feature representation. Therefore, the proposed framework can accommodate ultra-low bandwidth video communication and maintain the quality of the reconstructed videos. Experimental results demonstrate that compared with the state-of-the-art video coding standard Versatile Video Coding (VVC) as well as the latest generative compression scheme Face Video-to-Video Synthesis (Facevid2vid), the proposed scheme is superior in terms of both objective and subjective quality assessment methods.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - DCC 2022: 2022 Data Compression Conference |
| Editors | Ali Bilgin, Michael W. Marcellin, Joan Serra-Sagrista, James A. Storer |
| Publisher | IEEE |
| Pages | 13-22 |
| ISBN (Electronic) | 9781665478939 |
| ISBN (Print) | 978166578946 |
| DOIs | |
| Publication status | Published - 2022 |
| Event | 2022 Data Compression Conference (DCC 2022) - Snowbird, United States Duration: 22 Mar 2022 → 25 Mar 2022 https://www.cs.brandeis.edu/~dcc/Program.html |
Publication series
| Name | Data Compression Conference Proceedings |
|---|---|
| Volume | 2022-March |
| ISSN (Print) | 1068-0314 |
| ISSN (Electronic) | 2375-0359 |
Conference
| Conference | 2022 Data Compression Conference (DCC 2022) |
|---|---|
| Place | United States |
| City | Snowbird |
| Period | 22/03/22 → 25/03/22 |
| Internet address |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Fingerprint
Dive into the research topics of 'Beyond Keypoint Coding: Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver