Beyond Keypoint Coding : Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

20 Scopus Citations
View graph of relations

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationProceedings - DCC 2022: 2022 Data Compression Conference
EditorsAli Bilgin, Michael W. Marcellin, Joan Serra-Sagrista, James A. Storer
PublisherInstitute of Electrical and Electronics Engineers, Inc.
Pages13-22
ISBN (electronic)9781665478939
ISBN (print)978166578946
Publication statusPublished - 2022

Publication series

NameData Compression Conference Proceedings
Volume2022-March
ISSN (Print)1068-0314
ISSN (electronic)2375-0359

Conference

Title2022 Data Compression Conference (DCC 2022)
PlaceUnited States
CitySnowbird
Period22 - 25 March 2022

Abstract

We propose a talking face video compression framework by implicitly transforming the temporal evolution into compact feature representation. More specifically, the temporal evolution of faces, which is complex, non-linear and difficult to extrapolate, is modelled in an end-to-end inference framework based upon very compact features. This enables the high-quality rendering of the face videos, which benefits from the learning of dense motion map with compact feature representation. Therefore, the proposed framework can accommodate ultra-low bandwidth video communication and maintain the quality of the reconstructed videos. Experimental results demonstrate that compared with the state-of-the-art video coding standard Versatile Video Coding (VVC) as well as the latest generative compression scheme Face Video-to-Video Synthesis (Facevid2vid), the proposed scheme is superior in terms of both objective and subjective quality assessment methods.

Bibliographic Note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Citation Format(s)

Beyond Keypoint Coding: Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression. / Chen, Bolin; Wang, Zhao; Li, Binzhe et al.
Proceedings - DCC 2022: 2022 Data Compression Conference. ed. / Ali Bilgin; Michael W. Marcellin; Joan Serra-Sagrista; James A. Storer. Institute of Electrical and Electronics Engineers, Inc., 2022. p. 13-22 (Data Compression Conference Proceedings; Vol. 2022-March).

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review