Skip to main navigation Skip to search Skip to main content

Disentangled Visual Representations for Extreme Human Body Video Compression

Ruofan Wang, Qi Mao*, Shiqi Wang, Chuanmin Jia, Ronggang Wang, Siwei Ma

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Recent years have witnessed the great promise of deep neural video compression codecs. However, there are still unprecedented challenges ahead when the videos are expected to be encoded with extremely low bitrate. Motivated by recent attempts of layered conceptual image compression, we make the first attempt to leverage the disentangled visual representations for extreme human body video compression. More specifically, to capture the main structure, we adopt the inferred human pose keypoints as the structure code of each frame, thereby deriving the motion information from structure codes of adjacent frames for further compression. To better exploit the texture redundancy, all frames share the same texture codes by incorporating the proposed texture contrastive learning to ensure texture consistency within a video. Two branches are consequently transmitted in a separable manner, and the generator synthesizes the reconstructed video with the combination of all decoded representations at the decoder side. Both qualitative and quantitative experimental results demonstrate that the proposed scheme can produce perceptually pleasing reconstruction results in ultra-low bitrates far below that can be reached by other video codecs.
Original languageEnglish
Title of host publicationIEEE ICME - IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO 2022
Subtitle of host publicationICME 2022 - CONFERENCE PROCEEDINGS
PublisherIEEE Computer Society
Number of pages6
ISBN (Electronic)9781665485630
ISBN (Print)9781665485647
DOIs
Publication statusPublished - 2022
Event2022 IEEE International Conference on Multimedia and Expo (ICME 2022) - Hybrid, Taipei, Taiwan, China
Duration: 18 Jul 202222 Jul 2022
https://2022.ieeeicme.org/

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
Volume2022-July
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

Conference2022 IEEE International Conference on Multimedia and Expo (ICME 2022)
Abbreviated titleIEEE ICME 2022
PlaceTaiwan, China
CityTaipei
Period18/07/2222/07/22
Internet address

Research Keywords

  • contrastive learning
  • disentangled representations
  • Generative adversarial network
  • Video compression

Fingerprint

Dive into the research topics of 'Disentangled Visual Representations for Extreme Human Body Video Compression'. Together they form a unique fingerprint.

Cite this