Abstract
Recent years have witnessed the great promise of deep neural video compression codecs. However, there are still unprecedented challenges ahead when the videos are expected to be encoded with extremely low bitrate. Motivated by recent attempts of layered conceptual image compression, we make the first attempt to leverage the disentangled visual representations for extreme human body video compression. More specifically, to capture the main structure, we adopt the inferred human pose keypoints as the structure code of each frame, thereby deriving the motion information from structure codes of adjacent frames for further compression. To better exploit the texture redundancy, all frames share the same texture codes by incorporating the proposed texture contrastive learning to ensure texture consistency within a video. Two branches are consequently transmitted in a separable manner, and the generator synthesizes the reconstructed video with the combination of all decoded representations at the decoder side. Both qualitative and quantitative experimental results demonstrate that the proposed scheme can produce perceptually pleasing reconstruction results in ultra-low bitrates far below that can be reached by other video codecs.
| Original language | English |
|---|---|
| Title of host publication | IEEE ICME - IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO 2022 |
| Subtitle of host publication | ICME 2022 - CONFERENCE PROCEEDINGS |
| Publisher | IEEE Computer Society |
| Number of pages | 6 |
| ISBN (Electronic) | 9781665485630 |
| ISBN (Print) | 9781665485647 |
| DOIs | |
| Publication status | Published - 2022 |
| Event | 2022 IEEE International Conference on Multimedia and Expo (ICME 2022) - Hybrid, Taipei, Taiwan, China Duration: 18 Jul 2022 → 22 Jul 2022 https://2022.ieeeicme.org/ |
Publication series
| Name | Proceedings - IEEE International Conference on Multimedia and Expo |
|---|---|
| Volume | 2022-July |
| ISSN (Print) | 1945-7871 |
| ISSN (Electronic) | 1945-788X |
Conference
| Conference | 2022 IEEE International Conference on Multimedia and Expo (ICME 2022) |
|---|---|
| Abbreviated title | IEEE ICME 2022 |
| Place | Taiwan, China |
| City | Taipei |
| Period | 18/07/22 → 22/07/22 |
| Internet address |
Research Keywords
- contrastive learning
- disentangled representations
- Generative adversarial network
- Video compression
Fingerprint
Dive into the research topics of 'Disentangled Visual Representations for Extreme Human Body Video Compression'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver