4D Gaussian Videos with Motion Layering

Pinxuan DAI (Co-first Author), Peiquan ZHANG (Co-first Author), Zheng DONG, Ke XU, Yifan PENG, Dandan DING, Yujun SHEN, Yin YANG, Xinguo LIU, Rynson W.H. LAU, Weiwei XU*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Online free-view navigation in volumetric videos requires high-quality rendering and real-Time streaming in order to provide immersive user experiences. However, existing methods (e.g., dynamic NeRF and 3DGS) may not handle dynamic scenes with complex motions, and their models may not be streamable due to storage and bandwidth constraints. In this paper, we propose a novel 4D Gaussian Video (4DGV) approach that enables the creation and streaming of photorealistic, volumetric videos for dynamic scenes over the Internet. The core of our 4DGV is a novel streamable group of Gaussians (GOG) representation based on motion layering. Each GOG consists of static and dynamic points obtained via lifting 2D segmentation into 3D in motion layering, where the deformation of each dynamic point is represented as the temporal offset of its attributes. We also adaptively convert static points back to dynamic points to handle the appearance change, (e.g., moving shadows and reflections), of static objects through optimization. To support real-Time streaming of 4DGVs, we show that by applying quantization on Gaussian attributes and H.265 encoding on deformation offsets, our GOG representation can be significantly compressed (to around 6% of the original model size) without sacrificing the accuracy (PSNR loss less than 0.01dB). Extensive experiments on standard benchmarks demonstrate that our method outperforms state-of-The-Art volumetric video approaches, with superior rendering quality and minimum storage overheads. © 2025 Copyright held by the owner/author(s).
Original languageEnglish
Article number124
Number of pages14
JournalACM Transactions on Graphics
Volume44
Issue number4
Online published27 Jul 2025
DOIs
Publication statusPublished - Aug 2025

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Funding

We thank the anonymous reviewers for their constructive comments. We also thank Tongchen Zhang, Yaohui Niu, and Muwei Zhou for their assistance with dataset capturing, as well as Xiuchao Wu and Xin Zhang for their insightful discussions. Weiwei Xu is partially supported by NSFC grant No. 62421003, and Xinguo Liu is partially supported by NSFC grant No. 62032011. This paper is partially supported by the Yongjiang Innovation Project No. 2025Z062, a project from the Hong Kong Productivity Council (Ref.: 9231463), and Information Technology Center and State Key Lab of CAD&CG, Zhejiang University.

Research Keywords

  • gaussian video compression
  • group of gaussians
  • motion layering
  • volumetric video

Fingerprint

Dive into the research topics of '4D Gaussian Videos with Motion Layering'. Together they form a unique fingerprint.

Cite this