TY - JOUR
T1 - 4D Gaussian Videos with Motion Layering
AU - DAI, Pinxuan
AU - ZHANG, Peiquan
AU - DONG, Zheng
AU - XU, Ke
AU - PENG, Yifan
AU - DING, Dandan
AU - SHEN, Yujun
AU - YANG, Yin
AU - LIU, Xinguo
AU - LAU, Rynson W.H.
AU - XU, Weiwei
N1 - Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).
PY - 2025/8
Y1 - 2025/8
N2 - Online free-view navigation in volumetric videos requires high-quality rendering and real-Time streaming in order to provide immersive user experiences. However, existing methods (e.g., dynamic NeRF and 3DGS) may not handle dynamic scenes with complex motions, and their models may not be streamable due to storage and bandwidth constraints. In this paper, we propose a novel 4D Gaussian Video (4DGV) approach that enables the creation and streaming of photorealistic, volumetric videos for dynamic scenes over the Internet. The core of our 4DGV is a novel streamable group of Gaussians (GOG) representation based on motion layering. Each GOG consists of static and dynamic points obtained via lifting 2D segmentation into 3D in motion layering, where the deformation of each dynamic point is represented as the temporal offset of its attributes. We also adaptively convert static points back to dynamic points to handle the appearance change, (e.g., moving shadows and reflections), of static objects through optimization. To support real-Time streaming of 4DGVs, we show that by applying quantization on Gaussian attributes and H.265 encoding on deformation offsets, our GOG representation can be significantly compressed (to around 6% of the original model size) without sacrificing the accuracy (PSNR loss less than 0.01dB). Extensive experiments on standard benchmarks demonstrate that our method outperforms state-of-The-Art volumetric video approaches, with superior rendering quality and minimum storage overheads. © 2025 Copyright held by the owner/author(s).
AB - Online free-view navigation in volumetric videos requires high-quality rendering and real-Time streaming in order to provide immersive user experiences. However, existing methods (e.g., dynamic NeRF and 3DGS) may not handle dynamic scenes with complex motions, and their models may not be streamable due to storage and bandwidth constraints. In this paper, we propose a novel 4D Gaussian Video (4DGV) approach that enables the creation and streaming of photorealistic, volumetric videos for dynamic scenes over the Internet. The core of our 4DGV is a novel streamable group of Gaussians (GOG) representation based on motion layering. Each GOG consists of static and dynamic points obtained via lifting 2D segmentation into 3D in motion layering, where the deformation of each dynamic point is represented as the temporal offset of its attributes. We also adaptively convert static points back to dynamic points to handle the appearance change, (e.g., moving shadows and reflections), of static objects through optimization. To support real-Time streaming of 4DGVs, we show that by applying quantization on Gaussian attributes and H.265 encoding on deformation offsets, our GOG representation can be significantly compressed (to around 6% of the original model size) without sacrificing the accuracy (PSNR loss less than 0.01dB). Extensive experiments on standard benchmarks demonstrate that our method outperforms state-of-The-Art volumetric video approaches, with superior rendering quality and minimum storage overheads. © 2025 Copyright held by the owner/author(s).
KW - gaussian video compression
KW - group of gaussians
KW - motion layering
KW - volumetric video
UR - http://www.scopus.com/inward/record.url?scp=105011873467&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-105011873467&origin=recordpage
U2 - 10.1145/3731189
DO - 10.1145/3731189
M3 - RGC 21 - Publication in refereed journal
SN - 0730-0301
VL - 44
JO - ACM Transactions on Graphics
JF - ACM Transactions on Graphics
IS - 4
M1 - 124
ER -