Skip to main navigation Skip to search Skip to main content

PVNet: Point-Voxel Interaction LiDAR Scene Upsampling via Diffusion Models

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Accurate 3D scene understanding in outdoor environments heavily relies on high-quality point clouds. However, LiDAR-scanned data often suffer from extreme sparsity, severely hindering downstream 3D perception tasks. Existing point cloud upsampling methods primarily focus on individual objects, thus demonstrating limited generalization capability for complex outdoor scenes. To address this issue, we propose PVNet, a diffusion model-based point-voxel interaction framework to perform LiDAR point cloud upsampling without dense supervision. Specifically, we adopt the classifier-free guidance-based DDPMs to guide the generation, in which we employ a sparse point cloud as the guiding condition and the synthesized point clouds derived from its nearby frames as the input. Moreover, we design a voxel completion module to refine and complete the coarse voxel features for enriching the feature representation. In addition, we propose a point-voxel interaction module to integrate features from both points and voxels, which efficiently improves the environmental perception capability of each upsampled point. To the best of our knowledge, our approach is the first scene-level point cloud upsampling method supporting arbitrary upsampling rates. Extensive experiments on various benchmarks demonstrate that our method achieves state-of-the-art performance. The source code will be available at https://github.com/chengxianjing/PVNet. © 2025 IEEE.
Original languageEnglish
Pages (from-to)6895-6910
JournalIEEE Transactions on Image Processing
Volume34
DOIs
Publication statusPublished - Oct 2025

Funding

This work was supported by the Foundation of the Science and Technology project of Guangxi under Grant Guike AD21220114, the National Nature Science Foundation of China under Grants 61876051, 62466005, and 62422118, in part by the Shenzhen Key Laboratory of Visual Object Detection and Recognition under Grant ZDSYS20190902093015527, and in part by the Hong Kong Research Grants Council under Grants 11219324 11219422, and 11202320.

Research Keywords

  • LiDAR upsampling
  • diffusion model
  • voxel completion
  • point-voxel interaction

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'PVNet: Point-Voxel Interaction LiDAR Scene Upsampling via Diffusion Models'. Together they form a unique fingerprint.

Cite this