Skip to main navigation Skip to search Skip to main content

MonoPCFlow: Enabling Efficient Scene Flow Estimation From Monocular View

  • Chichao Cheng
  • , Guangming Wang
  • , Yin-Dong Zheng
  • , Lu Liu*
  • , Hesheng Wang*
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Scene flow captures the dynamic changes of points in a 3-D scene, essential for understanding motion in physical environments. Light detection and ranging (LiDAR)-based scene flow estimation methods face challenges related to resolution, refresh rate, and cost. In contrast, monocular image-based methods estimate optical flow and depth separately at different stages. This fragmented approach inevitably compromises spatial-temporal consistency and introduces error accumulation. We propose monocular point cloud FlowNet (MonoPCFlow), a novel framework for scene flow estimation directly from a pair of consecutive monocular images. We integrate pseudo-LiDAR representations with dense 3-D scene flow estimation, effectively bridging the 2-D-to-3-D domain gap for monocular motion analysis. We develop a depth-enhanced refinement module that mitigates information loss in pseudo-LiDAR generation, preserving critical geometric and appearance features to improve scene flow accuracy. Experimental validation demonstrates MonoPCFlow's superior performance, achieving 37.0% (FlyingThings3D) and 39.7% Karlsruhe Institute of Technology and Toyota Institute of Technology (KITTI) relative reductions in endpoint-error compared to contemporary benchmarks. © 2025 IEEE.
Original languageEnglish
Article number5043810
Number of pages10
JournalIEEE Transactions on Instrumentation and Measurement
Volume74
Online published3 Sept 2025
DOIs
Publication statusPublished - 2025

Funding

This work was supported in part by the Natural Science Foundation of China under Grant 62225309, Grant U24A20278, Grant 62361166632, Grant 62403311, and Grant 62373314 and in part by the Research Grants Council of the Hong Kong, Special Administrative Region of China, under Project CityU/11207323.

Research Keywords

  • Feature extraction
  • Three-dimensional displays
  • Point cloud compression
  • Estimation
  • Accuracy
  • Optical flow
  • Image motion analysis
  • Depth measurement
  • Computer vision
  • Laser radar
  • Monocular images
  • pseudo-light detection and ranging (LiDAR) point cloud
  • scene flow estimation

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'MonoPCFlow: Enabling Efficient Scene Flow Estimation From Monocular View'. Together they form a unique fingerprint.

Cite this