Abstract
Scene flow captures the dynamic changes of points in a 3-D scene, essential for understanding motion in physical environments. Light detection and ranging (LiDAR)-based scene flow estimation methods face challenges related to resolution, refresh rate, and cost. In contrast, monocular image-based methods estimate optical flow and depth separately at different stages. This fragmented approach inevitably compromises spatial-temporal consistency and introduces error accumulation. We propose monocular point cloud FlowNet (MonoPCFlow), a novel framework for scene flow estimation directly from a pair of consecutive monocular images. We integrate pseudo-LiDAR representations with dense 3-D scene flow estimation, effectively bridging the 2-D-to-3-D domain gap for monocular motion analysis. We develop a depth-enhanced refinement module that mitigates information loss in pseudo-LiDAR generation, preserving critical geometric and appearance features to improve scene flow accuracy. Experimental validation demonstrates MonoPCFlow's superior performance, achieving 37.0% (FlyingThings3D) and 39.7% Karlsruhe Institute of Technology and Toyota Institute of Technology (KITTI) relative reductions in endpoint-error compared to contemporary benchmarks. © 2025 IEEE.
| Original language | English |
|---|---|
| Article number | 5043810 |
| Number of pages | 10 |
| Journal | IEEE Transactions on Instrumentation and Measurement |
| Volume | 74 |
| Online published | 3 Sept 2025 |
| DOIs | |
| Publication status | Published - 2025 |
Funding
This work was supported in part by the Natural Science Foundation of China under Grant 62225309, Grant U24A20278, Grant 62361166632, Grant 62403311, and Grant 62373314 and in part by the Research Grants Council of the Hong Kong, Special Administrative Region of China, under Project CityU/11207323.
Research Keywords
- Feature extraction
- Three-dimensional displays
- Point cloud compression
- Estimation
- Accuracy
- Optical flow
- Image motion analysis
- Depth measurement
- Computer vision
- Laser radar
- Monocular images
- pseudo-light detection and ranging (LiDAR) point cloud
- scene flow estimation
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'MonoPCFlow: Enabling Efficient Scene Flow Estimation From Monocular View'. Together they form a unique fingerprint.Projects
- 1 Active
-
GRF: Distributed Optimal Self-Deployment Control of Multiple Dynamic Systems subject to Various Uncertainties and Communication Constraints
LIU, L. (Principal Investigator / Project Coordinator)
1/01/24 → …
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver