3D Full-body Pose Estimation of Construction Machines Using Deep Neural Network and Stereo Vision

Han Luo, Mingzhu Wang*, Peter Kok-Yiu Wong, Pak Him Leung, Jingyuan Tang, Jack C.P. Cheng

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

The movement and posture variability of construction machines is a significant contributor to safety hazards on construction sites. Even when a machine’s location is fixed, its moving parts may collide with on-site personnel or objects, leading to injuries or production loss. Accurate estimation of 3D full-body poses of machines can enhance safety by providing more precise spatial information. This paper proposes a framework to estimate 3D full-body poses of construction machines using deep neural networks (DNNs) and stereo vision. The proposed framework employs an entropy-based active learning method to select informative images for fine-tuning the DNN model for 2D pose estimation. 3D poses are estimated through stereo camera calibration, coarse-to-fine stereo matching, and triangulation. Experimental validation using an excavator model achieved an average error percentage (AEP) of 12.11%, demonstrating the framework's feasibility for enhancing safety management. © 2025 Proceedings of the International Symposium on Automation and Robotics in Construction. All rights reserved.
Original languageEnglish
Title of host publicationProceedings of the 42nd International Symposium on Automation and Robotics in Construction
PublisherInternational Association for Automation and Robotics in Construction (IAARC)
Pages1300-1307
ISBN (Print)9780645832228
DOIs
Publication statusPublished - Jul 2025
Event42nd International Symposium on Automation and Robotics in Construction (ISARC 2025) - Concordia University, Montreal, Canada
Duration: 28 Jul 202531 Jul 2025
https://www.iaarc.org/isarc-2025

Publication series

NameProceedings of the International Symposium on Automation and Robotics in Construction
ISSN (Electronic)2413-5844

Conference

Conference42nd International Symposium on Automation and Robotics in Construction (ISARC 2025)
Abbreviated titleISARC
PlaceCanada
CityMontreal
Period28/07/2531/07/25
Internet address

Funding

The authors are grateful for the financial support provided by the National Nature Science Foundation of China (Grant number: 72301043).

Research Keywords

  • Computer vision
  • Construction machine
  • Construction safety
  • Deep active learning
  • Deep learning
  • Pose estimation
  • Stereo vision

Fingerprint

Dive into the research topics of '3D Full-body Pose Estimation of Construction Machines Using Deep Neural Network and Stereo Vision'. Together they form a unique fingerprint.

Cite this