Semantic 3D reconstruction-oriented image dataset for building component segmentation

Mun On Wong, Huaquan Ying*, Mengtian Yin*, Xiaoyue Yi, Lizhao Xiao, Weilun Duan, Chenchen He, Llewellyn Tang

*Corresponding author for this work

Research output: Journal Publications and Reviews, RGC 21 - Publication in refereed journal, peer-review

5 Citations (Scopus)

Abstract

In image-driven 3D building reconstruction, instance segmentation is fundamental to pixel-wise building component detection, which can be fused with 3D data like point clouds and meshes via camera projection for semantic reconstruction. While deep learning-based segmentation has obtained promising results, it relies heavily on large-scale datasets for training. Unfortunately, existing large-scale image datasets often include irrelevant objects that obstruct building components, making them unsuitable for 3D building reconstruction. This paper addresses this gap by introducing a large-scale building image dataset to facilitate building component segmentation for 3D reconstruction. The dataset comprises 3,378 images captured from both interiors and exteriors of 36 university buildings, annotated with 49,380 object instances across 11 classes. Rigorous quality control measures were employed during data collection and annotation. Evaluation of five typical deep learning-based instance segmentation models demonstrates the dataset's suitability for training and its value as a benchmark dataset for building component segmentation.

© 2024 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
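
The fusion step mentioned in the abstract (transferring pixel-wise labels to 3D points via camera projection) can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes a simple pinhole camera without skew or lens distortion, ignores occlusion, and uses hypothetical names (`project_point`, `label_points`) and a hypothetical class id.

```python
def project_point(point, K, R, t):
    """Project a 3D world point to pixel coordinates (u, v) with a pinhole camera.

    K is a 3x3 intrinsic matrix, R a 3x3 rotation, t a 3-vector (all assumed known).
    Returns None if the point lies behind the camera.
    """
    # World frame -> camera frame: X_cam = R * X_world + t
    xc, yc, zc = (sum(R[i][j] * point[j] for j in range(3)) + t[i] for i in range(3))
    if zc <= 0:
        return None
    fx, fy, cx, cy = K[0][0], K[1][1], K[0][2], K[1][2]
    # Perspective division followed by the intrinsic mapping (skew ignored)
    return fx * xc / zc + cx, fy * yc / zc + cy


def label_points(points, K, R, t, mask, unlabeled=-1):
    """Give each 3D point the class id of the mask pixel it projects onto.

    mask is a 2D list indexed as mask[row][col]; points that fall outside the
    image or behind the camera receive `unlabeled`. Occlusion is not handled.
    """
    height, width = len(mask), len(mask[0])
    labels = []
    for p in points:
        uv = project_point(p, K, R, t)
        if uv is None:
            labels.append(unlabeled)
            continue
        col, row = int(round(uv[0])), int(round(uv[1]))
        if 0 <= row < height and 0 <= col < width:
            labels.append(mask[row][col])
        else:
            labels.append(unlabeled)
    return labels
```

In practice a depth test against the mesh or a z-buffer would be needed so that points hidden behind other surfaces do not inherit labels from the components in front of them.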
Original language: English
Article number: 105558
Journal: Automation in Construction
Volume: 165
Online published: 18 Jun 2024
DOIs
Publication status: Published - Sept 2024

Research Keywords

  • 3D reconstruction
  • Building component segmentation
  • Building image dataset
  • Building Information Modeling (BIM)
  • Deep learning
  • Instance segmentation
