Crack segmentation using discrete cosine transform in shadow environments

Yingchao Zhang, Cheng Liu*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

17 Citations (Scopus)

Abstract

Accurate pavement crack segmentation is crucial for quantifying the extent of pavement damage. However, shadows from roadside trees or buildings often significantly impact the results of crack segmentation in actual crack detection or segmentation processes. To solve this problem, this paper presents a pavement crack segmentation network called SCSNet based on discrete cosine transform. SCSNet combines a proposed shadow removal module with a loss function based on pixel frequency distribution to further minimize the impact of shadows on accuracy. Additionally, a crack dataset with shadows was introduced. By comparing with the classical semantic segmentation network, the results show that SCSNet with training from scratch, outperforms classic models based on pre-trained weights. The result of the ablation experiments also demonstrate that the proposed tricks are effective. Finally, the actual crack segmentation results further demonstrate the superiority of SCSNet in segmenting cracks in complex environments. © 2024 Elsevier B.V

Original languageEnglish
Article number105646
Number of pages11
JournalAutomation in Construction
Volume166
Online published1 Aug 2024
DOIs
Publication statusPublished - Oct 2024

Bibliographical note

Publisher Copyright:
© 2024 Elsevier B.V.

Funding

The research is partly supported by the New Faculty Start-up Fund from City University of Hong Kong with grant number 9610612, partly supported by the Research Matching Grant Scheme with grant number 9229141, partly supported by the CityU Strategic Interdisciplinary Research Grant with grant number 7020076.

Research Keywords

  • Crack segmentation
  • Deep learning
  • Non-destructive testing
  • Shadow removal

Fingerprint

Dive into the research topics of 'Crack segmentation using discrete cosine transform in shadow environments'. Together they form a unique fingerprint.

Cite this