Temporal Consistency for RGB-Thermal Data-based Semantic Scene Understanding

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

1 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Pages (from-to)9757-9764
Journal / PublicationIEEE Robotics and Automation Letters
Volume9
Issue number11
Online published10 Sept 2024
Publication statusPublished - Nov 2024

Abstract

Semantic scene understanding is a fundamental capability for autonomous vehicles. Under challenging lighting conditions, such as nighttime and on-coming headlights, the semantic scene understanding performance using only RGB images are usually degraded. Thermal images can provide complementary information to RGB images, so many recent semantic segmentation networks have been proposed using RGB-Thermal (RGB-T) images. However, most existing networks focus only on improving segmentation accuracy for single image frames, omitting the information consistency between consecutive frames. To provide a solution to this issue, we propose a temporal-consistent framework for RGB-T semantic segmentation, which introduces a virtual view image generation module to synthesize a virtual image for the next moment, and a consistency loss function to ensure the segmentation consistency. We also propose an evaluation metric to measure both the accuracy and consistency for semantic segmentation. Experimental results show that our framework outperforms state-of-the-art methods. © 2024 IEEE.

Research Area(s)

  • Autonomous Vehicles, Multi-modal Fusion, RGB-Thermal, Semantic Segmentation, Temporal Consistency

Citation Format(s)

Temporal Consistency for RGB-Thermal Data-based Semantic Scene Understanding. / Li, Haotian; Chu, Henry K.; Sun, Yuxiang.
In: IEEE Robotics and Automation Letters, Vol. 9, No. 11, 11.2024, p. 9757-9764.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review