Skip to main navigation Skip to search Skip to main content

DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification

  • Junzhou Chen
  • , Zirui Zhang
  • , Jing Yu
  • , Heqiang Huang
  • , Ronghui Zhang*
  • , Xuemiao Xu
  • , Bin Sheng
  • , Hong Yan
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Driver distraction remains a leading cause of traffic accidents, posing a critical threat to road safety globally. As intelligent transportation systems evolve, accurate and real-time identification of driver distraction has become essential. However, existing methods struggle to capture both global contextual and fine-grained local features while contending with noisy labels in training datasets. To address these challenges, we propose DSDFormer, a novel framework that integrates the strengths of Transformer and Mamba architectures through a Dual State Domain Attention (DSDA) mechanism, enabling a balance between long-range dependencies and detailed feature extraction for robust driver behavior recognition. Additionally, we introduce Temporal Reasoning Confident Learning (TRCL), an unsupervised approach that refines noisy labels by leveraging spatiotemporal correlations in video sequences. Beyond achieving state-of-the-art results on AUC-V1, AUC-V2, and 100-Driver datasets, the proposed model is deployable in real-time on embedded platforms such as NVIDIA Jetson AGX Orin and Xavier. Extensive experimental results confirm that DSDFormer and TRCL significantly improve both the accuracy and robustness of driver distraction detection, offering a scalable solution to enhance road safety. © 2000-2011 IEEE.
Original languageEnglish
Pages (from-to)1312-1327
Number of pages16
JournalIEEE Transactions on Intelligent Transportation Systems
Volume27
Issue number1
Online published21 Nov 2025
DOIs
Publication statusPublished - Jan 2026

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 61003143, Grant 52172350, and Grant T2525004; in part by The Hong Kong Innovation and Technology Commission (InnoHK Project CIMDA); in part by the Tongchuang Intelligent Medical InterDisciplinary Talent Training Fund of Sun Yat-sen University under Grant 76160-54990001; in part by Guangdong Basic and Applied Research Foundation under Grant 2022B1515120072; in part by Guangzhou Science and Technology Plan Project under Grant 2024B01W0079; and in part by the Science and Technology Planning Project of Guangdong Province under Grant 2023B1212060029.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being
  2. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities

Research Keywords

  • confident learning
  • Driver distraction identification
  • Mamba
  • traffic accidents
  • transformers

Fingerprint

Dive into the research topics of 'DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification'. Together they form a unique fingerprint.

Cite this