Two-Stream 3D MobileNetV3 for Pedestrians Intent Prediction Based on Monocular Camera

Yi Jiang, Weizhen Han, Luyao Ye, Yang Lu, Bingyi Liu*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

2 Citations (Scopus)

Abstract

Recent developments in the field of autonomous vehicles have led to a renewed interest in combining deep learning and autonomous driving, aiming to support vision-related safety applications in Intelligent Transportation System (ITS). Pedestrians intent prediction is critical in improving the efficiency and safety of autonomous driving since complex traffic scenarios are ubiquitous, specifically facing pedestrian intrusion and other emergencies. Substantial studies have focused on using skeleton features as input to the neural network to extract pedestrians’ features. However, most of them cannot avoid the loss of feature points and design a bloated neural network. In this paper, we propose a spatio-temporal MobileNetV3 based on Two-Stream to predict pedestrian intention. Based on the pedestrian images cropped by the bounding box, we use RGB images to generate optical flow to replace the pedestrian’s skeleton features as the network’s input. With the benefit of 3D depthwise separable convolution, our network’s Multiply-Accumulate Operations (MACs) is one-fifth of the baseline model. Finally, we conduct experiments on real traffic datasets to verify our proposed method’s effectiveness in different road environments.
Original languageEnglish
Title of host publicationNeural Computing for Advanced Applications
Subtitle of host publicationThird International Conference, NCAA 2022, Proceedings, Part II
EditorsHaijun Zhang, Yuehui Chen, Xianghua Chu, Zhao Zhang, Tianyong Hao, Zhou Wu, Yimin Yang
PublisherSpringer Singapore
Pages247-259
Edition1
ISBN (Electronic)978-981-19-6135-9
ISBN (Print)978-981-19-6134-2
DOIs
Publication statusPublished - 2022
Event3rd International Conference on Neural Computing for Advanced Applications (NCAA 2022) - University of Jinan, Jinan, China
Duration: 8 Jul 202210 Jul 2022
https://dl2link.com/ncaa2022/

Publication series

NameCommunications in Computer and Information Science
Volume1638 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference3rd International Conference on Neural Computing for Advanced Applications (NCAA 2022)
Abbreviated title2022 NCAA
PlaceChina
CityJinan
Period8/07/2210/07/22
Internet address

Research Keywords

  • MobileNet
  • Pedestrians intent prediction
  • Two-stream network

Fingerprint

Dive into the research topics of 'Two-Stream 3D MobileNetV3 for Pedestrians Intent Prediction Based on Monocular Camera'. Together they form a unique fingerprint.

Cite this