Abstract
Multiple-object tracking (MOT) is a crucial component in autonomous driving systems. However, inaccurate object detection is always the bottleneck for MOT. Most detectors are not designed to take the temporal information across consecutive frames into consideration. To take advantage of such information, we design a novel data representation, the spatio-temporal (ST) map, which collects a batch of detection results spatio-temporally, and we train a novel network, ST-TrackNet, to assign predicted track IDs to each positive detection across a sequence. With our ST map detection fed into the tracker, the correlation of objects between adjacent frames becomes prominent, which improves the performance of the tracker in the data association step. Moreover, the long-term trajectory in a sequence also helps to refine the detection results. We train and evaluate our network on the KITTI dataset, a CARLA simulation dataset, and a dataset recorded in a factory environment. Our approach generally achieves superior performance over the state-of-the-art. © 2022 IEEE.
| Original language | English |
|---|---|
| Pages (from-to) | 284-295 |
| Journal | IEEE Transactions on Automation Science and Engineering |
| Volume | 21 |
| Issue number | 1 |
| Online published | 31 Oct 2022 |
| Publication status | Published - Jan 2024 |
| Externally published | Yes |
Funding
This article was recommended for publication by Associate Editor Y. Pan and Editor C. Seatzu upon evaluation of the reviewers’ comments. This work was supported in part by the Guangdong Basic and Applied Basic Research Foundation under Project 2021B1515120032 and in part by the Zhongshan Science and Technology Bureau Fund under Project 2020AG002. The work of Ming Liu was supported by the Project of Hetao Shenzhen-Hong Kong Science and Technology Innovation Cooperation Zone under Grant HZQBKCZYB-2020083.
Research Keywords
- Autonomous driving
- deep learning
- Detectors
- Feature extraction
- Long short term memory
- multi-object tracking
- Object detection
- Point cloud compression
- spatio-temporal map
- Tracking
- Trajectory