Deep reinforcement learning based train door adaptive control in metro tunnel evacuation optimization

Yixin Shen, Jian Ma*, Hongqiang Fang, S. M. Lo, Congling Shi*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

13 Citations (Scopus)

Abstract

Passenger evacuation in a metro tunnel is affected by distinct factors in the environment; thus, it is a huge challenge for both designers and managers throughout the metro system's life cycle. A lateral evacuation platform is usually designed in a modern metro tunnel, and its evacuation efficiency is directly affected by train door opening strategy. By comparing the strategies of opening one, two, three, and four train doors, it is found that opening more doors improves evacuation efficiency. However, when two or more doors are accessed, this can lead to congestion on the evacuation platform near the train door. To reduce the congestion and improve evacuation efficiency, an adaptive train door control policy is proposed in this study, which features an adaptive dynamic programming method, namely, deep q network (DQN). The information of the environment and the state of the train doors are selected as the input and output of the control policy. By simulating scenarios under random, sequence, and DQN policies, the evacuation efficiencies are detailed. Results show that the proposed adaptive control method effectively improves evacuation efficiency. This method helps overcome the disadvantages associated with the fixed train door opening strategy and provides an optimal train door control policy during tunnel evacuation.
Original languageEnglish
Article number104636
JournalTunnelling and Underground Space Technology
Volume128
Online published1 Aug 2022
DOIs
Publication statusPublished - Oct 2022

Funding

The research was fully supported by a grant from the Research Grants Council of Hong Kong Special Administrative Region, China (Project No. [T32-101/15-R]), the National Natural Science Foundation of China (No. 71871189), the Science and Technology Development Funds of Sichuan Province (No. 2020YFS0291), and the Open Research Fund of SKLFS (No. HZ2019-KF14).

Research Keywords

  • Adaptive control
  • DQN
  • Pedestrian dynamics
  • Reinforcement learning
  • Tunnel evacuation

Fingerprint

Dive into the research topics of 'Deep reinforcement learning based train door adaptive control in metro tunnel evacuation optimization'. Together they form a unique fingerprint.

Cite this