Adaptive Metro Service Schedule and Train Composition With a Proximal Policy Optimization Approach Based on Deep Reinforcement Learning
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 6895-6906 |
Journal / Publication | IEEE Transactions on Intelligent Transportation Systems |
Volume | 23 |
Issue number | 7 |
Online published | 11 Mar 2021 |
Publication status | Published - Jul 2022 |
Link(s)
Abstract
This paper presents an integrated metro service scheduling and train unit deployment with a proximal policy optimization approach based on the deep reinforcement learning framework. The optimization problem is formulated as a Markov decision process (MDP) subject to a set of operational constraints. To address the computational complexity, the value function and control policy are parameterized by artificial neural networks (ANNs) with which the operational constraints are incorporated through a devised mask scheme. A proximal policy optimization (PPO) approach is developed for training the ANNs via successive transition simulations. The optimization framework is implemented and tested on a real-world scenario configured with the Victoria Line of London Underground, UK. The results show that the performance of proposed methodology outperforms a set of selected evolutionary heuristics in terms of both solution quality and computational efficiency. Results illustrate the advantages of having flexible train composition in saving operational costs and reducing service irregularities. This study contributes to real time metro operations with limited resources and state-of-art optimization techniques.
Research Area(s)
- deep reinforcement learning, Dynamic scheduling, Markov decision process, Metro service scheduling, Processor scheduling, proximal policy optimization., Reinforcement learning, Schedules, Scheduling, train composition, Training, Urban areas
Citation Format(s)
Adaptive Metro Service Schedule and Train Composition With a Proximal Policy Optimization Approach Based on Deep Reinforcement Learning. / Ying, Cheng-Shuo; Chow, Andy H. F.; Wang, Yi-Hui et al.
In: IEEE Transactions on Intelligent Transportation Systems, Vol. 23, No. 7, 07.2022, p. 6895-6906.
In: IEEE Transactions on Intelligent Transportation Systems, Vol. 23, No. 7, 07.2022, p. 6895-6906.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review