Adaptive Metro Service Schedule and Train Composition With a Proximal Policy Optimization Approach Based on Deep Reinforcement Learning

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

37 Scopus Citations
View graph of relations

Detail(s)

Original languageEnglish
Pages (from-to)6895-6906
Journal / PublicationIEEE Transactions on Intelligent Transportation Systems
Volume23
Issue number7
Online published11 Mar 2021
Publication statusPublished - Jul 2022

Abstract

This paper presents an integrated metro service scheduling and train unit deployment with a proximal policy optimization approach based on the deep reinforcement learning framework. The optimization problem is formulated as a Markov decision process (MDP) subject to a set of operational constraints. To address the computational complexity, the value function and control policy are parameterized by artificial neural networks (ANNs) with which the operational constraints are incorporated through a devised mask scheme. A proximal policy optimization (PPO) approach is developed for training the ANNs via successive transition simulations. The optimization framework is implemented and tested on a real-world scenario configured with the Victoria Line of London Underground, UK. The results show that the performance of proposed methodology outperforms a set of selected evolutionary heuristics in terms of both solution quality and computational efficiency. Results illustrate the advantages of having flexible train composition in saving operational costs and reducing service irregularities. This study contributes to real time metro operations with limited resources and state-of-art optimization techniques.

Research Area(s)

  • deep reinforcement learning, Dynamic scheduling, Markov decision process, Metro service scheduling, Processor scheduling, proximal policy optimization., Reinforcement learning, Schedules, Scheduling, train composition, Training, Urban areas

Citation Format(s)

Adaptive Metro Service Schedule and Train Composition With a Proximal Policy Optimization Approach Based on Deep Reinforcement Learning. / Ying, Cheng-Shuo; Chow, Andy H. F.; Wang, Yi-Hui et al.
In: IEEE Transactions on Intelligent Transportation Systems, Vol. 23, No. 7, 07.2022, p. 6895-6906.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review