Skip to main navigation Skip to search Skip to main content

Multi-agent deep reinforcement learning for adaptive coordinated metro service operations with flexible train composition

Cheng-shuo Ying, Andy H.F. Chow*, Hoa T.M. Nguyen, Kwai-Sang Chin

*Corresponding author for this work

    Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

    Abstract

    This paper presents an adaptive control system for coordinated metro operations with flexible train composition by using a multi-agent deep reinforcement learning (MADRL) approach. The control problem is formulated as a Markov decision process (MDP) with multiple agents regulating different service lines in a metro network with passenger transfer. To ensure the overall computational effectiveness and stability of the control system, we adopt an actor–critic reinforcement learning framework in which each control agent is associated with a critic function for estimating future system states and an actor function deriving local operational decisions. The critics and actors in the MADRL are represented by multi-layer artificial neural networks (ANNs). A multi-agent deep deterministic policy gradient (MADDPG) algorithm is developed for training the actor and critic ANNs through successive simulated transitions over the entire metro network. The developed framework is tested with a real-world scenario in Bakerloo and Victoria Lines of London Underground, UK. Experiment results demonstrate that the proposed method can outperform previous centralized optimization and distributed control approaches in terms of solution quality and performance achieved. Further analysis shows the merits of MADRL for coordinated service regulation with flexible train composition. This study contributes to real-time coordinated metro network services with flexible train composition and advanced optimization techniques.
    Original languageEnglish
    Pages (from-to)36-59
    JournalTransportation Research Part B: Methodological
    Volume161
    Online published17 May 2022
    DOIs
    Publication statusPublished - Jul 2022

    Funding

    The authors would like to thank the anonymous reviewers for the constructive comments. This study was supported by a Hong Kong Research Grant Council Theme-based Research Scheme (Ref: T32-101/15-R), and the National Natural Science Foundation of China (Refs: 71971182; 71801158). The third author was supported by a Hong Kong PhD Fellowship Scheme. The project was also supported partially by a Hong Kong Innovation and Technology Fund (Ref: PRP/062/20FX).

    Research Keywords

    • Metro service coordination
    • Flexible train composition
    • Multi-agent deep reinforcement learning
    • Actor–critic architecture
    • Markov decision process

    RGC Funding Information

    • RGC-funded

    Fingerprint

    Dive into the research topics of 'Multi-agent deep reinforcement learning for adaptive coordinated metro service operations with flexible train composition'. Together they form a unique fingerprint.

    Cite this