A mixed-integer programming-based Q-learning approach for electric bus scheduling with multiple termini and service routes

Yimo Yan, Haomin Wen, Yang Deng, Andy H.F. Chow, Qihao Wu, Yong-Hong Kuo*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

10 Citations (Scopus)

Abstract

Electric buses (EBs) are considered a more environmentally friendly mode of public transit. In addition to other practical challenges, including high infrastructure costs and short driving ranges, the operations of EBs are more demanding due to the necessary battery charging activities. Consequently, more sophisticated optimisation models and algorithms are required for effective operations. This paper presents an EB scheduling problem with multiple termini and service routes. Various realistic but complicated factors, such as shared facilities at multiple termini, the flexibility of plugging and unplugging chargers before an EB is fully charged, stochastic travel times, and EB breakdowns, are considered. We propose an integrated learning and mixed-integer linear programming (MILP) framework to overcome the computational difficulties when solving the problem. This framework leverages the strengths of reinforcement learning and MILP for fast computations due to its capability of learning from outcomes of state–action pairs and computational effectiveness guaranteed by the constraints governing the solution feasibility. Q-Learning and Twin Delayed Deep Deterministic Policy Gradient are adopted as our training methods. We conduct numerical experiments on artificial instances and realistic instances of a bus network in Hong Kong to assess the performance of our proposed approach. The results show that our proposed framework outperforms the benchmark optimisation approach, in terms of penalty on missed service trips, average headway, and variance of headway. The benefits of our proposed framework are more significant under a highly stochastic environment. © 2024 Elsevier Ltd.
Original languageEnglish
Article number104570
JournalTransportation Research Part C: Emerging Technologies
Volume162
Online published4 Apr 2024
DOIs
Publication statusPublished - May 2024

Funding

The authors are grateful to the Editor and Referees for the constructive comments and suggestions, which have greatly enhanced the quality of the work. The research of the first author is supported by HKU Presidential PhD Scholar Programme and Seed Fund for PI Research – Basic Research ( 2202100850 ). The research of the fourth author is supported by the General Research Fund of Hong Kong Research Grants Council ( 11203822 ). The research of the fifth author is supported by the HKU Foundation Postgraduate Fellowship. The research of the last author is partially supported by PROCORE - France/Hong Kong Joint Research Scheme, Research Grants Council of Hong Kong and Consulate General of France in Hong Kong ( F-HKU704/22 ), the Theme-based Research Scheme of Hong Kong Research Grants Council ( T32-707-22N ), and the 2019 Guangdong Special Support Talent Programme – Innovation and Entrepreneurship Leading Team (China) ( 2019BT02S593 ).

Research Keywords

  • Electric buses
  • Mixed-integer linear programming
  • Public transport
  • Q-learning

Fingerprint

Dive into the research topics of 'A mixed-integer programming-based Q-learning approach for electric bus scheduling with multiple termini and service routes'. Together they form a unique fingerprint.

Cite this