TY - JOUR
T1 - A mixed-integer programming-based Q-learning approach for electric bus scheduling with multiple termini and service routes
AU - Yan, Yimo
AU - Wen, Haomin
AU - Deng, Yang
AU - Chow, Andy H.F.
AU - Wu, Qihao
AU - Kuo, Yong-Hong
PY - 2024/5
Y1 - 2024/5
N2 - Electric buses (EBs) are considered a more environmentally friendly mode of public transit. In addition to other practical challenges, including high infrastructure costs and short driving ranges, the operations of EBs are more demanding due to the necessary battery charging activities. Consequently, more sophisticated optimisation models and algorithms are required for effective operations. This paper presents an EB scheduling problem with multiple termini and service routes. Various realistic but complicated factors, such as shared facilities at multiple termini, the flexibility of plugging and unplugging chargers before an EB is fully charged, stochastic travel times, and EB breakdowns, are considered. We propose an integrated learning and mixed-integer linear programming (MILP) framework to overcome the computational difficulties when solving the problem. This framework leverages the strengths of reinforcement learning and MILP for fast computations due to its capability of learning from outcomes of state–action pairs and computational effectiveness guaranteed by the constraints governing the solution feasibility. Q-Learning and Twin Delayed Deep Deterministic Policy Gradient are adopted as our training methods. We conduct numerical experiments on artificial instances and realistic instances of a bus network in Hong Kong to assess the performance of our proposed approach. The results show that our proposed framework outperforms the benchmark optimisation approach, in terms of penalty on missed service trips, average headway, and variance of headway. The benefits of our proposed framework are more significant under a highly stochastic environment. © 2024 Elsevier Ltd.
AB - Electric buses (EBs) are considered a more environmentally friendly mode of public transit. In addition to other practical challenges, including high infrastructure costs and short driving ranges, the operations of EBs are more demanding due to the necessary battery charging activities. Consequently, more sophisticated optimisation models and algorithms are required for effective operations. This paper presents an EB scheduling problem with multiple termini and service routes. Various realistic but complicated factors, such as shared facilities at multiple termini, the flexibility of plugging and unplugging chargers before an EB is fully charged, stochastic travel times, and EB breakdowns, are considered. We propose an integrated learning and mixed-integer linear programming (MILP) framework to overcome the computational difficulties when solving the problem. This framework leverages the strengths of reinforcement learning and MILP for fast computations due to its capability of learning from outcomes of state–action pairs and computational effectiveness guaranteed by the constraints governing the solution feasibility. Q-Learning and Twin Delayed Deep Deterministic Policy Gradient are adopted as our training methods. We conduct numerical experiments on artificial instances and realistic instances of a bus network in Hong Kong to assess the performance of our proposed approach. The results show that our proposed framework outperforms the benchmark optimisation approach, in terms of penalty on missed service trips, average headway, and variance of headway. The benefits of our proposed framework are more significant under a highly stochastic environment. © 2024 Elsevier Ltd.
KW - Electric buses
KW - Mixed-integer linear programming
KW - Public transport
KW - Q-learning
UR - http://www.scopus.com/inward/record.url?scp=85189754065&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85189754065&origin=recordpage
U2 - 10.1016/j.trc.2024.104570
DO - 10.1016/j.trc.2024.104570
M3 - RGC 21 - Publication in refereed journal
SN - 0968-090X
VL - 162
JO - Transportation Research Part C: Emerging Technologies
JF - Transportation Research Part C: Emerging Technologies
M1 - 104570
ER -