TY - JOUR
T1 - Centralized Deep Reinforcement Learning Method for Dynamic Multi-Vehicle Pickup and Delivery Problem with Crowdshippers
AU - Xiang, Chuankai
AU - Wu, Zhibin
AU - Tu, Jiancheng
AU - Huang, Jun
PY - 2024/8
Y1 - 2024/8
N2 - Crowdshipping problem can be challenging as the platform are continuously but sporadically receiving crowdshippers and delivery tasks with heterogeneous origin and destination. In this paper, the dynamic multi-vehicle pickup and delivery problem with crowdshippers (DMV-PDPC) is considered. Leveraging the deep reinforcement learning framework, the attention model with centralized vehicle network (AMCVN) method is developed. Unlike traditional heuristic or existing vehicle-changing methods, AMCVN integrates a centralized vehicle network (CVN) that can observe the state information of all vehicles, enhancing its overall performance. In each decision-making step, the CVN monitors the state of the vehicles and selects one of the vehicles. Subsequently, the attention-based route generating network (RGN) determines the next node to be visited by the chosen vehicle. Instead of using a penalty term in the reward function to regulate the sequence of visits to pickup and delivery nodes, a more precise control method, namely the rolling mask scheme (RMS), is implemented. The method's evaluation is carried out via a simulation experiment using a real-world road network. This evaluation demonstrates that the proposed method effectively tackles the DMV-PDPC challenge, outperforming current state-of-The-Art learning-based models and heuristic methods. Moreover, the method shows exceptional generalization capabilities, as evidenced by its adaptability to different numbers of tasks and vehicles. © 2024 IEEE..
AB - Crowdshipping problem can be challenging as the platform are continuously but sporadically receiving crowdshippers and delivery tasks with heterogeneous origin and destination. In this paper, the dynamic multi-vehicle pickup and delivery problem with crowdshippers (DMV-PDPC) is considered. Leveraging the deep reinforcement learning framework, the attention model with centralized vehicle network (AMCVN) method is developed. Unlike traditional heuristic or existing vehicle-changing methods, AMCVN integrates a centralized vehicle network (CVN) that can observe the state information of all vehicles, enhancing its overall performance. In each decision-making step, the CVN monitors the state of the vehicles and selects one of the vehicles. Subsequently, the attention-based route generating network (RGN) determines the next node to be visited by the chosen vehicle. Instead of using a penalty term in the reward function to regulate the sequence of visits to pickup and delivery nodes, a more precise control method, namely the rolling mask scheme (RMS), is implemented. The method's evaluation is carried out via a simulation experiment using a real-world road network. This evaluation demonstrates that the proposed method effectively tackles the DMV-PDPC challenge, outperforming current state-of-The-Art learning-based models and heuristic methods. Moreover, the method shows exceptional generalization capabilities, as evidenced by its adaptability to different numbers of tasks and vehicles. © 2024 IEEE..
KW - attention mechanism
KW - Crowdshipping
KW - deep reinforcement learning
KW - dynamic vehicle routing problem
UR - http://www.scopus.com/inward/record.url?scp=85184312818&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85184312818&origin=recordpage
U2 - 10.1109/TITS.2024.3352143
DO - 10.1109/TITS.2024.3352143
M3 - RGC 21 - Publication in refereed journal
SN - 1524-9050
VL - 25
SP - 9253
EP - 9267
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 8
ER -