TY - GEN
T1 - Solving Time-Dependent Traveling Salesman Problem with Time Windows with Deep Reinforcement Learning
AU - Wu, Guojin
AU - Zhang, Zizhen
AU - Liu, Hong
AU - Wang, Jiahai
PY - 2021/10
Y1 - 2021/10
N2 - Traveling Salesman Problem (TSP) is a well-known NP-hard combinatorial optimization problem. Recently, many researchers have used deep reinforcement learning to solve it. However, traffic factors are rarely considered in their works, in which the traveling time between customer locations is assumed to be constant over the planning horizon. For many practical scenarios, the traffic conditions between customer locations may change over time due to the impact of traffic patterns. Thus, this paper considers a Time-Dependent Traveling Salesman Problem with Time Windows (TDTSPTW), where the time dependency is obtained by fitting the collected traffic data into real-time traffic function with the interpolation method. We propose a deep reinforcement learning framework to solve TDTSPTW. Extensive experiments on TDTSPTW instances indicate that the proposed method can capture the real-time traffic changes and yield high-quality solutions within a very short time, compared with other typical baseline algorithms.
AB - Traveling Salesman Problem (TSP) is a well-known NP-hard combinatorial optimization problem. Recently, many researchers have used deep reinforcement learning to solve it. However, traffic factors are rarely considered in their works, in which the traveling time between customer locations is assumed to be constant over the planning horizon. For many practical scenarios, the traffic conditions between customer locations may change over time due to the impact of traffic patterns. Thus, this paper considers a Time-Dependent Traveling Salesman Problem with Time Windows (TDTSPTW), where the time dependency is obtained by fitting the collected traffic data into real-time traffic function with the interpolation method. We propose a deep reinforcement learning framework to solve TDTSPTW. Extensive experiments on TDTSPTW instances indicate that the proposed method can capture the real-time traffic changes and yield high-quality solutions within a very short time, compared with other typical baseline algorithms.
UR - https://www.scopus.com/pages/publications/85124277202
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85124277202&origin=recordpage
U2 - 10.1109/SMC52423.2021.9658956
DO - 10.1109/SMC52423.2021.9658956
M3 - RGC 32 - Refereed conference paper (with host publication)
SN - 978-1-6654-4208-4
T3 - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
SP - 558
EP - 563
BT - 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC)
PB - IEEE
T2 - 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC 2021)
Y2 - 17 October 2021 through 20 October 2021
ER -