Skip to main navigation Skip to search Skip to main content

Efficient AGV Scheduling in Warehouses via Hierarchical Transformer Reinforcement Learning

  • Bingyi Liu
  • , Weizhen Han
  • , Enshu Wang*
  • , Keqin Zhong
  • , Libing Wu*
  • , Jianping Wang
  • , Chunming Qiao
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

In automated warehouses, efficient management and economic benefits hinge on the effective scheduling of automated guided vehicles (AGVs) to transport diverse packets. Emerging technologies such as artificial intelligence and automation control have greatly contributed to the development of packet transport schemes for AGVs. However, the development of the logistics industry results in a massive amount of packets with diverse deadlines, which brings new challenges for the AGV scheduling system. To address this, this paper treats each AGV as an agent and designs a novel hierarchical transformer reinforcement learning (HTRL) framework to generate efficient AGV scheduling policies. Specifically, this framework consists of one encoder and two decoders to produce the packet selection and path improvement actions. These two decoders are equipped with masked self-attention mechanisms to learn efficient packet selection and path improvement policies, facilitating AGV transport efficiency to meet the deadlines of packets. Moreover, we consider the kinetic features of AGVs and design a model predictive control (MPC)-based speed control method for AGVs to prevent frequent stop-and-wait of AGVs and enhance their transport efficiency. We build up a simulated warehouse environment containing packets with different deadlines and conduct extensive experiments. Experimental results validate that the proposed HTRL framework increases the delivered packets within expiration by up to 36.6% compared to other baselines. © 1983-2012 IEEE.
Original languageEnglish
Pages (from-to)3426-3439
Number of pages14
JournalIEEE Journal on Selected Areas in Communications
Volume43
Issue number10
Online published2 Jun 2025
DOIs
Publication statusPublished - Oct 2025

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 62272357, Grant 62441237, Grant U24A20336, and Grant 62302326; in part by the Key Research and Development Program of Hubei under Grant 2022BAA052; in part by Hong Kong Research Grant Council under Collaborative Research Fund under Grant C1042-23GF; in part by U.S. National Science Foundation under Grant CNS-2413876; in part by Wuhan Science and Technology Joint Project for Building a Strong Transportation Country under Grant 2024-2-7 and Grant 2023-2-7; in part byWuhan Science and Technology Project for Key Research and Development under Grant 2024050702030090; and in part by the Knowledge Innovation Program of Wuhan-Shuguang Project under Grant 2023010201020323.

Research Keywords

  • Automated guided vehicles
  • automated warehouses
  • multi-agent reinforcement learning

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Efficient AGV Scheduling in Warehouses via Hierarchical Transformer Reinforcement Learning'. Together they form a unique fingerprint.

Cite this