RL4CO: An Extensive Reinforcement Learning for Combinatorial Optimization Benchmark

33 authors, including, Federico Berto (Co-first Author), Chuanbo Hua (Co-first Author), Junyoung Park (Co-first Author), Laurin Luttmann (Co-first Author), Fei Liu, Qingfu Zhang

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

1 Downloads (CityUHK Scholars)

Abstract

Combinatorial optimization (CO) is fundamental to several real-world applications, from logistics and scheduling to hardware design and resource allocation. Deep reinforcement learning (RL) has recently shown significant benefits in solving CO problems, reducing reliance on domain expertise and improving computational efficiency. However, the absence of a unified benchmarking framework leads to inconsistent evaluations, limits reproducibility, and increases engineering overhead, raising barriers to adoption for new researchers. To address these challenges, we introduce RL4CO, a unified and extensive benchmark with in-depth library coverage of 27 CO problem environments and 23 state-of-the-art baselines. Built on efficient software libraries and best practices in implementation, RL4CO features modularized implementation and flexible configurations of diverse environments, policy architectures, RL algorithms, and utilities with extensive documentation. RL4CO helps researchers build on existing successes while exploring and developing their own designs, facilitating the entire research process by decoupling science from heavy engineering. We finally provide extensive benchmark studies to inspire new insights and future work. RL4CO has already attracted numerous researchers in the community and is open-sourced at https://github.com/ai4co/rl4co. © 2025 Association for Computing Machinery. All rights reserved.
Original languageEnglish
Title of host publicationKDD '25 - Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery
Pages5278-5289
Number of pages12
Volume2
ISBN (Print)9798400714542
DOIs
Publication statusPublished - Aug 2025
Event31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2025) - Toronto, Canada
Duration: 3 Aug 20257 Aug 2025
https://kdd2025.kdd.org/
https://dl.acm.org/conference/kdd/proceedings

Publication series

NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
ISSN (Print)2154-817X

Conference

Conference31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2025)
PlaceCanada
CityToronto
Period3/08/257/08/25
Internet address

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. RS-2024-00410082), the Institute of Information & Communications Technology Planning & Evaluation (IITP)-Innovative Human Resource Development for Local Intellectualization program grant funded by the Korea government(MSIT)(IITP-2025-RS-2024- 00436765), and the National Research Foundation, Singapore under its AI Singapore Programme (AISG Award No: AISG3-RP-2022-031). Minsu Kim was supported by KAIST Jang Yeong Sil Fellowship and the Canadian AI Safety Institute Research Program at CIFAR through a Catalyst award. Nayeli Gast Zepeda and Andr Hottung were supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) - No. 521243122.

Research Keywords

  • benchmark
  • combinatorial optimization
  • neural combinatorial optimization
  • open research community
  • reinforcement learning

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'RL4CO: An Extensive Reinforcement Learning for Combinatorial Optimization Benchmark'. Together they form a unique fingerprint.

Cite this