TY - JOUR
T1 - A cluster-oriented task assignment optimization for green high-performance computing center operations
AU - Li, Jin
AU - Ma, Shuyi
AU - Xie, Min
PY - 2025/5
Y1 - 2025/5
N2 - The escalating demand for high-performance computing (HPC) centers necessitates efficient energy management. Prior research predominantly concentrated on individual servers, neglecting the workload interactions in resource sharing and task scaling environments. This study bridges this gap by developing an energy-aware task assignment model that integrates rack selection, task scaling, and resource allocation. Leveraging real-world data, we adopt a data-driven approach to manage the workload of a rack with multiple servers. We first identify pivotal workload characteristics affecting power consumption and investigate trade-offs between computing and luqid-cooling systems. We then propose a deep learning model to capture latent workload interactions further. We also design an adaptive large neighborhood search-based algorithm for task assignment optimizations. Simulations validate the performance of our models and highlight the impact of task queues on energy conservation. This study provides a holistic framework and managerial implications for energy efficiency management in HPC centers, integrating interpretive, predictive, and prescriptive analytics. © 2025 Elsevier Ltd.
AB - The escalating demand for high-performance computing (HPC) centers necessitates efficient energy management. Prior research predominantly concentrated on individual servers, neglecting the workload interactions in resource sharing and task scaling environments. This study bridges this gap by developing an energy-aware task assignment model that integrates rack selection, task scaling, and resource allocation. Leveraging real-world data, we adopt a data-driven approach to manage the workload of a rack with multiple servers. We first identify pivotal workload characteristics affecting power consumption and investigate trade-offs between computing and luqid-cooling systems. We then propose a deep learning model to capture latent workload interactions further. We also design an adaptive large neighborhood search-based algorithm for task assignment optimizations. Simulations validate the performance of our models and highlight the impact of task queues on energy conservation. This study provides a holistic framework and managerial implications for energy efficiency management in HPC centers, integrating interpretive, predictive, and prescriptive analytics. © 2025 Elsevier Ltd.
KW - Cloud workload characteristic
KW - Power estimation
KW - Energy-aware task assignment problem
KW - Black-box optimization
KW - Adaptive large neighborhood search algorithm
UR - http://www.scopus.com/inward/record.url?scp=85218344476&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85218344476&origin=recordpage
U2 - 10.1016/j.cie.2025.110929
DO - 10.1016/j.cie.2025.110929
M3 - RGC 21 - Publication in refereed journal
SN - 0360-8352
VL - 203
JO - Computers & Industrial Engineering
JF - Computers & Industrial Engineering
M1 - 110929
ER -