TY - JOUR
T1 - Joint Optimization of Model Retraining and Inference Services in DT-Assisted Edge Computing
AU - Ai, Xuan
AU - Liang, Weifa
AU - Liu, Caiyi
PY - 2025/11/18
Y1 - 2025/11/18
N2 - With the advance of emerging digital twin (DT) technology, the combination of DT with edge intelligence brings great potential for high-fidelity, delay-sensitive inference services at the network edge. Due to the training data drift over time, the accuracy of an inference model degrades dramatically. To maintain and/or enhance its accuracy, the service model requires to be continuously retrained using newly generated update data. However, model retraining and inference services compete with each other for the limited computing resource in a mobile edge computing network (MEC), which may decrease user satisfaction with services due to unbearable service delays caused by insufficient resource supplies. Therefore, to ensure high fidelity of service models by choosing models for retraining while maximizing user satisfaction, it becomes a great challenge to allocate the limited computing resource in an MEC to both model retraining and inference services. In this paper, we investigate fidelity-aware, delay-sensitive services in a DT-assisted MEC network over a given time horizon. We study a novel user satisfaction maximization problem with the aim to maximize the long-term user satisfaction on services. We first formulate an integer linear programming (ILP) solution to its offline version. We then devise an online algorithm for the problem with a bounded expected cumulative regret, by leveraging an efficient prediction mechanism and a multi-armed bandit (MAB) based resource allocation strategy. Finally, we evaluate the performance of the proposed algorithm via simulations. Simulation results demonstrate that the proposed algorithm is promising, outperforming the comparison benchmarks. © 2025 IEEE.
AB - With the advance of emerging digital twin (DT) technology, the combination of DT with edge intelligence brings great potential for high-fidelity, delay-sensitive inference services at the network edge. Due to the training data drift over time, the accuracy of an inference model degrades dramatically. To maintain and/or enhance its accuracy, the service model requires to be continuously retrained using newly generated update data. However, model retraining and inference services compete with each other for the limited computing resource in a mobile edge computing network (MEC), which may decrease user satisfaction with services due to unbearable service delays caused by insufficient resource supplies. Therefore, to ensure high fidelity of service models by choosing models for retraining while maximizing user satisfaction, it becomes a great challenge to allocate the limited computing resource in an MEC to both model retraining and inference services. In this paper, we investigate fidelity-aware, delay-sensitive services in a DT-assisted MEC network over a given time horizon. We study a novel user satisfaction maximization problem with the aim to maximize the long-term user satisfaction on services. We first formulate an integer linear programming (ILP) solution to its offline version. We then devise an online algorithm for the problem with a bounded expected cumulative regret, by leveraging an efficient prediction mechanism and a multi-armed bandit (MAB) based resource allocation strategy. Finally, we evaluate the performance of the proposed algorithm via simulations. Simulation results demonstrate that the proposed algorithm is promising, outperforming the comparison benchmarks. © 2025 IEEE.
KW - Digital twin
KW - DT-assisted mobile edge computing
KW - expected long-term regret
KW - fidelity-aware and delay-sensitive inference services
KW - joint resource allocation between model retraining and inference services
KW - multi-armed bandit optimization
KW - online algorithm
UR - https://www.scopus.com/pages/publications/105022308694
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-105022308694&origin=recordpage
U2 - 10.1109/TON.2025.3632228
DO - 10.1109/TON.2025.3632228
M3 - RGC 21 - Publication in refereed journal
SN - 2998-4157
JO - IEEE Transactions on Networking
JF - IEEE Transactions on Networking
ER -