Skip to main navigation Skip to search Skip to main content

Joint Optimization of Model Retraining and Inference Services in DT-Assisted Edge Computing

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

With the advance of emerging digital twin (DT) technology, the combination of DT with edge intelligence brings great potential for high-fidelity, delay-sensitive inference services at the network edge. Due to the training data drift over time, the accuracy of an inference model degrades dramatically. To maintain and/or enhance its accuracy, the service model requires to be continuously retrained using newly generated update data. However, model retraining and inference services compete with each other for the limited computing resource in a mobile edge computing network (MEC), which may decrease user satisfaction with services due to unbearable service delays caused by insufficient resource supplies. Therefore, to ensure high fidelity of service models by choosing models for retraining while maximizing user satisfaction, it becomes a great challenge to allocate the limited computing resource in an MEC to both model retraining and inference services. In this paper, we investigate fidelity-aware, delay-sensitive services in a DT-assisted MEC network over a given time horizon. We study a novel user satisfaction maximization problem with the aim to maximize the long-term user satisfaction on services. We first formulate an integer linear programming (ILP) solution to its offline version. We then devise an online algorithm for the problem with a bounded expected cumulative regret, by leveraging an efficient prediction mechanism and a multi-armed bandit (MAB) based resource allocation strategy. Finally, we evaluate the performance of the proposed algorithm via simulations. Simulation results demonstrate that the proposed algorithm is promising, outperforming the comparison benchmarks. © 2025 IEEE.
Original languageEnglish
Number of pages16
JournalIEEE Transactions on Networking
Online published18 Nov 2025
DOIs
Publication statusOnline published - 18 Nov 2025

Funding

The work of Weifa Liang was supported by the Research Grants Council of the Hong Kong Special Administrative Region, China, under Grant CityU 11202723, Grant CityU 11202824, Grant C1042-23GF, Grant 7005845, Grant 8730094, and Grant 9380137.

Research Keywords

  • Digital twin
  • DT-assisted mobile edge computing
  • expected long-term regret
  • fidelity-aware and delay-sensitive inference services
  • joint resource allocation between model retraining and inference services
  • multi-armed bandit optimization
  • online algorithm

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Joint Optimization of Model Retraining and Inference Services in DT-Assisted Edge Computing'. Together they form a unique fingerprint.

Cite this