Skip to main navigation Skip to search Skip to main content

Optimal and Approximate Parallelism-based Computation Offloading Algorithms for Real-Time Multimodal Learning at the Edge

Quan Chen, Ming Yi, Jing Li, Ning Li, Hong Gao, Zhipeng Cai

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Multimodal learning has been introduced as a popular learning paradigm that can integrate inputs from multimodal video data. To accelerate video analytics at the edge, video frames are usually scalarized and compressed into various resolutions to offload to the edge server to achieve a balance between accuracy and latency. In this paper, we investigate the problem of the Joint Schedule of Offloading decision and Resolution selection (JSOR) for real-time multimodal learning at the edge. Firstly, the parallelism between the computation and communication between the edge device and server is identified and modeled. Then, the problem of JSOR to maximize the accuracy while minimizing energy consumption under the latency constraints, is formulated and proved to be NP-hard. To the best of our knowledge, this is the first work that takes the parallelism during the offloading process into account for the JSOR problem. An optimal algorithm based on dynamic programming is proposed with a decision graph, which is constructed to integrate the offloading decision and resolution selection together with the processing latency. To further reduce the time complexity, several pruning strategies and an approximate algorithm are also proposed. Additionally, to maximize the long-term average utility, an adaptive online algorithm based on Lyapunov optimization and reinforcement learning is also proposed. Finally, through extensive simulations and real implementations on the NVIDIA Jetson AGX Orin platform, we demonstrated the effectiveness of the proposed algorithms in terms of accuracy and energy consumption.
Original languageEnglish
Title of host publicationIEEE INFOCOM 2025 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS
PublisherIEEE
Number of pages10
ISBN (Electronic)979-8-3315-4305-1
ISBN (Print)979-8-3315-4306-8
DOIs
Publication statusPublished - 2025
EventIEEE International Conference on Computer Communications 2025 (IEEE INFOCOM 2025) - Park Plaza Westminster Bridge, London, United Kingdom
Duration: 19 May 202522 May 2025
https://infocom2025.ieee-infocom.org/

Publication series

Name
ISSN (Print)0743-166X
ISSN (Electronic)2641-9874

Conference

ConferenceIEEE International Conference on Computer Communications 2025 (IEEE INFOCOM 2025)
Abbreviated titleIEEE INFOCOM 2025
PlaceUnited Kingdom
CityLondon
Period19/05/2522/05/25
Internet address

Funding

This work was supported by the NSFC under Grant No. U22A2025, U20A6003, 62372118, Hong Kong Areas of Excellence Scheme (AoE/E-601/22-R), the International Science and Technology Cooperation Project in Huangpu District (No. 2022GH08), the Guangdong Basic and Applied Basic Research Foundation (No. 2024A1515030136), the Guangzhou Science and Technology Plan under Grant 2023A04J1701, and the Guangdong Provincial Key Laboratory of Cyber-Physical System under Grant 2020B1212060069.

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Optimal and Approximate Parallelism-based Computation Offloading Algorithms for Real-Time Multimodal Learning at the Edge'. Together they form a unique fingerprint.

Cite this