Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls

Sicong Liu, Wentao Zhou, Zimu Zhou, Bin Guo*, Minfan Wang, Cheng Fang, Zheng Lin, Zhiwen Yu

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

4 Citations (Scopus)

Abstract

There is a growing demand to deploy computation-intensive deep learning (DL) models on resource-constrained mobile devices for real-time intelligent applications. Equipped with a variety of processing units such as CPUs, GPUs, and NPUs, the mobile devices hold potential to accelerate DL inference via parallel execution across heterogeneous processors. Various efficient parallel methods have been explored to optimize computation distribution, achieve load balance, and minimize communication cost across processors. Yet their practical effectiveness in the dynamic and diverse real-world mobile environment is less explored. This paper presents a holistic empirical study to assess the capabilities and challenges associated with parallel DL inference on heterogeneous mobile processors. Through carefully designed experiments covering various DL models, mobile software/hardware environments, workload patterns, and resource availability, we identify limitations of existing techniques and highlight opportunities for cross-level optimization. © 2024 Copyright held by the owner/author(s). Publication rights licensed to ACM.
Original languageEnglish
Title of host publicationAdaAIoTSys '24: Proceedings of the 2024 Workshop on Adaptive AIoT Systems
PublisherAssociation for Computing Machinery
Pages1-6
ISBN (Print)9798400706646
DOIs
Publication statusPublished - Jun 2024
Event2024 Workshop on Adaptive AIoT Systems (AdaAIoTSys 2024) Co-located with 22nd ACM International Conference on Mobile Systems, Applications, and Services (MobiSys 2024) - Toramon Hill Forum, Minato-ku, Japan
Duration: 3 Jun 20247 Jun 2024
https://www.sigmobile.org/mobisys/2024/wsl.html

Publication series

NameAdaAIoTSys - Proceedings of the AdaAIoTSys - Workshop on Adaptive AIoT Systems

Conference

Conference2024 Workshop on Adaptive AIoT Systems (AdaAIoTSys 2024) Co-located with 22nd ACM International Conference on Mobile Systems, Applications, and Services (MobiSys 2024)
PlaceJapan
CityMinato-ku
Period3/06/247/06/24
Internet address

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Funding

This work was supported by the National Science Fund for Distinguished Young Scholars (62025205), the National Natural Science Foundation of China (No. 62032020, 62102317), and CityU APRC grant (No. 9610633).

Research Keywords

  • Heterogeneous processors
  • parallel DL inference

Fingerprint

Dive into the research topics of 'Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls'. Together they form a unique fingerprint.

Cite this