Moses: Exploiting Cross-device Transferable Features for On-device Tensor Program Optimization

Zhihe Zhao, Xian Shuai, Neiwen Ling, Nan Guan, Zhenyu Yan, Guoliang Xing*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

5 Citations (Scopus)

Abstract

Achieving efficient execution of machine learning models on mobile/edge devices has attracted significant attention recently. A key challenge is to generate high-performance tensor programs for each operator inside a DNN model efficiently. To this end, deep learning compilers have adopted auto-Tuning approaches such as Ansor. However, it is challenging to optimize tensor codes for mobile/edge devices by auto-Tuning due to limited time budgets and on-device resources. A key component of DNN compilers is the cost model that can predict the performance of each configuration on specific devices. However, current design of cost models cannot provide transferable features among different hardware accelerators efficiently and effectively. In this paper, we propose Moses, a simple yet efficient design based on the lottery ticket hypothesis, which fully takes advantage of the hardware-Agnostic features transferable to the target device via domain adaptation to optimize the time-consuming auto-Tuning process of DNN compiling on a new hardware platform. Compared with state-of-The-Art approaches, Moses achieves up to 1.53X efficiency gain in the search stage and 1.41X inference speedup on challenging DNN benchmarks. © 2023 ACM.
Original languageEnglish
Title of host publicationHotMobile '23: Proceedings of the 24th International Workshop on Mobile Computing Systems and Applications
PublisherAssociation for Computing Machinery
Pages22-28
ISBN (Print)9798400700170
DOIs
Publication statusPublished - Feb 2023
Event24th International Workshop on Mobile Computing Systems and Applications (HotMobile 2023) - Newport Beach, United States
Duration: 22 Feb 202323 Feb 2023

Publication series

NameHotMobile - Proceedings of the International Workshop on Mobile Computing Systems and Applications

Conference

Conference24th International Workshop on Mobile Computing Systems and Applications (HotMobile 2023)
PlaceUnited States
CityNewport Beach
Period22/02/2323/02/23

Funding

The work described in this article was supported by the Research Grants Council (RGC)-General Research Fund under Grant No. 14209619.

Research Keywords

  • DNN compiler
  • efficient DNN processing
  • transfer learning

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Moses: Exploiting Cross-device Transferable Features for On-device Tensor Program Optimization'. Together they form a unique fingerprint.

Cite this