Skip to main navigation Skip to search Skip to main content

On-Device Deep Multi-Task Inference via Multi-Task Zipping

  • Xiaoxi He
  • , Xu Wang
  • , Zimu Zhou*
  • , Jiahang Wu
  • , Zheng Yang
  • , Lothar Thiele
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Future mobile devices are anticipated to perceive, understand and react to the world on their own by running multiple correlated deep neural networks locally on-device. Yet the complexity of deep models needs to be trimmed down to fit in mobile storage and memory. Previous studies squeeze the redundancy within a single model. In this work, we reduce the redundancy across multiple models. We propose Multi-Task Zipping (MTZ), a framework to merge correlated, pre-trained deep neural networks for cross-model compression. Central in MTZ is a layer-wise neuron sharing and incoming weight updating scheme that induces a minimal change in the error function. MTZ inherits information from each model and demands light retraining to re-boost the accuracy of individual tasks. MTZ supports typical network layers and applies to inference tasks with different input domains. Evaluations show that MTZ can fully merge the hidden layers of two VGG-16 network. Moreover, MTZ can effectively merge nine residual networks for diverse inference tasks and models for different input domains. With the joint model merged by MTZ, the latency to switch between these tasks on memory-constrained devices is reduced by 8.71. © 2021 IEEE
Original languageEnglish
Pages (from-to)2878-2891
JournalIEEE Transactions on Mobile Computing
Volume22
Issue number5
Online published2 Nov 2021
DOIs
Publication statusPublished - 1 May 2023
Externally publishedYes

Research Keywords

  • Biological neural networks
  • Deep learning
  • Deep Neural Networks
  • Mobile computing
  • Model Compression
  • Multi-Task Learning
  • Neurons
  • Redundancy
  • Task analysis
  • Training

Fingerprint

Dive into the research topics of 'On-Device Deep Multi-Task Inference via Multi-Task Zipping'. Together they form a unique fingerprint.

Cite this