Pruning-Aware Merging for Efficient Multitask Inference

Xiaoxi He, Dawei Gao, Zimu Zhou*, Yongxin Tong, Lothar Thiele

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

10 Citations (Scopus)

Abstract

Many mobile applications demand selective execution of multiple correlated deep learning inference tasks on resource-constrained platforms. Given a set of deep neural networks, each pre-trained for a single task, it is desired that executing arbitrary combinations of tasks yields minimal computation cost. Pruning each network separately yields suboptimal computation cost due to task relatedness. A promising remedy is to merge the networks into a multitask network to eliminate redundancy across tasks before network pruning. However, pruning a multitask network combined by existing network merging schemes cannot minimise the computation cost of every task combination because they do not consider such a future pruning. To this end, we theoretically identify the conditions such that pruning a multitask network minimises the computation of all task combinations. On this basis, we propose Pruning-Aware Merging (PAM), a heuristic network merging scheme to construct a multitask network that approximates these conditions. The merged network is then ready to be further pruned by existing network pruning methods. Evaluations with different pruning schemes, datasets, and network architectures show that PAM achieves up to 4.87x less computation against the baseline without network merging, and up to 2.01x less computation against the baseline with a state-of-the-art network merging scheme. © 2021 ACM.
Original languageEnglish
Title of host publicationKDD 2021 - Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
PublisherAssociation for Computing Machinery
Pages585-595
ISBN (Print)9781450383325
DOIs
Publication statusPublished - Aug 2021
Externally publishedYes
Event27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021) - Virtual, Singapore
Duration: 14 Aug 202118 Aug 2021
https://kdd.org/kdd2021/

Publication series

NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Conference

Conference27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)
PlaceSingapore
Period14/08/2118/08/21
Internet address

Research Keywords

  • deep learning
  • multitask inference
  • network pruning

Fingerprint

Dive into the research topics of 'Pruning-Aware Merging for Efficient Multitask Inference'. Together they form a unique fingerprint.

Cite this