Genie in the Model: Automatic Generation of Human-in-the-Loop Deep Neural Networks for Mobile Applications

Yanfei WANG, Zhiwen YU*, Sicong LIU, Zimu ZHOU, Bin GUO

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

6 Citations (Scopus)
67 Downloads (CityUHK Scholars)

Abstract

Advances in deep neural networks (DNNs) have fostered a wide spectrum of intelligent mobile applications ranging from voice assistants on smartphones to augmented reality with smart-glasses. To deliver high-quality services, these DNNs should operate on resource-constrained mobile platforms and yield consistent performance in open environments. However, DNNs are notoriously resource-intensive, and often suffer from performance degradation in real-world deployments. Existing research strives to optimize the resource-performance trade-off of DNNs by compressing the model without notably compromising its inference accuracy. Accordingly, the accuracy of these compressed DNNs is bounded by the original ones, leading to more severe accuracy drop in challenging yet common scenarios such as low-resolution, small-size, and motion-blur. In this paper, we propose to push forward the frontiers of the DNN performance-resource trade-off by introducing human intelligence as a new design dimension. To this end, we explore human-in-the-loop DNNs (H-DNNs) and their automatic performance-resource optimization. We present H-Gen, an automatic H-DNN compression framework that incorporates human participation as a new hyperparameter for accurate and efficient DNN generation. It involves novel hyperparameter formulation, metric calculation, and search strategy in the context of automatic H-DNN generation. We also propose human participation mechanisms for three common DNN architectures to showcase the feasibility of H-Gen. Extensive experiments on twelve categories of challenging samples with three common DNN structures demonstrate the superiority of H-Gen in terms of the overall trade-off between performance (accuracy, latency), and resource (storage, energy, human labour). © 2023 Association for Computer Machinery.
Original languageEnglish
Article number36
JournalProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
Volume7
Issue number1
Online published28 Mar 2023
DOIs
Publication statusPublished - Mar 2023

Research Keywords

  • Human in the Loop
  • model generation
  • neural networks
  • reinforcement Learning

Publisher's Copyright Statement

  • COPYRIGHT TERMS OF DEPOSITED POSTPRINT FILE: © ACM 2023. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, http://dx.doi.org/10.1145/3580815.

Fingerprint

Dive into the research topics of 'Genie in the Model: Automatic Generation of Human-in-the-Loop Deep Neural Networks for Mobile Applications'. Together they form a unique fingerprint.

Cite this