Overcoming Memory Constraint for Improved Target Classification Performance on Embedded Deep Learning Systems

Fan Wu, Huanghe Liu, Zongwei Zhu*, Cheng Ji, Chun Jason Xue

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Pattern recognition applications such as face recognition, detection of broken eggs, and classification of agricultural products are all using image classification in deep neural networks to improve the quality of services. However, traditional cloud inference models suffer from several problems such as network delay fluctuations and privacy leakage. In this regard, most real-Time applications currently need to be deployed on edge computing devices. Constrained by the computing power and memory limitations of edge devices, the use of an efficient memory manager for model reasoning is the key to improving the quality of service. This study firstly explored the incremental loading strategy of model weights for the model reasoning. Next, the memory space at runtime is optimized through data layout reorganization from the spatial dimension. In particular, our proposed schemes are orthogonal and transparent to the model. Experimental results demonstrate that the proposed approach reduced the memory consumption by 43.74% on average without additional reasoning time overhead.
Original languageEnglish
Title of host publicationProceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications; IEEE 18th International Conference on Smart City; IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020
PublisherIEEE
Pages634-639
ISBN (Print)9781728176499
DOIs
Publication statusPublished - Dec 2020
Event22nd IEEE International Conference on High Performance Computing and Communications, 18th IEEE International Conference on Smart City and 6th IEEE International Conference on Data Science and Systems (HPCC-SmartCity-DSS 2020) - Virtual, Fiji
Duration: 14 Dec 202016 Dec 2020

Publication series

NameProceedings - IEEE International Conference on High Performance Computing and Communications, IEEE International Conference on Smart City and IEEE International Conference on Data Science and Systems, HPCC-SmartCity-DSS

Conference

Conference22nd IEEE International Conference on High Performance Computing and Communications, 18th IEEE International Conference on Smart City and 6th IEEE International Conference on Data Science and Systems (HPCC-SmartCity-DSS 2020)
Country/TerritoryFiji
Period14/12/2016/12/20

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Research Keywords

  • Deep learning reasoning
  • Edge computing
  • Memory management

Fingerprint

Dive into the research topics of 'Overcoming Memory Constraint for Improved Target Classification Performance on Embedded Deep Learning Systems'. Together they form a unique fingerprint.

Cite this