Abstract
Deploying compute-intensive deep neural networks (DNNs) on resource-constrained end devices has become a prominent trend, enabling localized intelligence. However, efficiently deploying these DNNs at scale poses challenges. To address this, extensive research has focused on the early exit architecture based on convolutional neural networks (CNNs), which dynamically adapt network depth to reduce inference computation. Nevertheless, the sequential execution of all internal classifiers (ICs) and subsequent termination based on an exit criterion is inefficient. Motivated by these insights, we introduce a resolution-adaptive prediction network (RAPNet) architecture. RAPNet comprises a lightweight prediction network that captures global image features and an inference network integrated with an early exit architecture. The prediction network accurately determines the optimal IC position conditioned on the input images for efficient image classification. Additionally, we incorporate resolution-adaptive inference and feature fusion mechanisms by computational reuse, to effectively mitigate image spatial redundancy and improve the accuracy of ICs. We conduct extensive experiments across various datasets and architectures to demonstrate that RAPNet achieves a significantly better accuracy vs. computational trade-off than other recently proposed early exit methods. For instance, when using MobileNet as the base network, RAPNet achieves significant accuracy improvements of 12% and 5.7% on the Tiny Imagenet and CIFAR-100 datasets respectively, surpassing other early exit methods with similar computational constraints. © 2024 IEEE.
| Original language | English |
|---|---|
| Pages (from-to) | 33492-33507 |
| Number of pages | 16 |
| Journal | IEEE Internet of Things Journal |
| Volume | 11 |
| Issue number | 20 |
| Online published | 16 Jul 2024 |
| DOIs | |
| Publication status | Published - 15 Oct 2024 |
Funding
This work was supported in part by the National Key Research and Development Program of China under Grant 2023YFB4503100, and in part by the NSFC under Grant 62072137.
Research Keywords
- Accuracy
- Computational efficiency
- Computational modeling
- Computer architecture
- Deep learning
- early exit network
- Feature extraction
- Integrated circuit modeling
- predictive early exit
- Redundancy
- resolution adaptive inference