Abstract
On-device adapting to continual, unpredictable domain shifts is essential for mobile applications like autonomous driving and augmented reality to deliver seamless user experiences in evolving environments. Test-time adaptation (TTA) emerges as a promising solution by tuning model parameters with unlabeled live data immediately before prediction. However, TTA's unique forward-backward-reforward pipeline notably increases the latency over standard inference, undermining the responsiveness in time-sensitive mobile applications. This paper presents AdaShadow, a responsive test-time adaptation framework for non-stationary mobile data distribution and resource dynamics via selective updates of adaptation-critical layers. Although the tactic is recognized in generic on-device training, TTA's unsupervised and online context presents unique challenges in estimating layer importance and latency, as well as scheduling the optimal layer update plan. AdaShadow addresses these challenges with a backpropagation-free assessor to rapidly identify critical layers, a unit-based runtime predictor to account for resource dynamics in latency estimation, and an online scheduler for prompt layer update planning. Also, AdaShadow incorporates a memory I/O-aware computation reuse scheme to further reduce latency in the reforwardpass. Results show that AdaShadow achieves the best accuracy-latency balance under continual shifts. At low memory and energy costs, Adashadow provides a 2x to 3.5x speedup (ms-level) over state-of-the-art TTA methods with comparable accuracy and a 14.8% to 25.4% accuracy boost over efficient supervised methods with similar latency. © 2024 Copyright is held by the owner/author(s).
| Original language | English |
|---|---|
| Title of host publication | SenSys '24: Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems |
| Publisher | Association for Computing Machinery |
| Pages | 295-308 |
| ISBN (Print) | 9798400706974 |
| DOIs | |
| Publication status | Published - Nov 2024 |
| Event | 22nd ACM Conference on Embedded Networked Sensor Systems (SenSys 2024) - The Dragon Hotel Hangzhou, Hangzhou, China Duration: 4 Nov 2024 → 7 Nov 2024 https://sensys.acm.org/2024/ |
Publication series
| Name | SenSys - Proceedings of the ACM Conference on Embedded Networked Sensor Systems |
|---|
Conference
| Conference | 22nd ACM Conference on Embedded Networked Sensor Systems (SenSys 2024) |
|---|---|
| Place | China |
| City | Hangzhou |
| Period | 4/11/24 → 7/11/24 |
| Internet address |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Funding
This work was partially supported by the National Science Fund for Distinguished Young Scholars (62025205), the National Natural Science Foundation of China (No. 62032020, 6247074224, 62102317), and CityU APRC grant (No. 9610633).
Research Keywords
- latency-efficient test-time adaptation
- mobile environments