AdaShadow: Responsive Test-time Model Adaptation in Non-stationary Mobile Environments

Cheng Fang, Sicong Liu, Zimu Zhou, Bin Guo*, Jiaqi Tang, Ke Ma, Zhiwen Yu

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

5 Citations (Scopus)

Abstract

On-device adapting to continual, unpredictable domain shifts is essential for mobile applications like autonomous driving and augmented reality to deliver seamless user experiences in evolving environments. Test-time adaptation (TTA) emerges as a promising solution by tuning model parameters with unlabeled live data immediately before prediction. However, TTA's unique forward-backward-reforward pipeline notably increases the latency over standard inference, undermining the responsiveness in time-sensitive mobile applications. This paper presents AdaShadow, a responsive test-time adaptation framework for non-stationary mobile data distribution and resource dynamics via selective updates of adaptation-critical layers. Although the tactic is recognized in generic on-device training, TTA's unsupervised and online context presents unique challenges in estimating layer importance and latency, as well as scheduling the optimal layer update plan. AdaShadow addresses these challenges with a backpropagation-free assessor to rapidly identify critical layers, a unit-based runtime predictor to account for resource dynamics in latency estimation, and an online scheduler for prompt layer update planning. Also, AdaShadow incorporates a memory I/O-aware computation reuse scheme to further reduce latency in the reforwardpass. Results show that AdaShadow achieves the best accuracy-latency balance under continual shifts. At low memory and energy costs, Adashadow provides a 2x to 3.5x speedup (ms-level) over state-of-the-art TTA methods with comparable accuracy and a 14.8% to 25.4% accuracy boost over efficient supervised methods with similar latency. © 2024 Copyright is held by the owner/author(s).
Original languageEnglish
Title of host publicationSenSys '24: Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems
PublisherAssociation for Computing Machinery
Pages295-308
ISBN (Print)9798400706974
DOIs
Publication statusPublished - Nov 2024
Event22nd ACM Conference on Embedded Networked Sensor Systems (SenSys 2024) - The Dragon Hotel Hangzhou, Hangzhou, China
Duration: 4 Nov 20247 Nov 2024
https://sensys.acm.org/2024/

Publication series

NameSenSys - Proceedings of the ACM Conference on Embedded Networked Sensor Systems

Conference

Conference22nd ACM Conference on Embedded Networked Sensor Systems (SenSys 2024)
PlaceChina
CityHangzhou
Period4/11/247/11/24
Internet address

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Funding

This work was partially supported by the National Science Fund for Distinguished Young Scholars (62025205), the National Natural Science Foundation of China (No. 62032020, 6247074224, 62102317), and CityU APRC grant (No. 9610633).

Research Keywords

  • latency-efficient test-time adaptation
  • mobile environments

Fingerprint

Dive into the research topics of 'AdaShadow: Responsive Test-time Model Adaptation in Non-stationary Mobile Environments'. Together they form a unique fingerprint.

Cite this