Skip to main navigation Skip to search Skip to main content

AdaFlowLite: Scalable and Non-Blocking Inference on Asynchronous Mobile Data

Sicong Liu, Fengmin Wu, Yuan Gao, Bin Guo*, Zimu Zhou, Hongkai Wen, Zhiwen Yu

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

3 Downloads (CityUHK Scholars)

Abstract

The rise of mobile devices equipped with numerous sensors, such as LiDAR and cameras, has driven the adoption of multi-modal deep intelligence for distributed sensing tasks, such as smart cabins and driving assistance. However, the arrival time of mobile sensory data vary due to modality size and network dynamics, which can lead to delays (if waiting for slow data) or accuracy decline (if inference proceeds without waiting). Moreover, the diversity and dynamic nature of mobile systems exacerbate this challenge. In response, we present a shift to opportunistic inference for asynchronous distributed multi-modal data, enabling inference as soon as partial data arrives. While existing methods focus on optimizing modality consistency and complementarity, known as modal affinity, they lack a computational approach to control this affinity in open-world mobile environments. AdaFlowLite pioneers the formulation of structured cross-modality affinity in mobile contexts using a hierarchical analysis-based normalized matrix. This approach accommodates the diversity and dynamics of modalities, generalizing across different types and numbers of inputs. Employing an multi-modal lightweight Swin Transformer (MMLST), AdaFlowLite facilitates real-time and flexible data imputation, adapting to various modalities and downstream tasks without retraining. Experiments show that AdaFlowLite significantly reduces inference latency by up to 80.4% and enhances accuracy by up to 62.1%, while achieving nearly a 50% reduction in energy consumption, outperforming status quo approaches. Also, this method can enhance LLM performance to preprocess asynchronous data. © 2025 IEEE.
Original languageEnglish
Pages (from-to)12206-12220
Number of pages15
JournalIEEE Transactions on Mobile Computing
Volume24
Issue number11
Online published24 Jun 2025
DOIs
Publication statusPublished - Nov 2025

Funding

This work was partially supported in part by the National Science Fund for Distinguished Young Scholars under Grant 62025205, and in part by the National Natural Science Foundation of China under Grant 62032020, Grant 62102317, and Grant 62472354.

Research Keywords

  • Sensors
  • Accuracy
  • Laser radar
  • Vehicle dynamics
  • Sensor systems
  • Delays
  • Cameras
  • Three-dimensional displays
  • Mobile handsets
  • Point cloud compression
  • Mobile context
  • multi-modality inference
  • modality affinity control

Publisher's Copyright Statement

  • COPYRIGHT TERMS OF DEPOSITED POSTPRINT FILE: © 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. Liu, S., Wu, F., Gao, Y., Guo, B., Zhou, Z., Wen, H., & Yu, Z. (2025). AdaFlowLite: Scalable and Non-Blocking Inference on Asynchronous Mobile Data. IEEE Transactions on Mobile Computing, 24(11), 12206-12220. https://doi.org/10.1109/TMC.2025.3582060

Fingerprint

Dive into the research topics of 'AdaFlowLite: Scalable and Non-Blocking Inference on Asynchronous Mobile Data'. Together they form a unique fingerprint.

Cite this