Skip to main navigation Skip to search Skip to main content

Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach

  • Xiaowen Ye
  • , Yuyi Mao
  • , Xianghao Yu*
  • , Shu Sun
  • , Liqun Fu
  • , Jie Xu
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

This paper studies an integrated sensing and communications (ISAC) system for low-altitude economy (LAE), where a ground base station (GBS) provides communication and navigation services for authorized unmanned aerial vehicles (UAVs), while sensing the low-altitude airspace to monitor the unauthorized mobile target. The expected communication sum-rate over a given flight period is maximized by jointly optimizing the beamforming at the GBS and UAVs’ trajectories, subject to the constraints on the average signal-to-noise ratio requirement for sensing, the flight mission and collision avoidance of UAVs, as well as the maximum transmit power at the GBS. Typically, this is a sequential decision-making problem with the given flight mission. Thus, we transform it to a specific Markov decision process (MDP) model called episode task. Based on this modeling, we propose a novel LAE-oriented ISAC scheme, referred to as Deep LAE-ISAC (DeepLSC), by leveraging the deep reinforcement learning (DRL) technique. In DeepLSC, a reward function and a new action selection policy termed constrained noise-exploration policy are judiciously designed to fulfill various constraints. To enable efficient learning in episode tasks, we develop a hierarchical experience replay mechanism, where the gist is to employ all experiences generated within each episode to jointly train the neural network. Besides, to enhance the convergence speed of DeepLSC, a symmetric experience augmentation mechanism, which simultaneously permutes the indexes of all variables to enrich available experience sets, is proposed. Simulation results demonstrate that compared with benchmarks, DeepLSC yields a higher sum-rate while meeting the preset constraints, achieves faster convergence, and is more robust against different settings. © 2025 IEEE.
Original languageEnglish
Pages (from-to)351-367
Number of pages17
JournalIEEE Transactions on Wireless Communications
Volume25
Online published4 Jul 2025
DOIs
Publication statusPublished - 2026

Funding

The work of Xianghao Yu is supported in part by the Hong Kong Research Grants Council under grant No. 16212922. The work of Shu Sun was supported in part by the National Natural Science Foundation of China (NSFC) under Grant 62271310. The work of Liqun Fu was supported in part by the National Natural Science Foundation of China under Grant U23A20281 and in part by the National Social Science Foundation of China under Grant 24&ZD189. The work of Jie Xu was supported in part by the National Natural Science Foundation of China under grants Nos. 62471424 and 92267202, and the Guangdong Provincial Key Laboratory of Future Networks of Intelligence under grant No. 2022B1212010001.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Research Keywords

  • deep reinforcement learning (DRL)
  • integrated sensing and communications (ISAC)
  • joint beamforming and trajectory design
  • Low-altitude economy (LAE)

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach'. Together they form a unique fingerprint.

Cite this