Skip to main navigation Skip to search Skip to main content

Adaptive optimal tracking control of networked linear systems under two-channel stochastic dropouts

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

This work investigates the adaptive optimal tracking control problem for networked discrete-time linear systems by directly using the data transmitted via communication networks. It is shown that the concerned problem can be addressed by solving two sub-problems: an adaptive optimal control one and an adaptive output regulation one. Two novel online reinforcement learning based policy iteration and value iteration algorithms are developed, which constitute an integrated framework to learn the optimal feedback control gain and the solutions to regulator equations by directly using the data transmitted via communication networks. Furthermore, it is shown that the tracking error of the closed-loop control system is mean square asymptotically stable even in the case of network packet stochastic dropouts. Simulation results demonstrate the efficacy of the proposed approaches. © 2024 Elsevier Ltd
Original languageEnglish
Article number111690
JournalAutomatica
Volume165
Online published27 Apr 2024
DOIs
Publication statusPublished - Jul 2024

Funding

This work was supported by Research Grants Council of Hong Kong, Project No. CityU 11205221 and 11210222 , NSFC-Excellent Young Scientists Fund (Hong Kong and Macao), Project No. 62222318 , the fellowship award from the Research Grants Council of Hong Kong , Project No. CityU PDFS2324-1S02 . The material in this paper was not presented at any conference. This paper was recommended for publication in revised form by Associate Editor Chanying Li under the direction of Editor Miroslav Krstic.

Research Keywords

  • Communication networks
  • Optimal control
  • Output regulation
  • Reinforcement learning

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Adaptive optimal tracking control of networked linear systems under two-channel stochastic dropouts'. Together they form a unique fingerprint.

Cite this