Abstract
This paper investigates the adaptive optimal control problem and proposes fundamentally novel non-model-based approaches for linear discrete-time networked control systems (NCSs) with both sensor and actuator two-channel stochastic dropouts by using directly the data transmitted via communication networks. First, we formulate a modified algebraic Riccati equation parameterized by the system dynamics and the network-induced packet dropouts probabilities, whose solvability is related to a critical arrival probability. To deal with this problem, two model-based reinforcement learning algorithms, policy iteration (PI) and value iteration (VI), are designed with their convergence proofs. To enable the application for NCSs with unknown system dynamics, two novel online PI and VI algorithms are designed. These algorithms develop a new theoretical framework to solve the Bellman function with stochastic dropouts by using directly the data transmitted via networks. Furthermore, a bilevel learning algorithm is proposed to approximate the critical arrival probability. Last but not least, an extension of the developed online VI algorithm is presented for stochastic systems with both unmeasurable noises and stochastic dropouts. Copyright © 2023 SIAM.
| Original language | English |
|---|---|
| Pages (from-to) | 3183-3208 |
| Journal | SIAM Journal on Control and Optimization |
| Volume | 61 |
| Issue number | 5 |
| Online published | 24 Oct 2023 |
| DOIs | |
| Publication status | Published - 2023 |
Funding
This work was supported in part by NSFC 61991404, 62373090, the fellowship award from the Research Grants Council of the Hong Kong Special Administrative Region, China, project CityU PDFS2324-1S02, Research and Development Program of Key Science and Technology Fields in Guangzhou City, under grant 202206030005
Research Keywords
- adaptive optimal control
- communication networks
- modified algebraic Riccati equation
- reinforcement learning
Publisher's Copyright Statement
- COPYRIGHT TERMS OF DEPOSITED FINAL PUBLISHED VERSION FILE: © 2023 Society for Industrial and Applied Mathematics.
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Adaptive Optimal Control of Linear Discrete-Time Networked Control Systems with Two-Channel Stochastic Dropouts'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver