Abstract
This paper proposes a human-in-The-loop deep reinforcement learning (HL-DRL)-based VVC strategy to simultaneously reduce power losses, mitigate voltage violations and compensate for voltage unbalance in three-phase unbalanced distribution networks. Instead of fully trusting DRL actions made by deep neural networks, a human intervention module is proposed to modify dangerous actions that violate operation constraints during offline training. This module refers to well-designed human guidance rules based on voltage-reactive power sensitivities, which regulate PV reactive power to sequentially address local voltage violation and unbalance issues to obtain safe transitions. To efficiently and safely learn the optimal control policy from these training samples, a human-in-The-loop soft actor-critic (HL-SAC) solution method is then developed. Different from the standard SAC algorithm, an online switch mechanism between action exploration and human intervention is designed. The actor network loss function is modified to incorporate human guidance terms, which alleviates the inconsistency of the updating direction of actor and critic networks. A hybrid experience replay buffer including both dangerous and safe transitions is also used to facilitate the learning process towards human actions. Comparative simulation results on a modified IEEE 123-bus unbalanced distribution system demonstrate the effectiveness and superiority of the proposed method in voltage control. © 2023 IEEE.
| Original language | English |
|---|---|
| Pages (from-to) | 2639-2651 |
| Journal | IEEE Transactions on Smart Grid |
| Volume | 15 |
| Issue number | 3 |
| Online published | 30 Nov 2023 |
| DOIs | |
| Publication status | Published - May 2024 |
| Externally published | Yes |
Funding
This work was supported in part by PolyU Research under Grant 1-W29V, Grant 1-YY4T, and Grant G-SB5F; in part by the ARC Research Hub under Grant IH180100020; in part by the ARC Training Centre under Grant IC200100023; in part by the ARC Linkage Project under Grant LP200100056; and in part by ARC under Grant DP220103881. Paper no. TSG-00921-2023.
Research Keywords
- human-in-The-loop
- safe deep reinforcement learning
- soft actor-critic
- unbalanced distribution networks
- Volt/Var control
Fingerprint
Dive into the research topics of 'Optimal Volt/Var Control for Unbalanced Distribution Networks with Human-in-The-Loop Deep Reinforcement Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver