Skip to main navigation Skip to search Skip to main content

Optimal Volt/Var Control for Unbalanced Distribution Networks with Human-in-The-Loop Deep Reinforcement Learning

  • Xianzhuo Sun
  • , Zhao Xu*
  • , Jing Qiu*
  • , Huichuan Liu
  • , Huayi Wu
  • , Yuechuan Tao
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

This paper proposes a human-in-The-loop deep reinforcement learning (HL-DRL)-based VVC strategy to simultaneously reduce power losses, mitigate voltage violations and compensate for voltage unbalance in three-phase unbalanced distribution networks. Instead of fully trusting DRL actions made by deep neural networks, a human intervention module is proposed to modify dangerous actions that violate operation constraints during offline training. This module refers to well-designed human guidance rules based on voltage-reactive power sensitivities, which regulate PV reactive power to sequentially address local voltage violation and unbalance issues to obtain safe transitions. To efficiently and safely learn the optimal control policy from these training samples, a human-in-The-loop soft actor-critic (HL-SAC) solution method is then developed. Different from the standard SAC algorithm, an online switch mechanism between action exploration and human intervention is designed. The actor network loss function is modified to incorporate human guidance terms, which alleviates the inconsistency of the updating direction of actor and critic networks. A hybrid experience replay buffer including both dangerous and safe transitions is also used to facilitate the learning process towards human actions. Comparative simulation results on a modified IEEE 123-bus unbalanced distribution system demonstrate the effectiveness and superiority of the proposed method in voltage control. © 2023 IEEE.
Original languageEnglish
Pages (from-to)2639-2651
JournalIEEE Transactions on Smart Grid
Volume15
Issue number3
Online published30 Nov 2023
DOIs
Publication statusPublished - May 2024
Externally publishedYes

Funding

This work was supported in part by PolyU Research under Grant 1-W29V, Grant 1-YY4T, and Grant G-SB5F; in part by the ARC Research Hub under Grant IH180100020; in part by the ARC Training Centre under Grant IC200100023; in part by the ARC Linkage Project under Grant LP200100056; and in part by ARC under Grant DP220103881. Paper no. TSG-00921-2023.

Research Keywords

  • human-in-The-loop
  • safe deep reinforcement learning
  • soft actor-critic
  • unbalanced distribution networks
  • Volt/Var control

Fingerprint

Dive into the research topics of 'Optimal Volt/Var Control for Unbalanced Distribution Networks with Human-in-The-Loop Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this