Ensemble-based Deep Reinforcement Learning for robust cooperative wind farm control

Binghao He, Huan Zhao, Gaoqi Liang, Junhua Zhao*, Jing Qiu, Zhao Yang Dong

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

17 Citations (Scopus)

Abstract

The wake effect is the major obstacle to reaching the maximum power generation for wind farms, since choosing the suitable wake model that satisfies both computational cost and accuracy is a difficult task. Deep Reinforcement Learning (DRL) is a powerful data-driven method that can learn the optimal control policy without modeling the environment. However, the “trial and error” mechanism of DRL may cause high costs during the learning process. To address this issue, we propose an ensemble-based DRL wind farm control framework. Under this framework, a new algorithm called Actor Bagging Deep Deterministic Policy Gradient (AB-DDPG) is proposed, which combines the actor-network bagging method with the Deep Deterministic Policy Gradient. The gradient of the proposed method is proved to be consistent with the DDPG method. The experiment results in WFSim show that AB-DDPG can learn the optimal control policy with lower learning cost and a more robust learning process. © 2022 Published by Elsevier Ltd.
Original languageEnglish
Article number108406
JournalInternational Journal of Electrical Power and Energy Systems
Volume143
Online published4 Jul 2022
DOIs
Publication statusPublished - Dec 2022
Externally publishedYes

Research Keywords

  • Deep deterministic policy gradient
  • Deep reinforcement learning
  • Ensemble learning
  • Learning cost
  • Wind farm control

Fingerprint

Dive into the research topics of 'Ensemble-based Deep Reinforcement Learning for robust cooperative wind farm control'. Together they form a unique fingerprint.

Cite this