A home energy management approach using decoupling value and policy in reinforcement learning

Luolin Xiong, Yang Tang*, Chensheng Liu, Shuai Mao, Ke Meng, Zhaoyang Dong, Feng Qian*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

5 Citations (Scopus)

Abstract

Considering the popularity of electric vehicles and the flexibility of household appliances, it is feasible to dispatch energy in home energy systems under dynamic electricity prices to optimize electricity cost and comfort residents. In this paper, a novel home energy management (HEM) approach is proposed based on a data-driven deep reinforcement learning method. First, to reveal the multiple uncertain factors affecting the charging behavior of electric vehicles (EVs), an improved mathematical model integrating driver’s experience, unexpected events, and traffic conditions is introduced to describe the dynamic energy demand of EVs in home energy systems. Second, a decoupled advantage actor-critic (DA2C) algorithm is presented to enhance the energy optimization performance by alleviating the overfitting problem caused by the shared policy and value networks. Furthermore, separate networks for the policy and value functions ensure the generalization of the proposed method in unseen scenarios. Finally, comprehensive experiments are carried out to compare the proposed approach with existing methods, and the results show that the proposed method can optimize electricity cost and consider the residential comfort level in different scenarios. © 2023, Zhejiang University Press.
Original languageEnglish
Pages (from-to)1261-1272
JournalFrontiers of Information Technology and Electronic Engineering
Volume24
Issue number9
Online published10 Aug 2023
DOIs
Publication statusPublished - Sept 2023
Externally publishedYes

Research Keywords

  • Electric vehicle
  • Generalization
  • Home energy system
  • Reinforcement learning
  • TP181

Fingerprint

Dive into the research topics of 'A home energy management approach using decoupling value and policy in reinforcement learning'. Together they form a unique fingerprint.

Cite this