Skip to main navigation Skip to search Skip to main content

Deep reinforcement learning for HVAC control with nonlinear parametric thermal network modeling for passive building envelopes

Yu Lu, Wenqi Wang, Chuyao Wang, Ze Li, Yiying Zhou, Xu Chen, Tsz Chung Ho, Chi Yan Tso*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

The incorporation of passive cooling envelopes into buildings can effectively reduce energy consumption. However, due to their limited cooling capacity, HVAC systems are still required to maintain indoor thermal comfort. In buildings using passive radiative cooling roofs and thermochromic windows as passive cooling envelopes, inappropriate HVAC control strategies are more likely to occur due to the changeable optical and thermal properties. Such improper HVAC control results in significant waste during system operation, which remains an unsolved problem. Therefore, optimizing HVAC operation in passively cooled buildings is essential not only for ensuring thermal comfort but also for further reducing energy consumption. Achieving both objectives depends on effectively capturing the building's thermal behavior and efficient control methods. To incorporate the thermal behavior of passive cooling envelopes into the HVAC control system, this study first develops a resistance-capacitance thermal network based on a modified matrix to predict the thermal behavior of passively cooled buildings. Then, a model-based policy optimization deep reinforcement learning (DRL) control method is proposed to enhance HVAC system performance in such buildings. The results show that the modified matrix significantly improves the prediction accuracy of the thermal behavior of passively cooled buildings compared to current global identification methods, which increases coefficient of determination from 0.90, 0.73, 0.88 to 0.94, 0.92, 0.95 for radiative cooling roofs, thermochromic windows, and a combination of both envelopes, respectively. Moreover, the proposed DRL control method can reduce building energy consumption by 17.7 % for radiative cooling roofs, 10.6 % for thermochromic windows, and 21.1 % when both strategies are applied simultaneously, compared to the baseline control method. This study provides valuable insights into optimization of HVAC system operations in buildings equipped with passive cooling envelopes. © 2025 Elsevier Ltd.
Original languageEnglish
Article number126863
Number of pages19
JournalApplied Energy
Volume402
Issue numberPart A
Online published11 Oct 2025
DOIs
Publication statusPublished - 15 Dec 2025

Funding

This work was supported by the Hong Kong Research Grant Council (RGC) via Research Fellow Scheme with the project number of RFS2425-1S06 as well as via the Strategic Topics Grant (STG) account STG2/E-605/23-N. This work was also supported by the Hong Kong Innovation and Technology Commission (ITC) via Research, Academic and Industry Sectors One-plus Scheme (RAISe+) with the project number of RAI/23/1/007 A. In addition, this work was supported by a donation for a research project grant at City University of Hong Kong from Pacific Enterprise Solutions Limited, under project account number of 9220166.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Research Keywords

  • Building energy efficiency
  • Building-integrated
  • Deep reinforcement learning
  • Thermal network

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Deep reinforcement learning for HVAC control with nonlinear parametric thermal network modeling for passive building envelopes'. Together they form a unique fingerprint.

Cite this