A model-based reinforcement learning approach for maintenance optimization of degrading systems in a large state space

Ping Zhang, Xiaoyan Zhu*, Min Xie

*Corresponding author for this work

Research output: Journal Publications and Reviews; RGC 21 - Publication in refereed journal; peer-review

38 Citations (Scopus)

Abstract

Maintenance scheduling for deteriorating systems has usually been built on degradation models. However, the formulas governing the degradation process are often unknown and hard to determine for a system operating in practice. In this study, we develop a model-based reinforcement learning approach for maintenance optimization. The approach determines a maintenance action for each degradation state at each inspection time over a finite planning horizon, whether the degradation formula is known or unknown. At each inspection time, the approach attempts to learn an optimal assessment value for each maintenance action that can be performed at each degradation state. The assessment value quantifies how good each state-action pair is in terms of minimizing the accumulated maintenance cost over the planning horizon. To optimize the assessment values when a well-defined degradation formula is known, we customize a Q-learning method with model-based acceleration. When the degradation formula is unknown or hard to determine, we develop a Dyna-Q method with maintenance-oriented improvements: an environment model capturing the degradation pattern under different maintenance actions is learned first, and the assessment values are then optimized while accounting for the stochastic behavior of the system degradation. The final maintenance policy is obtained by performing, at each state, the maintenance action with the highest assessment value. Experimental studies are presented to illustrate the applications.
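To make the Dyna-Q idea in the abstract concrete, the following is a minimal tabular sketch on a toy maintenance problem. It is not the authors' implementation and omits their maintenance-oriented improvements. The simulator `step`, the five degradation levels, the two actions, the cost values, and all hyperparameters are assumptions chosen only for illustration. Rewards are negative costs, so the highest assessment value corresponds to the lowest expected accumulated cost.

```python
import random

N_STATES = 5       # degradation levels 0 (as new) .. 4 (failed); assumed
ACTIONS = (0, 1)   # 0 = do nothing, 1 = replace; assumed action set
HORIZON = 10       # number of inspection times; assumed


def step(state, action, rng):
    """Hypothetical degradation simulator; returns (next_state, reward)."""
    cost = 0.0
    if action == 1:              # replacement restores the as-new state
        state, cost = 0, 5.0
    if state == N_STATES - 1:    # running in the failed state is penalized
        cost += 20.0
    next_state = min(state + rng.choice((0, 1, 1, 2)), N_STATES - 1)
    return next_state, -cost     # reward is the negative cost


def update(Q, t, s, a, r, s2, alpha):
    """Undiscounted finite-horizon Q-learning update; Q[HORIZON] stays 0."""
    target = r + max(Q[t + 1][s2])
    Q[t][s][a] += alpha * (target - Q[t][s][a])


def dyna_q(episodes=500, planning_steps=10, alpha=0.1, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    # Q[t][s][a]: assessment value of action a in state s at inspection time t
    Q = [[[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
         for _ in range(HORIZON + 1)]
    model = {}  # learned model: (state, action) -> observed (next_state, reward)
    for _ in range(episodes):
        s = 0
        for t in range(HORIZON):
            if rng.random() < epsilon:                      # explore
                a = rng.choice(ACTIONS)
            else:                                           # exploit
                a = max(ACTIONS, key=lambda x: Q[t][s][x])
            s2, r = step(s, a, rng)
            update(Q, t, s, a, r, s2, alpha)                # direct learning
            model.setdefault((s, a), []).append((s2, r))    # model learning
            for _ in range(planning_steps):                 # planning
                ps, pa = rng.choice(list(model))
                ps2, pr = rng.choice(model[(ps, pa)])       # sampled outcome
                pt = rng.randrange(HORIZON)                 # any inspection time
                update(Q, pt, ps, pa, pr, ps2, alpha)
            s = s2
    return Q


def greedy_policy(Q):
    """Pick the action with the highest assessment value for each (t, s)."""
    return [[max(ACTIONS, key=lambda a: Q[t][s][a]) for s in range(N_STATES)]
            for t in range(HORIZON)]


if __name__ == "__main__":
    for t, row in enumerate(greedy_policy(dyna_q())):
        print(t, row)
```

Because the assumed degradation dynamics do not depend on the inspection time, the learned model can replay a stored transition at any time index during planning. This lets the sketch fill in assessment values for state-time pairs that real episodes rarely visit. Sampling stored outcomes at random is a simple way to reflect stochastic degradation in the planning step.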
Original language: English
Article number: 107622
Journal: Computers and Industrial Engineering
Volume: 161
Online published: 17 Aug 2021
DOIs
Publication status: Published - Nov 2021

Research Keywords

  • Degrading system
  • Maintenance optimization
  • Model-based reinforcement learning
  • Periodic inspection
