TY - JOUR
T1 - Deep Reinforcement Learning-Based Explainable Pricing Policy for Virtual Storage Rental Service
AU - Li, Xiangyu
AU - Liu, Hangyue
AU - Li, Chaojie
AU - Chen, Guo
AU - Zhang, Cuo
AU - Dong, Zhao Yang
PY - 2023/11
Y1 - 2023/11
N2 - The shared community energy storage system (CESS) can reduce energy storage costs by exploiting the complementarity of end-users and economies of scale. To further improve the economic feasibility of the CESS, we propose a novel business model and pricing method for the virtual storage rental service (VSRS). In this model, the rental users aim to minimize the electricity bill by renting the virtual capacity and optimizing its operation, while the CESS operator seeks to maximize the revenue from the combination of energy arbitrage and the VSRS. The pricing problem and the optimal operation problems of the CESS and users' virtual batteries are modeled as a bi-level optimization problem. Next, the proposed problem is solved through transformer-based deep deterministic policy gradient (TDDPG) method and mixed-integer linear programming (MILP) due to the non-convexity and non-continuity of the original problem. The post-hoc interpretability of the policy network is provided based on the Shapley value to reveal the importance of different input features for decision-making. Numerical simulations suggest that the proposed VSRS could benefit the CESS operator and users. Moreover, the explanation based on the Shapley value could effectively generate an implicit solution for understanding the policy network. © 2023 IEEE.
AB - The shared community energy storage system (CESS) can reduce energy storage costs by exploiting the complementarity of end-users and economies of scale. To further improve the economic feasibility of the CESS, we propose a novel business model and pricing method for the virtual storage rental service (VSRS). In this model, the rental users aim to minimize the electricity bill by renting the virtual capacity and optimizing its operation, while the CESS operator seeks to maximize the revenue from the combination of energy arbitrage and the VSRS. The pricing problem and the optimal operation problems of the CESS and users' virtual batteries are modeled as a bi-level optimization problem. Next, the proposed problem is solved through transformer-based deep deterministic policy gradient (TDDPG) method and mixed-integer linear programming (MILP) due to the non-convexity and non-continuity of the original problem. The post-hoc interpretability of the policy network is provided based on the Shapley value to reveal the importance of different input features for decision-making. Numerical simulations suggest that the proposed VSRS could benefit the CESS operator and users. Moreover, the explanation based on the Shapley value could effectively generate an implicit solution for understanding the policy network. © 2023 IEEE.
KW - deep reinforcement learning
KW - explainable pricing policy
KW - Shared community energy storage
KW - virtual storage service
UR - http://www.scopus.com/inward/record.url?scp=85151343736&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85151343736&origin=recordpage
U2 - 10.1109/TSG.2023.3253140
DO - 10.1109/TSG.2023.3253140
M3 - RGC 21 - Publication in refereed journal
SN - 1949-3053
VL - 14
SP - 4373
EP - 4384
JO - IEEE Transactions on Smart Grid
JF - IEEE Transactions on Smart Grid
IS - 6
ER -