TY - GEN
T1 - UserSim
T2 - 30th World Wide Web Conference, WWW 2021
AU - Zhao, Xiangyu
AU - Xia, Long
AU - Zou, Lixin
AU - Liu, Hui
AU - Yin, Dawei
AU - Tang, Jiliang
N1 - Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).
PY - 2021/4
Y1 - 2021/4
N2 - With the recent advances in Reinforcement Learning (RL), there have been tremendous interests in employing RL for recommender systems. However, directly training and evaluating a new RL-based recommendation algorithm needs to collect users' real-time feedback in the real system, which is time/effort consuming and could negatively impact users' experiences. Thus, it calls for a user simulator that can mimic real users' behaviors to pre-train and evaluate new recommendation algorithms. Simulating users' behaviors in a dynamic system faces immense challenges - (i) the underlying item distribution is complex, and (ii) historical logs for each user are limited. In this paper, we develop a user simulator based on a Generative Adversarial Network (GAN). To be specific, the generator captures the underlying distribution of users' historical logs and generates realistic logs that can be considered as augmentations of real logs; while the discriminator not only distinguishes real and fake logs but also predicts users' behaviors. The experimental results based on benchmark datasets demonstrate the effectiveness of the proposed simulator.
AB - With the recent advances in Reinforcement Learning (RL), there have been tremendous interests in employing RL for recommender systems. However, directly training and evaluating a new RL-based recommendation algorithm needs to collect users' real-time feedback in the real system, which is time/effort consuming and could negatively impact users' experiences. Thus, it calls for a user simulator that can mimic real users' behaviors to pre-train and evaluate new recommendation algorithms. Simulating users' behaviors in a dynamic system faces immense challenges - (i) the underlying item distribution is complex, and (ii) historical logs for each user are limited. In this paper, we develop a user simulator based on a Generative Adversarial Network (GAN). To be specific, the generator captures the underlying distribution of users' historical logs and generates realistic logs that can be considered as augmentations of real logs; while the discriminator not only distinguishes real and fake logs but also predicts users' behaviors. The experimental results based on benchmark datasets demonstrate the effectiveness of the proposed simulator.
KW - Generative Adversarial Network
KW - Recommender System
KW - Reinforcement Learning
KW - User Simulation
UR - http://www.scopus.com/inward/record.url?scp=85107918995&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85107918995&origin=recordpage
U2 - 10.1145/3442381.3450125
DO - 10.1145/3442381.3450125
M3 - RGC 32 - Refereed conference paper (with host publication)
SN - 9781450383127
T3 - The Web Conference - Proceedings of the World Wide Web Conference, WWW
SP - 3582
EP - 3589
BT - The Web Conference 2021
PB - Association for Computing Machinery
Y2 - 19 April 2021 through 23 April 2021
ER -