Skip to main navigation Skip to search Skip to main content

Exploring a Reinforcement Learning Agent with Improved Prioritized Experience Replay for a Confrontation Game

  • Tian Zhao*
  • *Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Deep Q-network (DQN) is used successfully in dealing with many reinforcement learning situations and challenging tasks with real-world complexity. The current limits are the unacceptable training time to obtain satisfactory results like a human. To address this obstacle, I propose a new reinforcement learning strategy. This paper focuses on the confrontation game environment for two players with sparse reward and no direct hindsight reward function and no fixed goals. According to some strategies, algorithm can put them into reinforcement learning with reward functions and replay to give the abilities of judging in the middle of the games as references. To demonstrate the effectiveness of the proposed strategy, a new game is designed. Fence game is a confrontation game for two players that one tries their best to fence the other one in Die ow. The custom environment of this game will give the only reward functions at the end: win, lose or draw. In conclusion, these factors include performance and results proved that 1) Prioritized Experience Replay with Dynamic Hindsight reward function (DH-PER) and 2) Prioritized Experience Replay with Dynamic Hindsight reward function and Sharing (DHS-PER) both let the RL agents converge more quickly.
Original languageEnglish
Title of host publicationPROCEEDINGS - 2022 International Conference on Big Data, Information and Computer Network
Subtitle of host publicationBDICN 2022
PublisherIEEE
Pages373-381
ISBN (Electronic)9781665484763
ISBN (Print)978-1-6654-8477-0
DOIs
Publication statusPublished - 2022
Event2022 International Conference on Big Data, Information and Computer Network, BDICN 2022 - Sanya, China
Duration: 20 Jan 202222 Jan 2022

Publication series

NameProceedings - International Conference on Big Data, Information and Computer Network, BDICN

Conference

Conference2022 International Conference on Big Data, Information and Computer Network, BDICN 2022
PlaceChina
CitySanya
Period20/01/2222/01/22

Research Keywords

  • Deep Q-network (DQN)
  • Dynamic Hindsight Experience Replay (DHER)
  • experience replay
  • experience sharing
  • Hindsight Experience Replay (HER)
  • Prioritized Experience Replay (PER)
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Exploring a Reinforcement Learning Agent with Improved Prioritized Experience Replay for a Confrontation Game'. Together they form a unique fingerprint.

Cite this