Optimal Option Hedging with Policy Gradient

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

3 Citations (Scopus)

Abstract

For the option hedging problem, we propose a reinforcement learning framework to statistically hedge generic financial options under the setting of non-zero risk-free interest rate and transaction cost. Instead of minimizing the risk of final wealth, we maximize the cumulative reward during the whole hedging process to find the optimal hedging strategies. In particular, we define the fundamental elements of reinforcement learning for the hedging problem. To illustrate the formulation, we utilize the Monte Carlo policy gradient with baseline onto both synthetic and realistic data to learn the state-value function and optimal policy function. Empirical evaluations demonstrate the effectiveness and performance of the proposed approach.
Original languageEnglish
Title of host publicationProceedings - 21st IEEE International Conference on Data Mining Workshops, ICDMW 2021
EditorsBing Xue, Mykola Pechenizkiy, Yun Sing Koh
PublisherIEEE Computer Society
Pages1112-1119
ISBN (Electronic)9781665424271
ISBN (Print)9781665424288
DOIs
Publication statusPublished - Dec 2021
Event21st IEEE International Conference on Data Mining (ICDM 2021) - Virtual, Auckland, New Zealand
Duration: 7 Dec 202110 Dec 2021
https://icdm2021.auckland.ac.nz/

Publication series

NameIEEE International Conference on Data Mining Workshops, ICDMW
Volume2021-December
ISSN (Print)2375-9232
ISSN (Electronic)2375-9259

Conference

Conference21st IEEE International Conference on Data Mining (ICDM 2021)
Abbreviated titleIEEE ICDM 2021
PlaceNew Zealand
CityAuckland
Period7/12/2110/12/21
Internet address

Research Keywords

  • Option Hedging
  • Policy Gradient
  • Reinforcement Learning

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Optimal Option Hedging with Policy Gradient'. Together they form a unique fingerprint.

Cite this