Skip to main navigation Skip to search Skip to main content

Learn to explain the smile: An interpretable hybrid machine learning model to understand the implied volatility of CSI 300 options

Pengshi Li, Jinbo Huang*, Yan Lin

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

We propose an interpretable hybrid machine learning framework for forecasting and explaining implied volatility surface dynamics of CSI 300 index options. Our methodology leverages machine learning to correct a theory-based baseline model. Initial predictions are derived from an analytical model, while the second stage involves a machine learning model trained on the residuals of the first stage. We construct three variants of hybrid models using XGBoost: a baseline three-feature model, a VIX-augmented four-feature model, and a five-feature model incorporating a newly developed options-implied ambiguity index. Empirical results using 2019–2025 CSI 300 options data show that the five-feature model significantly outperforms both the analytical benchmark and VIX-only model. Performance improvements are especially pronounced in market rallies and high-ambiguity regimes, where ambiguity attenuates implied volatility compression and amplifies perceptions of downside risk. We further use SHAP value analysis to demonstrate that feature effects are economically coherent and state-dependent. Our findings confirm that ambiguity is a distinct and quantitatively meaningful risk factor for explaining implied volatility dynamics in emerging market. © 2025 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
Original languageEnglish
Article number103038
Number of pages19
JournalPacific-Basin Finance Journal
Volume96
Online published12 Dec 2025
DOIs
Publication statusPublished - Feb 2026

Funding

The authors are grateful to the Editor and anonymous reviewers. This work is supported by the National Natural Science Foundation of China (No. 72371079, 71971068); the Natural Science Foundation of Guangdong Province of China (No. 2023B1515020045); Humanities and Social Science Planning Fund from Ministry of Education (No. 23YJA790042); Innovative School Strengthening Project (No. 2023WTSCX094); the 2035 Plan of Social Science Foundation of Shenzhen University (No. ZYZD2302); Science and Technology Partnership Program, Ministry of Science and Technology of China. The contents of this paper are the sole responsibility of the authors.

Research Keywords

  • Ambiguity
  • CSI 300 options
  • Implied volatility surface
  • Interpretable machine learning

Fingerprint

Dive into the research topics of 'Learn to explain the smile: An interpretable hybrid machine learning model to understand the implied volatility of CSI 300 options'. Together they form a unique fingerprint.

Cite this