Skip to main navigation Skip to search Skip to main content

Optimal distributed subsampling for expected shortfall regression via Neyman-orthogonal score

  • Xing Li
  • , Lei Wang*
  • , Heng Lian
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Massive data bring a big challenge for analysis, and subsampling as an effective solution can significantly reduce the computational burden and maintain estimation efficiency. Expected Shortfall Regression (ESR) studies the impact of covariates on the tail expectation of response and explores the heterogeneous effects of the covariates. For joint linear quantile and expected shortfall regression models, we study the optimal subsampling method for ESR based on the Neyman-orthogonal score to reduce sensitivity with respect to nuisance parameters in quantile regression. When the massive data are stored in different sites, we further propose a distributed optimal subsampling method for the ESR. Asymptotic properties of the resultant estimators are established and the two-step algorithms are proposed for practical implementation. Extensive simulations and applications to Protein Tertiary Structure and Beijing Air Quality datasets show satisfactory performance of the proposed estimators. © 2025 Elsevier B.V.
Original languageEnglish
Article number113529
JournalKnowledge-Based Systems
Volume318
Online published22 Apr 2025
DOIs
Publication statusPublished - 7 Jun 2025

Funding

The authors are grateful to the Editor, an Associate Editor and three anonymous referees for their insightful comments and suggestions on this article, which have led to significant improvements. This work was supported by National Natural Science Foundation of China (Grant No. 12271272 ).

Research Keywords

  • A-optimality
  • Distributed subsampling
  • Expected shortfall
  • L-optimality
  • Neyman orthogonality

Fingerprint

Dive into the research topics of 'Optimal distributed subsampling for expected shortfall regression via Neyman-orthogonal score'. Together they form a unique fingerprint.

Cite this