Abstract
Massive data bring a big challenge for analysis, and subsampling as an effective solution can significantly reduce the computational burden and maintain estimation efficiency. Expected Shortfall Regression (ESR) studies the impact of covariates on the tail expectation of response and explores the heterogeneous effects of the covariates. For joint linear quantile and expected shortfall regression models, we study the optimal subsampling method for ESR based on the Neyman-orthogonal score to reduce sensitivity with respect to nuisance parameters in quantile regression. When the massive data are stored in different sites, we further propose a distributed optimal subsampling method for the ESR. Asymptotic properties of the resultant estimators are established and the two-step algorithms are proposed for practical implementation. Extensive simulations and applications to Protein Tertiary Structure and Beijing Air Quality datasets show satisfactory performance of the proposed estimators. © 2025 Elsevier B.V.
| Original language | English |
|---|---|
| Article number | 113529 |
| Journal | Knowledge-Based Systems |
| Volume | 318 |
| Online published | 22 Apr 2025 |
| DOIs | |
| Publication status | Published - 7 Jun 2025 |
Funding
The authors are grateful to the Editor, an Associate Editor and three anonymous referees for their insightful comments and suggestions on this article, which have led to significant improvements. This work was supported by National Natural Science Foundation of China (Grant No. 12271272 ).
Research Keywords
- A-optimality
- Distributed subsampling
- Expected shortfall
- L-optimality
- Neyman orthogonality
Fingerprint
Dive into the research topics of 'Optimal distributed subsampling for expected shortfall regression via Neyman-orthogonal score'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver