Ultra-High Dimensional Single-Index Quantile Regression
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Article number | 224 |
Journal / Publication | Journal of Machine Learning Research |
Volume | 21 |
Publication status | Published - Nov 2020 |
Link(s)
Document Link | Links
|
---|---|
Link to Scopus | https://www.scopus.com/record/display.uri?eid=2-s2.0-85098461407&origin=recordpage |
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(2ea62c77-875a-4051-b6ef-2bf56ba6ae80).html |
Abstract
We consider a flexible semiparametric single-index quantile regression model where the number of covariates may be ultra-high dimensional, and the number of the relevant covariates is potentially diverging. The approach is particularly appealing to uncover the complex heterogeneity in high-dimensional data, incorporate nonlinearity and potential interaction, avoid the curse of dimensionality, and allow different variables to be included at different quantile levels. We estimate the unknown function via polynomial splines nonparametrically and adopt a nonconvex penalty function to identify the sparse variable set. We further extend it to partially linear single-index quantile model where both the single-index components in the nonparametric term and the partially linear component scan be in ultra-high dimension. However, a number of major challenges arise in developing both theory and computation: (a) The model is highly nonlinear in single-index coefficients because the high-dimensional single-index covariates are embedded inside the unknown flexible function. (b) The data are ultra-high dimensional where the dimension of the single-index covariates (Pn) is diverging or even in the exponential order of sample size n. (c) The objective function is non-smooth for quantile regression. (d) Nonconvex variable selection such as SCAD is adopted for regularization. (e) The extended partially linear single-index quantile models may include both ultra-high dimensional (Pn) single-index covariates and ultra-high dimensional (qn) partially linear covariates. We develop a novel approach using empirical process techniques in establishing the theoretical properties of the nonconvex penalized estimators for partially linear single-index quantile models and show those estimators indeed possess the oracle property in ultra-high dimensional setting. We propose an efficient algorithm to circumvent the computational challenges. The results of Monte Carlo simulations and an application to gene expression data demonstrate the effectiveness of the proposed models and estimation method.
Research Area(s)
- High-dimensional data, Nonparametric, Oracle property, SCAD, Variable selection
Citation Format(s)
Ultra-High Dimensional Single-Index Quantile Regression. / Zhang, Yuankun; Lian, Heng; Yu, Yan.
In: Journal of Machine Learning Research, Vol. 21, 224, 11.2020.
In: Journal of Machine Learning Research, Vol. 21, 224, 11.2020.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review