TY - JOUR
T1 - Partially linear structure identification in generalized additive models with NP-dimensionality
AU - Lian, Heng
AU - Du, Pang
AU - Li, Yuangzhang
AU - Liang, Hua
PY - 2014/12
Y1 - 2014/12
N2 - Separation of the linear and nonlinear components in additive models based on penalized likelihood has received attention recently. However, it remains unknown whether consistent separation is possible in generalized additive models, and how high dimensionality is allowed. In this article, we study the doubly SCAD-penalized approach for partial linear structure identification problems of non-polynomial (NP) dimensionality and demonstrate its oracle property. In particular, if the number of nonzero components is fixed, the dimensionality of the total number of components can be of order exp{nd/ (2d+1)} where d is the smoothness of the component functions. Under such dimensionality assumptions, we show that with probability approaching one, the proposed procedure can correctly identify the zero, linear, and nonlinear components in the model. We further establish the convergence rate of the estimator for the nonlinear component and the asymptotic normality of the estimator for the linear component. Performance of the proposed method is evaluated by simulation studies. The methods are demonstrated by analyzing a gene data set.
AB - Separation of the linear and nonlinear components in additive models based on penalized likelihood has received attention recently. However, it remains unknown whether consistent separation is possible in generalized additive models, and how high dimensionality is allowed. In this article, we study the doubly SCAD-penalized approach for partial linear structure identification problems of non-polynomial (NP) dimensionality and demonstrate its oracle property. In particular, if the number of nonzero components is fixed, the dimensionality of the total number of components can be of order exp{nd/ (2d+1)} where d is the smoothness of the component functions. Under such dimensionality assumptions, we show that with probability approaching one, the proposed procedure can correctly identify the zero, linear, and nonlinear components in the model. We further establish the convergence rate of the estimator for the nonlinear component and the asymptotic normality of the estimator for the linear component. Performance of the proposed method is evaluated by simulation studies. The methods are demonstrated by analyzing a gene data set.
KW - Model structure identification
KW - NP-dimensionality
KW - Partially linear structure
KW - Polynomial splines
KW - Quasi-likelihood
UR - http://www.scopus.com/inward/record.url?scp=84905195839&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-84905195839&origin=recordpage
U2 - 10.1016/j.csda.2014.06.021
DO - 10.1016/j.csda.2014.06.021
M3 - RGC 21 - Publication in refereed journal
SN - 0167-9473
VL - 80
SP - 197
EP - 208
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
ER -