Spline estimator for simultaneous variable selection and constant coefficient identification in high-dimensional generalized varying-coefficient models

Heng Lian, Jie Meng, Kaifeng Zhao*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

7 Citations (Scopus)

Abstract

In this paper, we are concerned with two common and related problems for generalized varying-coefficient models, variable selection and constant coefficient identification. Starting with a specification of generalized varying-coefficient models assuming possible nonlinear interactions between the index variable and all other predictors, we propose a polynomial-spline based procedure that simultaneously eliminates irrelevant predictors and identifies predictors that do not interact with the index variable. Our approach is based on a double-penalization strategy where two penalty functions are used for these two related purposes respectively, in a single functional. In a "large p, small n" setting, we demonstrate the convergence rates of the estimator under suitable regularity assumptions. Based on its previous success on parametric models, we use the extended Bayesian information criterion (eBIC) to automatically choose the regularization parameters. Finally, post-penalization estimator is proposed to further reduce the bias of the resulting estimator. Monte Carlo simulations are conducted to examine the finite sample performance of the proposed procedures and an application to a leukemia dataset is presented.
Original languageEnglish
Pages (from-to)81-103
JournalJournal of Multivariate Analysis
Volume141
DOIs
Publication statusPublished - 4 Jul 2015
Externally publishedYes

Research Keywords

  • B-spline basis
  • Diverging parameters
  • Group lasso
  • Quasi-likelihood

Fingerprint

Dive into the research topics of 'Spline estimator for simultaneous variable selection and constant coefficient identification in high-dimensional generalized varying-coefficient models'. Together they form a unique fingerprint.

Cite this