Variable screening for survival data in the presence of heterogeneous censoring

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

2 Scopus Citations
View graph of relations

Author(s)

Detail(s)

Original languageEnglish
Pages (from-to)1171-1191
Journal / PublicationScandinavian Journal of Statistics
Volume47
Issue number4
Online published5 Apr 2020
Publication statusPublished - Dec 2020
Externally publishedYes

Abstract

Variable screening for censored survival data is most challenging when both survival and censoring times are correlated with an ultrahigh-dimensional vector of covariates. Existing approaches to handling censoring often make use of inverse probability weighting by assuming independent censoring with both survival time and covariates. This is a convenient but rather restrictive assumption which may be unmet in real applications, especially when the censoring mechanism is complex and the number of covariates is large. To accommodate heterogeneous (covariate-dependent) censoring that is often present in high-dimensional survival data, we propose a Gehan-type rank screening method to select features that are relevant to the survival time. The method is invariant to monotone transformations of the response and of the predictors, and works robustly for a general class of survival models. We establish the sure screening property of the proposed methodology. Simulation studies and a lymphoma data analysis demonstrate its favorable performance and practical utility.

Research Area(s)

  • Gehan-type rank statistics, heterogeneous censoring, high-dimensional survival data, sure screening property, U-statistic, variable screening

Citation Format(s)

Variable screening for survival data in the presence of heterogeneous censoring. / Xu, Jinfeng; Li, Wai Keung; Ying, Zhiliang.
In: Scandinavian Journal of Statistics, Vol. 47, No. 4, 12.2020, p. 1171-1191.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review