Abstract
Despite extensive research on variable selection over the past two decades, few studies exist on variable selection for classification, particularly when no assumptions are made about the model. In this paper, we propose a general variable selection framework for classification by examining the conditional probability. The proposed framework is illustrated by means of support vector machine (SVM) with derivative-induced sparsity, which makes no explicit model assumption, and takes full advantage of the mathematical properties of the reproducing kernel Hilbert space (RKHS). In contrast to many existing methods, our proposed method leads to a convex optimization task, and fully exploits gradient information by using the reproducing property of gradients in smooth RKHSs. The proposed method can also be viewed as a generalization of the classical SVM, and achieves superior empirical performance in sparse classification. Importantly, the estimation consistency and subset selection properties of the proposed method are established. Lastly, the effectiveness of the method is demonstrated using simulated and real-life examples.
| Original language | English |
|---|---|
| Pages (from-to) | 2075-2103 |
| Journal | Statistica Sinica |
| Volume | 30 |
| Issue number | 4 |
| DOIs | |
| Publication status | Published - Oct 2020 |
Research Keywords
- Classification
- gradient learning
- reproducing kernel Hilbert space (RKHS)
- sparsity
- support vector machine (SVM)
- SUPPORT VECTOR MACHINES
- CONSISTENCY
- REGRESSION
- MODELS
- LASSO
Publisher's Copyright Statement
- COPYRIGHT TERMS OF DEPOSITED FINAL PUBLISHED VERSION FILE: Statistica Sinica © 2020 Institute of Statistical Science, Academia Sinica. Use of this article is permitted solely for educational and research purposes. He, X., Lv, S., & Wang, J. (2020). VARIABLE SELECTION FOR CLASSIFICATION WITH DERIVATIVE-INDUCED REGULARIZATION. Statistica Sinica, 30(4), 2075-2103. https://doi.org/10.5705/ss.202018.0086.
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'VARIABLE SELECTION FOR CLASSIFICATION WITH DERIVATIVE-INDUCED REGULARIZATION'. Together they form a unique fingerprint.Projects
- 3 Finished
-
GRF: Latent Factor Modeling of Large-Scale Directed Networks with Covariates and Structures
WANG, J. (Principal Investigator / Project Coordinator)
1/01/20 → 1/08/22
Project: Research
-
GRF: Scalable Kernel-based Variable Selection with Theoretical Guarantee
WANG, J. (Principal Investigator / Project Coordinator)
1/01/19 → 5/08/22
Project: Research
-
GRF: Large-scale Multi-label Classification and Its Application to Unstructured Text Data
WANG, J. (Principal Investigator / Project Coordinator)
1/01/17 → 1/12/20
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver