A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 8-19 |
Number of pages | 12 |
Journal / Publication | Knowledge-Based Systems |
Volume | 126 |
Online published | 12 Apr 2017 |
Publication status | Published - 15 Jun 2017 |
Externally published | Yes |
Link(s)
Abstract
When mining in high dimensional data, the curse of dimensionality is one of the major difficulty to overcome. In this paper, a weighted feature selection strategy is developed and embedded in bacterial based algorithms to reduce the feature dimension in classification. The proposed weighted feature selection strategy distinguishes the features by their classification performances as well as the occurrence frequency in population according to the two matrices. The objectives of minimizing the number of features, maximizing the performance, and minimizing the computational cost are all considered. Regarding the drawback of bacterial based algorithms, Bacterial Colony Optimization based feature selection algorithm is proposed to decrease the computational complexity as well as improve the search ability even in discrete optimization problems. To test the effectiveness of the proposed feature selection method, four bacterial based methods with the weighted strategy embedded have been compared with four classical feature selection methods and three well-known population based algorithms using 15 cancer micro-array datasets with different numbers of features and classes. The results show that the weighted feature selection strategies embedded have improved the feature selection capability of bacterial algorithms. The new proposed mechanisms embedded in Bacterial Colony Optimization method can overcome the limitation of the traditional bacterial based algorithms using premature termination to decrease the computational time, and provide comparable or in most cases better solutions than other feature selection methods considered in the comparison.
Research Area(s)
- Feature selection, Bacterial Colony Optimization, Bacterial Foraging Optimization, Cancer classification, FEATURE SUBSET, OPTIMIZATION, INFORMATION, REDUNDANCY, FRAMEWORK, RELEVANCE
Citation Format(s)
A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data. / Wang, Hong; Jing, Xingjian; Niu, Ben.
In: Knowledge-Based Systems, Vol. 126, 15.06.2017, p. 8-19.
In: Knowledge-Based Systems, Vol. 126, 15.06.2017, p. 8-19.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review