Abstract
Recently, many evolutionary computation methods have been developed to solve the feature selection problem. However, the studies focused mainly on small-scale issues, resulting in stagnation issues in local optima and numerical instability when dealing with large-scale feature selection dilemmas. To address these challenges, this paper proposes a novel weighted differential evolution algorithm based on self-adaptive mechanism, named SaWDE, to solve large-scale feature selection. First, a multi-population mechanism is adopted to enhance the diversity of the population. Then, we propose a new self-adaptive mechanism that selects several strategies from a strategy pool to capture the diverse characteristics of the datasets from the historical information. Finally, a weighted model is designed to identify the important features, which enables our model to generate the most suitable feature-selection solution. We demonstrate the effectiveness of our algorithm on twelve large-scale datasets. The performance of SaWDE is superior compared to six non-EC algorithms and six other EC algorithms, on both training and test datasets and on subset size, indicating that our algorithm is a favorable tool to solve the large-scale feature selection problem. Moreover, we have experimented SaWDE with six EC algorithms on twelve higher-dimensional data, which demonstrates that SaWDE is more robust and efficient compared to those state-of-the-art methods. SaWDE source code is available on Github at https://github.com/wangxb96/SaWDE.
| Original language | English |
|---|---|
| Article number | 107633 |
| Journal | Knowledge-Based Systems |
| Volume | 235 |
| Online published | 28 Oct 2021 |
| DOIs | |
| Publication status | Published - 10 Jan 2022 |
Research Keywords
- Classification
- Differential evolution
- Feature selection
- High-dimensional data
- Multi-population
- Self-adaptive
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'A self-adaptive weighted differential evolution approach for large-scale feature selection'. Together they form a unique fingerprint.Projects
- 2 Finished
-
HMRF: Development of Big Data Tools for High-Throughput Sequencing Data with Applications to Colorectal Cancer Genomes
WONG, K. C. (Principal Investigator / Project Coordinator) & WANG, X. (Co-Investigator)
1/09/20 → 13/11/23
Project: Research
-
GRF: Heterodimeric DNA Motif Synthesis and Validations
WONG, K. C. (Principal Investigator / Project Coordinator) & SONG, Y. Q. (Co-Investigator)
1/12/18 → 29/11/22
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver