Rough set and scatter search metaheuristic based feature selection for credit scoring

Jue Wang, Abdel-Rahman Hedar, Shouyang Wang, Jian Ma

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

75 Citations (Scopus)

Abstract

As the credit industry has been growing rapidly, credit scoring models have been widely used by the financial industry during this time to improve cash flow and credit collections. However, a large amount of redundant information and features are involved in the credit dataset, which leads to lower accuracy and higher complexity of the credit scoring model. So, effective feature selection methods are necessary for credit dataset with huge number of features. In this paper, a novel approach, called RSFS, to feature selection based on rough set and scatter search is proposed. In RSFS, conditional entropy is regarded as the heuristic to search the optimal solutions. Two credit datasets in UCI database are selected to demonstrate the competitive performance of RSFS consisted in three credit models including neural network model, J48 decision tree and Logistic regression. The experimental result shows that RSFS has a superior performance in saving the computational costs and improving classification accuracy compared with the base classification methods. © 2011 Elsevier Ltd. All rights reserved.
Original languageEnglish
Pages (from-to)6123-6128
JournalExpert Systems with Applications
Volume39
Issue number6
DOIs
Publication statusPublished - May 2012

Research Keywords

  • Credit scoring
  • Feature selection
  • Meta-heuristics
  • Rough set
  • Scatter search

Fingerprint

Dive into the research topics of 'Rough set and scatter search metaheuristic based feature selection for credit scoring'. Together they form a unique fingerprint.

Cite this