TY - JOUR
T1 - Progressive Hybrid Classifier Ensemble for Imbalanced Data
AU - Yang, Kaixiang
AU - Yu, Zhiwen
AU - Chen, C. L. Philip
AU - Cao, Wenming
AU - Wong, Hau-San
AU - You, Jane
AU - Han, Guoqiang
PY - 2022/4
Y1 - 2022/4
N2 - The class imbalance problem has posed a leading challenge in real-world applications. Traditional methods focus on either the data level or algorithm level to solve the binary classification problem on imbalanced data, and seldom consider searching an effective transformation for classification. Besides, the undersampling process adopted in them is always subjective and unilateral. To address the above issues, we first propose a hybrid classifier ensemble (HCE) framework to conduct binary imbalanced data classification, which mainly includes a metric-based data space transformation (MDST) and an adaptive two-stage undersampling process (ATUP). The MDST aims to find a more appropriate embedding space for original imbalance data sets, and the ATUP considers both informative and representative samples to generate balanced data sets. Furthermore, we design a progressive HCE (PHCE) framework to improve the performance of HCE by utilizing a progressive mechanism with local and global evaluation criteria to select ensemble members. Extensive comparative experiments conducted on 28 real-world data sets exhibit that our method PHCE outperforms the majority of imbalance ensemble classification approaches.
AB - The class imbalance problem has posed a leading challenge in real-world applications. Traditional methods focus on either the data level or algorithm level to solve the binary classification problem on imbalanced data, and seldom consider searching an effective transformation for classification. Besides, the undersampling process adopted in them is always subjective and unilateral. To address the above issues, we first propose a hybrid classifier ensemble (HCE) framework to conduct binary imbalanced data classification, which mainly includes a metric-based data space transformation (MDST) and an adaptive two-stage undersampling process (ATUP). The MDST aims to find a more appropriate embedding space for original imbalance data sets, and the ATUP considers both informative and representative samples to generate balanced data sets. Furthermore, we design a progressive HCE (PHCE) framework to improve the performance of HCE by utilizing a progressive mechanism with local and global evaluation criteria to select ensemble members. Extensive comparative experiments conducted on 28 real-world data sets exhibit that our method PHCE outperforms the majority of imbalance ensemble classification approaches.
KW - Adaptive undersampling
KW - binary classification
KW - imbalanced learning
KW - metric learning
KW - progressive ensemble
UR - http://www.scopus.com/inward/record.url?scp=85100800167&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85100800167&origin=recordpage
U2 - 10.1109/TSMC.2021.3051138
DO - 10.1109/TSMC.2021.3051138
M3 - RGC 21 - Publication in refereed journal
SN - 2168-2216
VL - 52
SP - 2464
EP - 2478
JO - IEEE Transactions on Systems, Man, and Cybernetics: Systems
JF - IEEE Transactions on Systems, Man, and Cybernetics: Systems
IS - 4
ER -