Abstract
Multi-label feature selection has been widely adopted to address multi-label data with high-dimension features. It is critical to calculate label correlations for multi-label feature selection. Existing methods adopt different schemes to calculate label correlations, which obtain different importance of labels. However, there exist two issues regarding these schemes calculating label importance: first, previous schemes cannot predict the whole labels well because they only focus on the most important labels; second, most of important labels have similar classification information corresponding to redundant features. To this end, we use the mutual information metric to obtain different cores regarding label set rather than calculating the importance of labels. Afterwards, we capture features with respect to each core label, finally, obtaining an optimal feature subset. To verify the effectiveness, our method is compared to state-of-the-art multi-label methods on 16 real-world data sets with several evaluating metrics. The results of experiment proves that the proposed method has achieves the best classification performance among all multi-label feature selection methods.
| Original language | English |
|---|---|
| Pages (from-to) | 410-420 |
| Journal | IEEE Access |
| Volume | 11 |
| Online published | 23 Dec 2022 |
| DOIs | |
| Publication status | Published - 2023 |
Research Keywords
- Correlation
- Entropy
- Feature extraction
- feature selection
- information theory
- label importance
- Labeling
- multi-label learning
- Mutual information
- Sports
- Uncertainty
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/